Software Engineering and Programming Languages

Adopting and Sustaining Microservice-Based Software Development

The microservice approach to software development offers an alternative to the conventional monolith style.

Posted Mar 28 2024

The Initial Decision to Adopt MS Architecture
Converting Existing Monoliths to MSs
Building New MSs and Modifying Existing Ones for New Requirements
Operating MSs and the Architecture
Sustaining a MSs Approach in the Long Haul
Conclusion
Acknowledgments
References

Microservice (MS) has become the latest buzzword in software development,¹^,⁶^,¹⁷ and the popularity of MS-based development is gaining momentum. The MS approach to software development offers an alternative to the conventional monolith style. While benefits of MS-based development over monolith style are clear, industry experts agree that neither style provides an absolute advantage in all situations. Proponents contend an MS approach to software development more readily facilitates mapping organizational changes manifesting from a more dynamic business environment to corresponding IT/IS) changes.

MS represents an architectural style where large, complex applications are composed of a suite of smaller MSs. MSs have been advocated to overcome some of the perils of existing approaches to software development.¹^,⁷^,³⁰ Each MS corresponds to a single function around a business capability. Each encompasses its own data resources and is quickly deployable. Examples range from online bill payment on a banking website to an alternate-flights service on an airline website.

Other benefits of MSs include ease of requirements changes and testing, greater scalability, enhanced technology innovation and heterogeneity (for example, support for different programing languages and platforms), ease of monitoring performance such as service reliability, and support for more process variations.⁴^,⁷^,¹⁸^,²⁹ Tom Killalea highlighted eight dividends of MSs beyond the benefits already accepted by industry experts.¹⁵ Extensive adoption by notable companies such as Netflix and Uber has given further credence to MS-based software development.¹³^,¹⁶^,¹⁹

Thanks to the favorable reviews of MSs, many organizations are now considering adopting them. While benefits are clear, there are significant challenges to adopting and sustaining MS-based software development. Many of these challenges stem from the novelty of the approach and the lack of understanding in both technical and organizational parameters affecting MS-based development. Unless these challenges are well understood, companies are unlikely to reap the full benefits of MSs and be even worse off than before the modernization.

This article identifies key challenges from the initial decision to adopt MSs to the ongoing task of sustaining the new paradigm over the long haul. It aims to provide insights to those considering MS-based software development. This work is supplemented by sections throughout the article labeled “Field Report,” which are based on first-hand experiences of one of the authors, Shahir A. Daya, who has been leading significant MS-based development projects for the past two years at several Fortune 100 companies. (He is responsible for helping the IBM Global Business Services organization adopt the skills necessary to deliver MS-based modernization projects. As such, he teaches several classes on the subject and has coauthored an IBM Redbook as well as IBM’s Microservices Decision Guide.)

The Initial Decision to Adopt MS Architecture

For many, the decision to adopt MS architecture generally emanates from out-of-control codebases in large and complex monoliths.²³ While many applications start small and are expected to fulfill a specific business need, over time they become larger and more complex as new requirements are added and bugs are fixed. Code refactoring to counter the effects of mounting complexity is often pursued, but barriers to refactoring in a monolith setting are well known.¹^,²⁵ Ultimately, the decision to adopt MS architecture rests on the profile of the organization’s application suite.

Although extreme cases of spiraling codebases and escalating complexities are likely to be self-evident, other situations complicate the decision to adopt MSs. In practice, the decision should not be based on transforming all monolithic applications all at once to MSs; rather, the first step should be identifying specific business functionalities and associated code segments as initial candidates for modernizing to MSs. Monolithic applications create enormous bottlenecks when frequent code changes warrant the need to deploy them often.

When applications are already subjected to significant enhancements in a relatively short period of time or are expected to be in the future, management should consider adopting MSs. The case for modernizing to MSs should also be made when monoliths take up greater testing resources than the development effort. When an application is identified as a differentiator in the industry and significant investment has been made for its success, the case ought to be made for refactoring it under MSs. When monolithic applications continue to struggle to meet nonfunctional requirements (NFRs) and user expectations, there are more opportunities to reap the benefits of modernizing to MSs. When a higher value is placed on software scalability, refactoring to MSs allows each MS to be scaled independently of the others that make up the application (vis-à-vis a monolith). While scalability in general is desirable, predicting future scalability requirements is not trivial.

Nonetheless, the biggest challenge for MS proponents is to make the value proposition to upper management. The key to making the case for adoption lies in quantifying the benefits of MS and comparing them with costs of implementing an MS infrastructure, including recruiting and retraining personnel. The decision to modernize to MSs must consider all parameters in a holistic, rather than a piecemeal, fashion. Intangibles such as changes to the organizational culture, where a team of individuals is responsible for each MS from development to operation, must be considered as well.

While the MS model offers greater resiliency in terms of the overall system’s ability to continue to function even when a noncritical MS fails,⁷ it still needs to be programmed into the code.¹² Unlike a single monolith, MSs allow for setting more finely granular service-level agreements (SLAs). When SLAs for an individual MS are considered for the system, however, they become multiplicative instead of additive from an end-user point of view. Hence, quantifying SLAs for an application made up of a suite of MSs depending on each other can be quite challenging.

As companies look for ways to cope effectively with organizational changes that manifest from a demand such as new product development and the corresponding IT/IS changes, MSs offer a novel approach to align organizational and IT/IS changes more seamlessly.⁷^,¹⁸^,²⁹ For many, however, the decision to adopt the MS architecture is not a trivial one, and not all applications are candidates for this modernization.

In comparing monolith and MS approaches to software development, Thomas Hunter, author of Advanced Microservices,¹⁴ asserts “there are various reasons to choose one approach instead of the other, and neither approach is absolutely better or worse than the other.” When a company has a large suite of applications that historically have had one or two deployments a year, there is little reason to make the leap to MSs. In making this important decision to adopt the new paradigm, companies must examine their application portfolio and consider all aforementioned criteria. Over time, the experience will help organizations navigate the decision-making landscape to either adopt MSs or retain the conventional monolith-based development.

Field Report

In conversations with organizations, it is apparent they understand the MS architectural style is complex compared with that of a monolith, and the decision to modernize an application leveraging MSs should not be taken lightly. They also understand there is nothing wrong with a well-modularized monolithic application. Only when they start to see issues in their ability to deliver business capabilities in a timely manner in comparison with their competitors, or when they start to experience nonfunctional issues such as scalability, do they start talking about the possibility of modernizing the application architecture.

Organizations are also starting slowly down this MS path. Many will look at the pain points of a monolithic application and address only those pain points using MSs. For example, a performance hot spot in a monolithic application can be pulled into an MS and scaled independently without needing to scale the rest of the monolith.

Organizations have started to understand that not all their applications be modernized to the MS style and that an entire single application does not have to be modernized. An application can consist of both a monolithic portion delivering some of the functionality as well as a modern MS suite delivering other functionality (that is, coexistence of the old and the new). For instance, a large North American bank is modernizing selected parts of its online banking application to an MS style. This opportunistic and incremental approach is also being used by a large U.S.-based airline to modernize its website, mobile apps, and self-service kiosks. This approach is called the strangler fig pattern.¹⁰

Converting Existing Monoliths to MSs

To realize the benefits of the MS paradigm, companies look for ways to break up their existing monoliths. Practically, it makes sense to identify larger MSs that could later be broken into smaller ones. Our experience is that breaking a larger MS into smaller ones is easier than combining two smaller MSs to create a larger one. Others have made similar observations, that most organizations tend to start with bigger MSs and then split them up into smaller ones.²⁶ The challenge is to develop guidelines for systematically carving MSs out of the monolith.

Segmenting code into units where each represents a single function or business capability is referred to as bounded context, a term that has roots in domain-driven design.³^,⁹ Isolating frequently changing code segments within a monolith serves as the starting point in identifying bounded context for segmenting part of the monolith to create MSs. As alluded to earlier, the code segments that are most used, take up significant testing resources, serve as differentiators, and struggle to meet NFRs and user expectations also provide clues about how to segment a monolith to build MSs.

This transformation exercise requires adherence to conventional principles in strong cohesion and loose coupling. Nonetheless, the goal should never be to convert the entire monolith into MSs. Instead, in an iterative fashion, suitable code segments should be identified and refactored into MSs.

Implementing stopping rules for the monolith-to-MS transformation plays a crucial role in reaching desired objectives without getting carried away by MS mania. Moreover, this should not be an exercise in simply copying a codebase from a monolith and pasting it to an MS. It needs to be looked upon as an opportunity (albeit with certain challenges) to improve that code segment in the form of an MS that provides a specific business function with a well-defined interface.

In most instances, not all the original monolith is converted to MSs. The overhauled monolith now must coexist with newly created MSs. Calls that initially went to the monolith may now have to be rerouted to the corresponding MS. The new ecosystem creates an added challenge that is not present in a solely monolithic environment: The user interface of the existing monolith must be updated to point modernized/new functions to the MS-based application delivering that business function. This coexistence of the monolithic application and the MS-based modernized parts requires sharing context between them, including authentication/authorization information, thereby further exacerbating complexities in software development and operation.

Breaking up a monolith invariably involves segmenting data elements in the enterprise repository to individual MSs. While designers are generally well versed in principles of distributed database design, the new ecosystem consisting of monoliths and MSs requires training in data segmentation that transcends our extant knowledge. In this exercise, the challenge is to avoid data duplication and possibly learn to disregard conventional principles such as data normalization. Once data is partitioned, the monolith must be tested to make sure that it does not break. While each MS containing its own data resources has it benefits, practitioners face significant challenges in partitioning the enterprise database.

MS-based development requires additional levels of testing. First, contract testing refers to testing APIs. When a consumer links to an MS, a contract is formed. This contract defines the inputs, outputs, and performance parameters that need to be thoroughly tested.

Second, chaos testing refers to testing for chaos inherent in the natural world. Generally, this exercise involves defining what is “normal” and testing the system under those conditions before introducing chaotic conditions. More chaotic conditions manifest in the new paradigm because unlike a monolith, which has only two states of availability, some of the MSs in the application might be up and running while the others may not.

To mimic this reality in chaos testing, each MS is brought down deliberately to test its implications on the overall system. This testing also involves intentional introduction of network latency to assess its impact on performance. MSs communicate with each other over the network as opposed to the in-process function calls made in a monolith. Testing circuit breakers¹¹—put in place to decide whether to continue operating under normal conditions or resort to operating under limited capacity—places added complexities on the emerging paradigm.

Field Report

We do not recommend creating a new green-field application using an MS architectural style. It is too complicated to start that way. A monolith-first approach should be used for creating an MVP (minimum viable product) and getting it into the hands of your users. Once the monolith becomes large enough, it can be decomposed into MSs. Most organizations are modernizing existing business-critical monolithic applications using an MS architectural style. These legacy applications have gathered significant technical debt over the many years of code changes that has resulted in loss of agility, impacting time to market of new business capabilities.

Almost all the organizations we work with are using the strangler fig pattern⁵ to “strangle” the monolith and pull out one MS at a time. This approach requires thinking through how the monolith portion and the MS portion coexist. This is not simple, but the advantage of being able to quickly modernize one function of a large application and get it into the hands of the users outweighs the complexity of coexistence. The U.S.-based airline mentioned previously is using this strangler fig approach. The entire transformation may take three-plus years to complete; however, end users start to see transformed business capabilities as soon as the individual products (functions/features) go live, and they continue to see transformed capabilities incrementally during the project.

Building New MSs and Modifying Existing Ones for New Requirements

As new requirements emerge, developers must decide whether to build a new MS or modify an existing one. When modifying an existing MS, the challenge is to preserve the bounded context of that MS (that is, high cohesion of what is included in the MS). Unless developers are trained in the new paradigm, they could resort to what is technically expedient rather than maintaining core principles of MS-based development by focusing on business aspects. For example, adding new code to satisfy a requirement might break the tenet that each MS should support a single function around a business capability. Or lack of due diligence in adding code might result in unintentional dependencies (that is, coupling between the modified MS, other MSs, or even the refactored monolith), thereby stymying the ability to change, test, and deploy it independently and quickly.

The key is to educate developers to handle business requirements in the new paradigm and instill discipline to maintain its fundamental tenets. The technical lead in each development team must verify that creating new MSs and modifying existing ones does not break key assumptions of the emerging paradigm. Conventional techniques such as code reviews need to be expanded to ensure the new configuration does not violate the bounded context. Failure to do so would result in the breakdown of the MS architecture and thwart the realization of its much-touted benefits.

The new paradigm operates at the application level rather than at an enterprise level, as is the case with service-oriented architecture (SOA) and Web services. Hence, there could be some duplication of effort. Although one application team might have created an MS that could be used by a second application team, in the absence of an enterprise-wide repository of MSs, the latter might create an identical MS.

These application-based teams tend to move at a fast pace and tend to do little sharing of codebases among them. The modest sharing that does occur manifests by word of mouth or through personal contacts among developers and project teams. In case one team finds an MS already created by another team useful, the former team tends to create its own “copy” of the existing MS to avoid being dependent on the MS created by the latter team. The team essentially forks the codebase and is now responsible for maintaining its own fork of the MS. The best way to promote sharing, however, is to catalog all MSs.

The addition of new MSs creates new challenges in monitoring them. Developers must define metrics for monitoring performance of MSs. One key question is to define what it means to say that an MS is in “good” standing. Monitoring MSs for their availability, reliability, ability to fulfill SLAs, robustness, and resiliency requires metrics and benchmarking. It might require creating synthetic transactions to test the health of each MS periodically. Performance logs of MSs need to be continually monitored to assess their performance and take corrective action when needed.

Field Report

The first delivery of MSs tends to be good. The team has just started on the MS journey, and everyone involved understands the tenets of MSs and why they are necessary. The developers religiously follow the rules and ensure that the granularity of the MSs is appropriate, and they have a bounded business context.

Once the excitement and glamour of an MS-based architecture is gone, however, some developers start to take shortcuts and violate some of the core tenets of the MS style, such as having a bounded business context and not being coupled to another MS via a database. This is already evident at certain organizations and will become a bigger problem as the MS style is more widely adopted.

Discipline is required, and measures such as architecture and code reviews are necessary to avoid taking on unnecessary technical debt. Keeping track of technical debt during an MS-based transformation project is a good idea. Different technical and business constraints may force teams to take shortcuts that result in more technical debt. The U.S.-based airline mentioned earlier is doing a great job of keeping track of technical debt and is creating user stories in the team’s backlog to pay the debt.

Operating MSs and the Architecture

The MS paradigm creates greater operational complexity because of the numerous moving parts that need to be tracked (vis-à-vis conventional monoliths). Each MS provides a specific business functionality and a suite of MSs must work in concert to provide a more overarching business function. While MS-based development relies on developing autonomous services offering some business functionality, designing decoupled application systems is no easy task. It involves messaging and data exchange among corresponding MSs that must be monitored and managed. Unlike maintaining enterprise databases commonly used with monolith systems, maintaining the availability and consistency of data partitioned into MSs creates additional demands.

Versioning brings significant challenges in the new paradigm. Instead of having to deal with a few versions of a monolith, now developers and analysts must deal with multiple build and interface versions of many MSs.²⁰ As developers create new versions, they must provide support for both prior and new versions. Users need to be informed of differences with older versions in terms of functionality and interface. Some developers choose to list a new MS as experimental, beta, or general.⁷ There must be clear guidelines for informing users in case of discontinued support for older MSs. When the service registry contains multiple versions of an MS, locating the right one for a given application creates additional complications. Versioning strategies and guidelines must be in place.

The network has a greater effect on an application with many MSs as opposed to a single monolith where function calls are in the same running process.⁸ Networks inherently introduce latency issues that could adversely impact the performance of an MS-based application. MSs manifest in the form of distributed systems with calls among them managed via the network. Unlike monoliths, an application consisting of a suite of smaller MSs with messaging among them could amplify network latency.

Running performance tests to identify the biggest sources of latency could hold the key to addressing this challenge. Choosing a platform that helps identify potential bottlenecks could ease network latency. While network latency is unavoidable when dealing with applications composed of MSs, greater insights into the inner workings of MSs would aid in identifying markers on acceptable latency levels.

In addition to technical challenges, companies face obstacles to changing the prevailing culture surrounding monoliths when they begin to develop MSs, where incentives and accountability differ considerably. The new paradigm promotes principles of DevOps teams that are charged with both developing and operating each MS. DevOps correspond to a set of practices that strive to unify software development (Dev) and software operation (Ops).²

Unlike conventional teams based on disciplines such as marketing, DevOps teams are organized by business capabilities. They are responsible from inception to the entire operational life of a product, whether it is small or an end-to-end vertical slice through the product implementing a feature of a larger product that includes all the MSs that are part of it.

Instead of seeking technology-focused solutions in monoliths, these DevOps teams seek business service-focused solutions. Before an MS is implemented, a clear business service (that is, function) that the MS is expected to represent must be delineated. The cultural changes, such as willingness to work with cross-functional teams and to take on expanded roles demanded by the emerging paradigm, are likely to face resistance from some quarters. Moving to a new culture based on the principle “if you build it, you run it and support it” could prove challenging. Unless the incentives are aligned to promote cross-functional collaboration and accept expanded roles as in DevOps teams, some may be reluctant to embrace the organizational changes necessary to facilitate MS-based development.⁷^,²⁸

Field Report

While working at different organizations to deliver MS-based projects, we have found that the organizational challenges are much more difficult than the technical challenges. Putting together a cross-functional team/squad that can handle a vertical slice right from the UI to the back end across all the channels (Web, mobile, and so on) is not an easy task. These individuals are likely to be in different parts of the organization and may need to be moved to overcome some of the barriers. Moving all the individuals who make up a team, so they report to the same HR manager certainly helps. Although they have the same HR manager, team members will take technical directions from different leads based on their given roles in the cross-functional team.

Developers also need to get up to speed with the Ops part of DevOps. The team must accept the fact that they maintain and support the code they write. If the MS they write fails, they will get paged in the middle of the night. They need to implement the necessary monitoring, dashboarding, alerting, and runbook automation to avert such emergency interruptions. We are starting to do this with our clients and are seeing positive results. Developers are taking ownership, and code quality is improving, but developers don’t want to maintain their code forever. It is critical to keep them engaged, so it is important to let individuals move from one team/squad to another, one at a time.

Having strong executive sponsorship at the organization for your MS initiatives is necessary for success. All our clients experiencing positive forward movement in the cultural transformation have strong executive sponsorship.

Sustaining a MSs Approach in the Long Haul

As with any new paradigm, the use of MSs for software development and operation warrants justification on a periodic basis. Top management is likely to insist on evidence of performance gains to furnish ongoing investment in people and MS-related technology. This requires initial benchmarking and continuous performance assessment.

While conventional metrics have relevance, new performance metrics need to be developed and instituted. For example, SLAs for an MS-based application as well as SLAs for each MS must be tracked for compliance. In addition to these traditional nonfunctional requirements, other baselining and tracking metrics include an increase in deployments with the modernized application, reduction in the amount of downtime during deployments, reduction in the number of defects/bugs being reported, decrease in the cost of making updates to add new features, among others.

Moreover, processes must be in place to review and refine applications and MSs within each application. Fundamentally, each MS is expected to represent a single business functionality. Checks and balances are needed to confirm that each MS conforms to this core principle. Generally, companies have not instituted such processes, although there is tremendous value in doing so to attest that the company stays true to itself in embracing the new paradigm. Often when existing MSs are modified, some may contain meta functionality and dependencies (that is, tight coupling) with other MSs—two characteristics that are contrary to guiding principles of MS-based development.

On a periodic basis, each MS should be inspected by a different party from the one that developed it to make sure it complies with key tenets of MS-based development. The same holds true for data embedded in MSs. Without clear focus and discipline, revising existing MSs to add a new requirement or fix a defect might result in unwanted data dependencies between MSs and enterprise databases.

With ongoing pressure to move software products out the door, some developers might be tempted to cut corners and abandon the founding tenets of MSs in the process. The developer pool needs continuous training in maintaining core principles of MS-based development.

Moreover, companies need to continue building talent to develop the next generation of MSs that correspond to a single function around a business capability, encompass their own data resources, and are quickly deployable. However, few college graduates receive training in MS-based software development. These are some of the challenges organizations face in sustaining MS-based development over the long run.

Field Report

Full-stack developers are not easy to find. Most organizations today have specialists in a particular area—for example, front-end development, back-end development, or native iOS development. It is rare to find individuals who can do it all. Some have breadth but are deep (that is, specialists) in one area (“T”-shaped). To grow toward more inverted triangle-shaped individuals (that is, varying depth in several specialties), practices such as pair programming and frequent rotation of pairs helps to develop the necessary skills. This takes time and cannot be rushed. The Spotify method is being used by some of our clients and is proving to be a good model for MS-based development.

Large organizations such as IBM are starting to influence academic curricula to include education on cloud-native and MS-based application development to close the skills gap in the industry.

Conclusion

Increasing globalization and hypercompetition have resulted in dynamic business environments that require companies to adapt quickly to rapid and frequent changes. These environmental forces necessitate changes to business strategy that manifest in internal organizational changes. Software systems are intimately intertwined with all aspects of a company and, therefore, organizational changes warrant system changes.⁴^,⁷^,²¹^,²⁴

Companies continue to struggle, however, with effectively mapping organizational changes with IT/IS changes.⁷^,²⁹ This has been attributed to a company’s inability to deploy software applications independently and quickly.⁷ The software industry is enamored with MS-based development’s ability to overcome these perils.

Each MS contains its own data and represents a single function that corresponds to a business capability. This allows for seamless alignment of business services with software services (that is, microservices). While the new paradigm has considerable advantages, it can impose substantial costs as well.¹^,⁷^,²²^,²⁷ Hence, the challenges highlighted in this article must be considered to improve a company’s chances of success in adopting and sustaining MS-based software development.

Acknowledgments

This research was funded by grants from the Robert H. Brethen Operations Management Institute and the Earl V. Snyder Innovation Management Center at the Whitman School of Management, Syracuse University.

Submit an Article to CACM

CACM welcomes unsolicited submissions on topics of relevance and value to the computing community.

You Just Read

Adopting and Sustaining Microservice-Based Software Development

View in the ACM Digital Library

DOI

10.1145/3651620

Join the Discussion (0)

Become a Member or Sign In to Post a Comment

The Latest from CACM

Explore More

BLOG@CACM Apr 26 2024

Optimizing Energy Efficiency in Datacenters with Advanced Cooling Technologies

Alex Williams

Architecture and Hardware

Credit: Getty Images Servers in snowy setting.

News Apr 23 2024

Maximizing Power Grid Security

R. Colin Johnson

Security and Privacy

News Apr 18 2024

Keeping AI Out of Elections

Bennie Mols

Artificial Intelligence and Machine Learning

Shape the Future of Computing

ACM encourages its members to take a direct hand in shaping the future of the association. There are more ways than ever to get involved.

Get Involved

Communications of the ACM (CACM) is now a fully Open Access publication.

By opening CACM to the world, we hope to increase engagement among the broader computer science community and encourage non-members to discover the rich resources ACM has to offer.

Learn More

The Initial Decision to Adopt MS Architecture

Converting Existing Monoliths to MSs

Building New MSs and Modifying Existing Ones for New Requirements

Operating MSs and the Architecture

Sustaining a MSs Approach in the Long Haul

Conclusion

Acknowledgments

Adopting and Sustaining Microservice-Based Software Development

DOI

Related Reading

Join the Discussion (0)

Become a Member or Sign In to Post a Comment

Shape the Future of Computing

Communications of the ACM (CACM) is now a fully Open Access publication.