MicroRCA: Root Cause Localization of Performance Issues in Microservices

Abstract : Software architecture is undergoing a transition from monolithic architectures to microservices to achieve resilience , agility and scalability in software development. However, with microservices it is difficult to diagnose performance issues due to technology heterogeneity, large number of microservices, and frequent updates to both software features and infrastructure. This paper presents MicroRCA, a system to locate root causes of performance issues in microservices. MicroRCA infers root causes in real time by correlating application performance symptoms with corresponding system resource utilization, without any application instrumentation. The root cause localization is based on an attributed graph that model anomaly propagation across services and machines. Our experimental evaluation where common anomalies are injected to a microservice benchmark running in a Kubernetes cluster shows that MicroRCA locates root causes well, with 89% precision and 97% mean average precision, outperforming several state-of-the-art methods.
Complete list of metadatas

Cited literature [31 references]  Display  Hide  Download

https://hal.inria.fr/hal-02441640
Contributor : Guillaume Pierre <>
Submitted on : Thursday, January 16, 2020 - 12:39:35 PM
Last modification on : Friday, January 17, 2020 - 1:15:28 AM

File

main.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02441640, version 1

Collections

Citation

Li Wu, Johan Tordsson, Erik Elmroth, Odej Kao. MicroRCA: Root Cause Localization of Performance Issues in Microservices. IEEE/IFIP Network Operations and Management Symposium (NOMS), Apr 2020, Budapest, Hungary. ⟨hal-02441640⟩

Share

Metrics

Record views

87

Files downloads

230