EXAD: A System for Explainable Anomaly Detection on Big Data Traces

Abstract : Big Data systems are producing huge amounts of data in real-time. Finding anomalies in these systems is becoming increasingly important, since it can help to reduce the number of failures, and improve the mean time of recovery. In this work, we present EXAD, a new prototype system for explainable anomaly detection, in particular for detecting and explaining anomalies in time-series data obtained from traces of Apache Spark jobs. Apache Spark has become the most popular software tool for processing Big Data. The new system contains the most well-known approaches to anomaly detection, and a novel generator of artificial traces, that can help the user to understand the different performances of the different methodologies. In this demo, we will show how this new framework works, and how users can benefit of detecting anomalies in an efficient and fast way when dealing with traces of jobs of Big Data systems.
Document type :
Conference papers
Complete list of metadatas

Cited literature [16 references]  Display  Hide  Download

https://hal.inria.fr/hal-02264598
Contributor : Fei Song <>
Submitted on : Wednesday, August 7, 2019 - 11:40:09 AM
Last modification on : Saturday, August 10, 2019 - 1:13:01 AM

File

exad_icdm2018.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02264598, version 1

Citation

Fei Song, Arnaud Stiegler, Yanlei Diao, Jesse Read, Albert Bifet. EXAD: A System for Explainable Anomaly Detection on Big Data Traces. ICDMW 2018 - IEEE International Conference on Data Mining Workshops, Nov 2018, Singapore, Singapore. ⟨hal-02264598⟩

Share

Metrics

Record views

73

Files downloads

224