Skip to Main content Skip to Navigation
Conference papers

StreamDM: Advanced Data Mining in Spark Streaming

Albert Bifet 1 Silviu Maniu 2 Jianfeng Qian 3 Guangjian Tian 3 Cheng He 3 Wei Fan 4
1 DBWeb
LTCI - Laboratoire Traitement et Communication de l'Information
Abstract : Real-time analytics are becoming increasingly important due to the large amount of data that is being created continuously. Drawing from our experiences at Huawei Noah's Ark Lab, we present and demonstrate here StreamDM, a new open source data mining and machine learning library, designed on top of Spark Streaming, an extension of the core Spark API that enables scalable stream processing of data streams. StreamDM is designed to be easily extended and used, either practitioners, developers, or researchers, and is the first library to contain advanced stream mining algorithms for Spark Streaming .
Complete list of metadata

Cited literature [10 references]  Display  Hide  Download

https://hal.inria.fr/hal-01270606
Contributor : Silviu Maniu <>
Submitted on : Monday, February 8, 2016 - 11:33:51 AM
Last modification on : Wednesday, October 14, 2020 - 1:28:14 PM
Long-term archiving on: : Saturday, November 12, 2016 - 1:25:30 PM

File

bifet2015streamdm.pdf
Files produced by the author(s)

Identifiers

Citation

Albert Bifet, Silviu Maniu, Jianfeng Qian, Guangjian Tian, Cheng He, et al.. StreamDM: Advanced Data Mining in Spark Streaming. International Conference on Data Mining Workshops (ICDMW), IEEE, Nov 2015, Atlantic City, NJ, United States. ⟨10.1109/ICDMW.2015.140⟩. ⟨hal-01270606⟩

Share

Metrics

Record views

636

Files downloads

1582