TurboSream: Towards Low-Latency Data Stream Processing

Song Wu 1 Mi Liu 1 Shadi Ibrahim 2 Hai Jin 1 Lin Gu 1 Fei Chen 1 Zhiyi Liu 1
2 STACK - Software Stack for Massively Geo-Distributed Infrastructures
Inria Rennes – Bretagne Atlantique , LS2N - Laboratoire des Sciences du Numérique de Nantes
Abstract : Data Stream Processing (DSP) applications are often modelled as a directed acyclic graph: operators with data streams among them. Inter-operator communications can have a significant impact on the latency of DSP applications, accounting for 86% of the total latency. Despite their impact, there has been relatively little work on optimizing inter-operator communications, focusing on reducing inter-node traffic but not considering inter-process communication (IPC) inside a node, which often generates high latency due to the multiple memory-copy operations. This paper describes the design and implementation of TurboStream, a new DSP system designed specifically to address the high latency caused by inter-operator communications. To achieve this goal, we introduce (1) an improved IPC framework with OSRBuffer, a DSP-oriented buffer, to reduce memory-copy operations and waiting time of each single message when transmitting messages between the operators inside one node, and (2) a coarse-grained scheduler that consolidates operator instances and assigns them to nodes to diminish the inter-node IPC traffic. Using a prototype implementation, we show that our improved IPC framework reduces the end-to-end latency of intra-node IPC by 45.64% to 99.30%. Moreover, TurboStream reduces the latency of DSP by 83.23% compared to JStorm.
Type de document :
Communication dans un congrès
ICDCS 2018 - 38th IEEE International Conference on Distributed Computing Systems, Jul 2018, Vienna, Austria. IEEE, pp.1-11, 〈10.1109/ICDCS.2018.00099〉
Liste complète des métadonnées

https://hal.inria.fr/hal-01807302
Contributeur : Shadi Ibrahim <>
Soumis le : vendredi 21 septembre 2018 - 10:29:57
Dernière modification le : lundi 24 septembre 2018 - 13:46:36

Fichier

ICDCS2018.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Song Wu, Mi Liu, Shadi Ibrahim, Hai Jin, Lin Gu, et al.. TurboSream: Towards Low-Latency Data Stream Processing. ICDCS 2018 - 38th IEEE International Conference on Distributed Computing Systems, Jul 2018, Vienna, Austria. IEEE, pp.1-11, 〈10.1109/ICDCS.2018.00099〉. 〈hal-01807302〉

Partager

Métriques

Consultations de la notice

318

Téléchargements de fichiers

31