TurboSream: Towards Low-Latency Data Stream Processing

Song Wu 1 Mi Liu 1 Shadi Ibrahim 2 Hai Jin 1 Lin Gu 1 Fei Chen 1 Zhiyi Liu 1
2 STACK - Software Stack for Massively Geo-Distributed Infrastructures
Inria Rennes – Bretagne Atlantique , LS2N - Laboratoire des Sciences du Numérique de Nantes
Abstract : Data Stream Processing (DSP) applications are often modelled as a directed acyclic graph: operators with data streams among them. Inter-operator communications can have a significant impact on the latency of DSP applications, accounting for 86% of the total latency. Despite their impact, there has been relatively little work on optimizing inter-operator communications, focusing on reducing inter-node traffic but not considering inter-process communication (IPC) inside a node, which often generates high latency due to the multiple memory-copy operations. This paper describes the design and implementation of TurboStream, a new DSP system designed specifically to address the high latency caused by inter-operator communications. To achieve this goal, we introduce (1) an improved IPC framework with OSRBuffer, a DSP-oriented buffer, to reduce memory-copy operations and waiting time of each single message when transmitting messages between the operators inside one node, and (2) a coarse-grained scheduler that consolidates operator instances and assigns them to nodes to diminish the inter-node IPC traffic. Using a prototype implementation, we show that our improved IPC framework reduces the end-to-end latency of intra-node IPC by 45.64% to 99.30%. Moreover, TurboStream reduces the latency of DSP by 83.23% compared to JStorm.
Type de document :
Communication dans un congrès
ICDCS 2018 - 38th IEEE International Conference on Distributed Computing Systems, Jul 2018, Vienna, Austria
Liste complète des métadonnées

Contributeur : Shadi Ibrahim <>
Soumis le : lundi 4 juin 2018 - 15:51:30
Dernière modification le : lundi 13 août 2018 - 13:34:02


  • HAL Id : hal-01807302, version 1



Song Wu, Mi Liu, Shadi Ibrahim, Hai Jin, Lin Gu, et al.. TurboSream: Towards Low-Latency Data Stream Processing. ICDCS 2018 - 38th IEEE International Conference on Distributed Computing Systems, Jul 2018, Vienna, Austria. 〈hal-01807302〉



Consultations de la notice