Locality-Aware Routing in Stateful Streaming Applications

Abstract : Distributed stream processing engines continuously execute series of operators on data streams. Horizontal scaling is achieved by deploying multiple instances of each operator in order to process data tuples in parallel. As the application is distributed on an increasingly high number of servers, the likelihood that the stream is sent to a different server for each operator increases. This is particularly important in the case of stateful applications that rely on keys to deterministically route messages to a specific instance of an operator. Since network is a bottleneck for many stream applications, this behavior significantly degrades their performance. Our objective is to improve stream locality for stateful stream processing applications. We propose to analyse traces of the application to uncover correlations between the keys used in successive routing operations. By assigning correlated keys to instances hosted on the same server, we significantly reduce network consumption and increase performance while preserving load balance. Furthermore, this approach is executed online, so that the assignment can automatically adapt to changes in the characteristics of the data. Data migration is handled seamlessly with each routing configuration update. We implemented and evaluated our protocol using Apache Storm, with a real workload consisting of geo-tagged Flickr pictures as well as Twitter publications. Our results show a significant improvement in throughput.
Complete list of metadatas

Cited literature [19 references]  Display  Hide  Download

https://hal.inria.fr/hal-01407457
Contributor : Matthieu Caneill <>
Submitted on : Monday, December 12, 2016 - 4:03:31 PM
Last modification on : Friday, October 25, 2019 - 1:28:29 AM
Long-term archiving on : Tuesday, March 21, 2017 - 6:52:11 AM

File

storm-locality-middleware-2016...
Explicit agreement for this submission

Identifiers

Citation

Matthieu Caneill, Ahmed El Rheddane, Vincent Leroy, Noël de Palma. Locality-Aware Routing in Stateful Streaming Applications. Middleware'16 - 17th International Middleware Conference, ACM, IFIP, USENIX, Dec 2016, Trento, Italy. pp.1 - 13, ⟨10.1145/2988336.2988340⟩. ⟨hal-01407457⟩

Share

Metrics

Record views

383

Files downloads

271