Skip to Main content Skip to Navigation
New interface
Conference papers

P2P Join Query Processing over Data Streams

Wenceslao Palma 1, 2 Reza Akbarinia 1 Esther Pacitti 3 Patrick Valduriez 1, 3 
1 ATLAS - Complex data management in distributed systems
UN - Université de Nantes, Inria Rennes – Bretagne Atlantique
3 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : Recent years have witnessed the growth of a new class of data-intensive applications that do not fit the DBMS data model and querying paradigm. Instead, the data arrive at high speeds taking the form of an unbounded sequence of values (data streams) and queries run continuously returning new results as new data arrive. In these applications, data streams from external sources flow into a Data Stream Management System (DSMS) where they are processed by different operators. Many applications share the same need for processing data streams in a continuous fashion. For most distributed streaming applications, the centralized processing of continuous queries over distributed data is simply not viable. This paper addresses the problem of computing continuous join queries over distributed data streams. We present a new method, called DHTJoin that exploits the power of a Distributed Hash Table (DHT) combining hash-based placement of tuples and dissemination of queries by exploiting the embedded trees in the underlying DHT, thereby incuring little overhead. Unlike state of the art solutions that index all data, DHTJoin identifies, using query predicates, a subset of tuples in order to index the data required by the user's queries, thus reducing network traffic. DHTJoin tackles the dynamic behavior of DHT networks during query execution and dissemination of queries. We provide a performance evaluation of DHTJoin which shows that it can achieve significant performance gains in terms of network traffic.
Document type :
Conference papers
Complete list of metadata

Cited literature [34 references]  Display  Hide  Download
Contributor : Wenceslao Palma Connect in order to contact the contributor
Submitted on : Tuesday, September 15, 2009 - 12:31:15 PM
Last modification on : Tuesday, September 6, 2022 - 4:59:20 PM
Long-term archiving on: : Thursday, June 30, 2011 - 11:48:34 AM


Files produced by the author(s)


  • HAL Id : inria-00416819, version 1


Wenceslao Palma, Reza Akbarinia, Esther Pacitti, Patrick Valduriez. P2P Join Query Processing over Data Streams. BDA: Bases de Données Avancées, Oct 2009, Namur, Belgium. ⟨inria-00416819⟩



Record views


Files downloads