P2P Join Query Processing over Data Streams

Wenceslao Palma 1, 2 Reza Akbarinia 1 Esther Pacitti 3 Patrick Valduriez 1, 3
1 ATLAS - Complex data management in distributed systems
UN - Université de Nantes, Inria Rennes – Bretagne Atlantique
3 ZENITH - Scientific Data Management
LIRMM - Laboratoire d'Informatique de Robotique et de Microélectronique de Montpellier, CRISAM - Inria Sophia Antipolis - Méditerranée
Abstract : Recent years have witnessed the growth of a new class of data-intensive applications that do not fit the DBMS data model and querying paradigm. Instead, the data arrive at high speeds taking the form of an unbounded sequence of values (data streams) and queries run continuously returning new results as new data arrive. In these applications, data streams from external sources flow into a Data Stream Management System (DSMS) where they are processed by different operators. Many applications share the same need for processing data streams in a continuous fashion. For most distributed streaming applications, the centralized processing of continuous queries over distributed data is simply not viable. This paper addresses the problem of computing continuous join queries over distributed data streams. We present a new method, called DHTJoin that exploits the power of a Distributed Hash Table (DHT) combining hash-based placement of tuples and dissemination of queries by exploiting the embedded trees in the underlying DHT, thereby incuring little overhead. Unlike state of the art solutions that index all data, DHTJoin identifies, using query predicates, a subset of tuples in order to index the data required by the user's queries, thus reducing network traffic. DHTJoin tackles the dynamic behavior of DHT networks during query execution and dissemination of queries. We provide a performance evaluation of DHTJoin which shows that it can achieve significant performance gains in terms of network traffic.
Type de document :
Communication dans un congrès
BDA: Bases de Données Avancées, Oct 2009, Namur, Belgium. 2009
Liste complète des métadonnées

Littérature citée [34 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00416819
Contributeur : Wenceslao Palma <>
Soumis le : mardi 15 septembre 2009 - 12:31:15
Dernière modification le : jeudi 24 mai 2018 - 15:59:21
Document(s) archivé(s) le : jeudi 30 juin 2011 - 11:48:34

Fichier

dhtjoin.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00416819, version 1

Citation

Wenceslao Palma, Reza Akbarinia, Esther Pacitti, Patrick Valduriez. P2P Join Query Processing over Data Streams. BDA: Bases de Données Avancées, Oct 2009, Namur, Belgium. 2009. 〈inria-00416819〉

Partager

Métriques

Consultations de la notice

467

Téléchargements de fichiers

288