Parallel OLAP query processing in database clusters with data replication

Abstract : We consider the problem of improving the performance of OLAP applications in a database cluster (DBC), which is a low cost and effective parallel solution for query processing. Current DBC solutions for OLAP query processing provide for intra-query parallelism only, at the cost of full replication of the database. In this paper, we proposemore efficient distributed database design alternatives which combine physical/virtual partitioning with partial replication.We also propose a new load balancing strategy that takes advantage of an adaptive virtual partitioning to redistribute the load to the replicas. Our experimental validation is based on the implementation of our solution on the SmaQSS DBC middleware prototype. Our experimental results using the TPC-H benchmark and a 32-node cluster show very good speedup.
Type de document :
Article dans une revue
Distributed and Parallel Database (Weston, Conn.)s, 2009, 25 (1-2), pp.97-123
Liste complète des métadonnées

https://hal.inria.fr/inria-00482183
Contributeur : Patrick Valduriez <>
Soumis le : dimanche 9 mai 2010 - 19:14:18
Dernière modification le : mercredi 11 avril 2018 - 01:56:58

Identifiants

  • HAL Id : inria-00482183, version 1

Collections

Citation

Alexandre Lima, Camille Furtado, Patrick Valduriez, Marta Mattoso. Parallel OLAP query processing in database clusters with data replication. Distributed and Parallel Database (Weston, Conn.)s, 2009, 25 (1-2), pp.97-123. 〈inria-00482183〉

Partager

Métriques

Consultations de la notice

204