Benchmarking SQL on MapReduce systems using large astronomy databases

Amin Mesmoudi; Mohand-Saïd Hacid; Farouk Toumani

doi:10.1007/s10619-014-7172-8

Article Dans Une Revue Distributed and Parallel Databases Année : 2016

Benchmarking SQL on MapReduce systems using large astronomy databases

(1) , (1) , (2)

1
2

Amin Mesmoudi

Fonction : Auteur
PersonId : 5650
IdHAL : amin-mesmoudi
ORCID : 0000-0003-1307-591X
IdRef : 191781576

Base de Données

Mohand-Saïd Hacid

Fonction : Auteur
PersonId : 7283
IdHAL : mohand-said-hacid
IdRef : 070848440

Base de Données

Farouk Toumani

Fonction : Auteur
PersonId : 172768
IdHAL : farouk-toumani
IdRef : 139537619

Laboratoire d'Informatique, de Modélisation et d'optimisation des Systèmes

Résumé

In the era of bigdata, with a massive set of digital information of unprecedented volumes being collected and/or produced in several application domains , it becomes more and more difficult to manage and query large data repositories. In the framework of the PetaSky project (http://com.isima.fr/Petasky), we focus on the problem of managing scientific data in the field of cosmology. The data we consider are those of the LSST project (http://www.lsst.org/). The overall size of the database that will be produced is expected to exceed 60 PB [28]. In order to evaluate the performances of existing SQL On MapReduce data management systems, we conducted extensive experiments by using data and queries from the area of cosmology. The goal of this work is to report on the ability of such systems to support large scale declarative queries. We mainly investigated the impact of data partitioning, indexing and compression on query execution performances.

Domaines

Informatique [cs] Base de données [cs.DB]

Fichier principal

bench_sql_mapr.pdf (527.32 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Amin Mesmoudi : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01221665

Soumis le : mercredi 28 octobre 2015-13:32:26

Dernière modification le : mercredi 5 juillet 2023-15:28:04

Archivage à long terme le : vendredi 28 avril 2017-05:49:48

Dates et versions

hal-01221665 , version 1 (28-10-2015)

Identifiants

HAL Id : hal-01221665 , version 1
DOI : 10.1007/s10619-014-7172-8

Citer

Amin Mesmoudi, Mohand-Saïd Hacid, Farouk Toumani. Benchmarking SQL on MapReduce systems using large astronomy databases. Distributed and Parallel Databases, 2016, 34 (3), pp.347-378. ⟨10.1007/s10619-014-7172-8⟩. ⟨hal-01221665⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

PRES_CLERMONT CNRS UNIV-LYON1 UNIV-LYON2 INSA-LYON EC-LYON LIRIS LIMOS LABEXIMU INSA-GROUPE UDL CLERMONT-AUVERGNE-INP

347 Consultations

567 Téléchargements

Benchmarking SQL on MapReduce systems using large astronomy databases

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager