MapIterativeReduce: A Framework for Reduction-Intensive Data Processing on Azure Clouds

Radu Tudoran; Alexandru Costan; Gabriel Antoniu

doi:10.1145/2287016.2287019

Communication Dans Un Congrès Année : 2012

MapIterativeReduce: A Framework for Reduction-Intensive Data Processing on Azure Clouds

(1) , (1) , (1)

Radu Tudoran

Fonction : Auteur
PersonId : 914308

Scalable Storage for Clouds and Beyond

Alexandru Costan

Fonction : Auteur
PersonId : 9361
IdHAL : alexandru-costan
ORCID : 0000-0003-3111-6308
IdRef : 220478279

Scalable Storage for Clouds and Beyond

Gabriel Antoniu

Fonction : Auteur
PersonId : 746326
IdHAL : gabriel-antoniu
ORCID : 0000-0001-6525-3736
IdRef : 095615296

Scalable Storage for Clouds and Beyond

Résumé

With the emergence of cloud computing as an alternative to supercomputers to support data intensive applications, MapReduce has arisen as a major programming model for data analysis on clouds. In this context, reduce-intensive algorithms are becoming increasingly useful in applications such as data clustering, classification and mining. However, platforms like MapReduce or Dryad lack built-in support for reduce-intensive workloads. This paper introduces MapIter- ativeReduce, a framework which 1) extends the MapReduce programming model to better support reduce-intensive ap- plications and 2) substantially improves their efficiency by eliminating the implicit barrier between the Map and the Reduce phase. We evaluated MapIterativeReduce on the Microsoft Azure cloud with synthetic benchmarks and with a real-life application. Compared to state-of-art solutions, our approach reduces the execution times by up to 75%

Mots clés

MapReduce MapIterativeReduce cloud computing Azure data intensive computing

Domaines

Calcul parallèle, distribué et partagé [cs.DC]

Gabriel Antoniu : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00684814

Soumis le : mardi 3 avril 2012-11:33:42

Dernière modification le : vendredi 24 mars 2023-14:52:55

Dates et versions

hal-00684814 , version 1 (03-04-2012)

Identifiants

HAL Id : hal-00684814 , version 1
DOI : 10.1145/2287016.2287019

Citer

Radu Tudoran, Alexandru Costan, Gabriel Antoniu. MapIterativeReduce: A Framework for Reduction-Intensive Data Processing on Azure Clouds. Third International Workshop on MapReduce and its Applications (MAPREDUCE'12), held in conjunction with ACM HPDC'12., Jun 2012, Delft, Netherlands. pp.9-16, ⟨10.1145/2287016.2287019⟩. ⟨hal-00684814⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA IRISA-INSA-R IRISA-D1 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES ANR UR1-MATH-NUM

251 Consultations

0 Téléchargements

MapIterativeReduce: A Framework for Reduction-Intensive Data Processing on Azure Clouds

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager