Towards Scalable Data Management for Map-Reduce-based Data-Intensive Applications on Cloud and Hybrid Infrastructures - Inria - Institut national de recherche en sciences et technologies du numérique

Documentation
Français (FR)

Anglais (EN)

Communication Dans Un Congrès Année : 2012

Towards Scalable Data Management for Map-Reduce-based Data-Intensive Applications on Cloud and Hybrid Infrastructures

(1) , (2) , (3) , (1) , (4) , (5, 6, 7) , (1) , (2) , (2) , (2) , (8) , (5, 6) , (2) , (2) , (9) , (2) , (3)

1
2
3
4
5
6
7
8
9

Gabriel Antoniu

Fonction : Auteur
PersonId : 746326
IdHAL : gabriel-antoniu
ORCID : 0000-0001-6525-3736
IdRef : 095615296

Scalable Storage for Clouds and Beyond

Julien Bigot

Fonction : Auteur
PersonId : 2024
IdHAL : julien-bigot
ORCID : 0000-0002-0015-4304
IdRef : 154771996

Algorithms and Software Architectures for Distributed and HPC Platforms

Christophe Blanchet

Fonction : Auteur

Institut de biologie et chimie des protéines [Lyon]

Luc Bougé

Fonction : Auteur
PersonId : 1264
IdHAL : bouge
ORCID : 0000-0002-5510-4443
IdRef : 032062591

Scalable Storage for Clouds and Beyond

François Briant

Fonction : Auteur

IBM PSSC Montpellier - Innovation Lab.

Franck Cappello

Fonction : Auteur

Global parallel and distributed computing

Joint Laboratory for Petascale Computing [Illinois]

Laboratoire de Recherche en Informatique

Alexandru Costan

Fonction : Auteur
PersonId : 9361
IdHAL : alexandru-costan
ORCID : 0000-0003-3111-6308
IdRef : 220478279

Scalable Storage for Clouds and Beyond

Frédéric Desprez

Fonction : Auteur
PersonId : 6600
IdHAL : frederic-desprez
IdRef : 034430563

Algorithms and Software Architectures for Distributed and HPC Platforms

Gilles Fedak

Fonction : Auteur
PersonId : 2289
IdHAL : gilles-fedak
IdRef : 076982327

Algorithms and Software Architectures for Distributed and HPC Platforms

Sylvain Gault

Fonction : Auteur

Algorithms and Software Architectures for Distributed and HPC Platforms

Kate Keahey

Fonction : Auteur
PersonId : 884576

Argonne National Laboratory [Lemont]

Bogdan Nicolae

Fonction : Auteur
PersonId : 21945
IdHAL : bnicolae
ORCID : 0000-0002-0661-7509

Global parallel and distributed computing

Joint Laboratory for Petascale Computing [Illinois]

Christian Pérez

Fonction : Auteur
PersonId : 3022
IdHAL : chperez
IdRef : 094180962

Algorithms and Software Architectures for Distributed and HPC Platforms

Anthony Simonet

Fonction : Auteur
PersonId : 7366
IdHAL : asimonet

Algorithms and Software Architectures for Distributed and HPC Platforms

Frédéric Suter

Fonction : Auteur
PersonId : 739871
IdHAL : frederic-suter
ORCID : 0000-0003-1902-1955
IdRef : 078831962

Centre de Calcul de l'IN2P3

Bing Tang

Fonction : Auteur

Algorithms and Software Architectures for Distributed and HPC Platforms

Raphael Terreux

Fonction : Auteur

Institut de biologie et chimie des protéines [Lyon]

Résumé

As Map-Reduce emerges as a leading programming paradigm for data-intensive computing, today's frameworks which support it still have substantial shortcomings that limit its potential scalability. In this paper we discuss several directions where there is room for such progress: they concern storage efficiency under massive data access concurrency, scheduling, volatility and fault-tolerance. We place our discussion in the perspective of the current evolution towards an increasing integration of large-scale distributed platforms (clouds, cloud federations, enterprise desktop grids, etc.). We propose an approach which aims to overcome the current limitations of existing Map-Reduce frameworks, in order to achieve scalable, concurrency-optimized, fault-tolerant Map-Reduce data processing on hybrid infrastructures. This approach will be evaluated with real-life bio-informatics applications on existing Nimbus-powered cloud testbeds interconnected with desktop grids.

Mots clés

MapReduce cloud computing data-intensive computing hybrid infrastructures BlobSeer BitDew Nimbus HLCM Grid'5000

Domaines

Calcul parallèle, distribué et partagé [cs.DC]

Fichier principal

Vignette du fichier

ICACON2012-MapReduce.pdf (1.07 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Gabriel Antoniu : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00684866

Soumis le : vendredi 20 avril 2012-11:43:30

Dernière modification le : jeudi 11 avril 2024-13:18:11

Archivage à long terme le : samedi 21 juillet 2012-02:20:32

Dates et versions

hal-00684866 , version 1 (20-04-2012)

Identifiants

HAL Id : hal-00684866 , version 1

Citer

Gabriel Antoniu, Julien Bigot, Christophe Blanchet, Luc Bougé, François Briant, et al.. Towards Scalable Data Management for Map-Reduce-based Data-Intensive Applications on Cloud and Hybrid Infrastructures. 1st International IBM Cloud Academy Conference - ICA CON 2012, Apr 2012, Research Triangle Park, North Carolina, United States. ⟨hal-00684866⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

IN2P3 ENS-LYON INSTITUT-TELECOM EC-PARIS UNIV-RENNES1 UNIV-LILLE3 CNRS INRIA UNIV-LYON1 INSA-RENNES IRISA PRIMES PRIMES_WP5 UMR8623 IRISA-INSA-R IRISA-D1 INRIA2 UR1-MATH-STIC UNIV-PARIS-SACLAY UR1-UFR-ISTIC UNIV-RENNES UDL ANR UR1-MATH-NUM CC-IN2P3

1840 Consultations

335 Téléchargements

Partager