Resilin: Elastic MapReduce for Private and Community Clouds

Abstract : The MapReduce programming model, introduced by Google, offers a simple and efficient way of performing distributed computation over large data sets. Although Google's implementation is proprietary, MapReduce can be leveraged by anyone using the free and open source Apache Hadoop framework. To simplify the usage of Hadoop in the cloud, Amazon Web Services offers Elastic MapReduce, a web service enabling users to run MapReduce jobs. Elastic MapReduce takes care of resource provisioning, Hadoop configuration and performance tuning, data staging, fault tolerance, etc. This service drastically reduces the entry barrier to perform MapReduce computations in the cloud, allowing users to concentrate on the problem to solve. However, Elastic MapReduce is restricted to Amazon EC2 resources, and is provided at an additional cost. In this paper, we present Resilin, a system implementing the Elastic MapReduce API with resources from other clouds than Amazon EC2, such as private and community clouds. Furthermore, we explore a feature going beyond the current Amazon Elastic MapReduce offering: performing MapReduce computations over multiple distributed clouds.
Complete list of metadatas

Cited literature [23 references]  Display  Hide  Download

https://hal.inria.fr/inria-00632040
Contributor : Pierre Riteau <>
Submitted on : Thursday, October 13, 2011 - 12:00:58 PM
Last modification on : Thursday, November 15, 2018 - 11:57:44 AM

File

RR-7767.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00632040, version 1

Citation

Pierre Riteau, Ancuta Iordache, Christine Morin. Resilin: Elastic MapReduce for Private and Community Clouds. [Research Report] RR-7767, INRIA. 2011, pp.18. ⟨inria-00632040⟩

Share

Metrics

Record views

524

Files downloads

687