R. E. Bryant, Data-intensive supercomputing: The case for disc, Tech. rep., CMU, 2007.

R. Buyya, Market-oriented cloud computing: Vision, hype, and reality of delivering computing as the 5th utility. Cluster Computing and the Grid, IEEE International Symposium on, vol.0, issue.1, 2009.

J. Dean and S. Ghemawat, MapReduce, Communications of the ACM, vol.51, issue.1, pp.107-113, 2008.
DOI : 10.1145/1327452.1327492

D. Dewitt and J. Gray, Parallel database systems: the future of high performance database systems, Communications of the ACM, vol.35, issue.6, pp.85-98, 1992.
DOI : 10.1145/129888.129894

S. Ghandeharizadeh, C. Papadopoulos, P. Pol, and R. Zhou, Nam: a network adaptable middleware to enhance response time of web services, MASCOTS '03: 11th IEEE/ACM International Symposium on Modeling, pp.136-145, 2003.

S. Ghemawat, H. Gobioff, and S. T. Leung, The Google file system, ACM SIGOPS Operating Systems Review, vol.37, issue.5, pp.29-43, 2003.
DOI : 10.1145/1165389.945450

M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly, Dryad, ACM SIGOPS Operating Systems Review, vol.41, issue.3, pp.59-72, 2007.
DOI : 10.1145/1272998.1273005

E. Jeannot, B. Knutsson, and M. Björkman, Adaptive online data compression, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing, p.379, 2002.
DOI : 10.1109/HPDC.2002.1029938

URL : https://hal.archives-ouvertes.fr/inria-00100877

Y. Jégou, S. Lantéri, J. Leduc, M. Noredine, G. Mornet et al., Grid'5000: a large scale and highly reconfigurable experimental grid testbed, International Journal of High Performance Computing Applications, vol.20, issue.4, pp.481-494, 2006.

C. Krintz and S. Sucu, Adaptive on-the-fly compression, IEEE Transactions on Parallel and Distributed Systems, vol.17, issue.1, pp.15-24, 2006.
DOI : 10.1109/TPDS.2006.3

B. Nicolae, G. Antoniu, and L. Bougé, BlobSeer, Proceedings of the 2009 EDBT/ICDT Workshops on, EDBT/ICDT '09, 2009.
DOI : 10.1145/1698790.1698796

URL : https://hal.archives-ouvertes.fr/hal-00803430

B. Nicolae, G. Antoniu, and L. Bougé, Enabling High Data Throughput in Desktop Grids through Decentralized Data and Metadata Management: The BlobSeer Approach, Proc. 15th International Euro-Par Conference on Parallel Processing (Euro-Par '09). Lect. Notes in Comp. Science, pp.404-416, 2009.
DOI : 10.1177/1094342006070078

URL : https://hal.archives-ouvertes.fr/inria-00410956

B. Nicolae, D. Moise, G. Antoniu, L. Bougé, and M. Dorier, BlobSeer: Bringing high throughput under heavy concurrency to Hadoop Map-Reduce applications, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), 2010.
DOI : 10.1109/IPDPS.2010.5470433

URL : https://hal.archives-ouvertes.fr/inria-00456801

M. F. Oberhumer, Lempel-ziv-oberhumer, 2009.

A. Pavlo, E. Paulson, A. Rasin, D. J. Abadi, D. J. Dewitt et al., A comparison of approaches to large-scale data analysis, Proceedings of the 35th SIGMOD international conference on Management of data, SIGMOD '09, pp.165-178, 2009.
DOI : 10.1145/1559845.1559865

A. Raghuveer, M. Jindal, M. F. Mokbel, B. Debnath, and D. Du, Towards efficient search on unstructured data, Proceedings of the sixteenth ACM conference on Conference on information and knowledge management , CIKM '07, pp.951-954, 2007.
DOI : 10.1145/1321440.1321583

L. M. Vaquero, L. Rodero-merino, J. Caceres, and M. Lindner, A break in the clouds, ACM SIGCOMM Computer Communication Review, vol.39, issue.1, pp.50-55, 2009.
DOI : 10.1145/1496091.1496100

Y. Wiseman, K. Schwan, and P. Widener, Efficient end to end data exchange using configurable compression, ACM SIGOPS Operating Systems Review, vol.39, issue.3, pp.4-23, 2005.
DOI : 10.1145/1075395.1075396

J. Ziv and A. Lempel, A universal algorithm for sequential data compression, IEEE Transactions on Information Theory, vol.23, issue.3, pp.337-343, 1977.
DOI : 10.1109/TIT.1977.1055714