L. M. Vaquero, L. Rodero-Merino, J. Caceres, and M. Lindner, A break in the clouds, ACM SIGCOMM Computer Communication Review, vol.39, issue.1, pp.50-55, 2009.
DOI : 10.1145/1496091.1496100

M. Mao and M. Humphrey, Auto-scaling to minimize cost and meet application deadlines in cloud workflows, Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, SC '11, pp.49:1-49:12, 2011.
DOI : 10.1145/2063384.2063449

M. Caballer, C. de Alfonso, F. Alvarruiz, and G. Moltó, EC3: Elastic Cloud Computing Cluster, Journal of Computer and System Sciences, vol.79, issue.8, pp.1341-1351, 2013.
DOI : 10.1016/j.jcss.2013.06.005

URL : http://hdl.handle.net/10251/37501

K. Keahey, P. Armstrong, J. Bresnahan, D. Labissoniere, and P. Riteau, Infrastructure outsourcing in multi-cloud environment, Proceedings of the 2012 workshop on Cloud services, federation, and the 8th open cirrus summit, FederatedClouds '12, pp.33-38, 2012.
DOI : 10.1145/2378975.2378984

R. N. Calheiros, A. N. Toosi, C. Vecchiola, and R. Buyya, A coordinator for scaling elastic applications across multiple clouds, Future Generation Computer Systems, vol.28, issue.8, pp.1350-1362, 2012.
DOI : 10.1016/j.future.2012.03.010

J. Dean and S. Ghemawat, MapReduce, 6th Symposium on Operating Systems Design and Implementation, pp.137-149, 2004.
DOI : 10.1145/1327452.1327492

P. Marshall, K. Keahey, and T. Freeman, Elastic Site: Using Clouds to Elastically Extend Site Resources, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing, pp.43-52, 2010.
DOI : 10.1109/CCGRID.2010.80

L. Beernaert, M. Matos, R. Vilaça, and R. Oliveira, Automatic elasticity in OpenStack, Proceedings of the Workshop on Secure and Dependable Middleware for Cloud Monitoring and Management, SDMCMM '12, pp.1-2, 2012.
DOI : 10.1145/2405186.2405188

B. Nicolae, On the benefits of transparent compression for cost-effective cloud data storage, Transactions on Large-Scale Data- and Knowledge-Centered Systems, pp.167-184, 2011.

Open source cloud technologies, SoCC '12: Proceedings of the 3rd ACM Symposium on Cloud Computing, 2012.

J. L. Gonzalez and T. Cortes, Increasing the capacity of RAID5 by online gradual assimilation, SNAPI '04: Proceedings of the international workshop on Storage network architecture and parallel I/Os, pp.17-24, 2004.

W. Zheng and G. Zhang, FastScale: Accelerate RAID scaling by minimizing data migration, FAST'11: Proceedings of the 9th USENIX conference on File and Storage Technologies, 2011.

C. Wu and X. He, GSR: A Global Stripe-Based Redistribution Approach to Accelerate RAID-5 Scaling, 2012 41st International Conference on Parallel Processing, pp.460-469, 2012.
DOI : 10.1109/ICPP.2012.32

O. Rodeh, J. Bacik, and C. Mason, BTRFS, ACM Transactions on Storage, vol.9, issue.3, pp.9:1-9:32, 2013.
DOI : 10.1145/2501620.2501623

B. Nicolae, G. Antoniu, L. Bougé, D. Moise, and A. Carpen-Amarie, BlobSeer: Next-generation data management for large scale infrastructures, Journal of Parallel and Distributed Computing, vol.71, issue.2, pp.169-184, 2011.
DOI : 10.1016/j.jpdc.2010.08.004

URL : https://hal.archives-ouvertes.fr/inria-00511414

H. C. Lim, S. Babu, and J. S. Chase, Automated control for elastic storage, Proceeding of the 7th international conference on Autonomic computing, ICAC '10, pp.1-10, 2010.
DOI : 10.1145/1809049.1809051

P. Xia, D. Feng, H. Jiang, L. Tian, and F. Wang, FARMER, Proceedings of the 17th international symposium on High performance distributed computing, HPDC '08, pp.185-196, 2008.
DOI : 10.1145/1383422.1383445

M. Iritani and H. Yokota, Effects on performance and energy reduction by file relocation based on file-access correlations, Proceedings of the 2012 Joint EDBT/ICDT Workshops, EDBT-ICDT '12, pp.79-86, 2012.
DOI : 10.1145/2320765.2320794

Z. Li, Z. Chen, and Y. Zhou, Mining block correlations to improve storage performance, ACM Transactions on Storage, vol.1, issue.2, pp.213-245, 2005.
DOI : 10.1145/1063786.1063790

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=

M. Stokely, A. Mehrabian, C. Albrecht, F. Labelle, and A. Merchant, Projecting disk usage based on historical trends in a cloud environment, Proceedings of the 3rd workshop on Scientific Cloud Computing, ScienceCloud '12, pp.63-70, 2012.
DOI : 10.1145/2287036.2287050

J. He, J. Bent, A. Torres, G. Grider, G. Gibson et al., I/O acceleration with pattern detection, Proceedings of the 22nd international symposium on High-performance parallel and distributed computing, HPDC '13, pp.25-36, 2013.
DOI : 10.1145/2493123.2462909

J. Oly and D. A. Reed, Markov model prediction of I/O requests for scientific applications, Proceedings of the 16th international conference on Supercomputing, ICS '02, pp.147-155, 2002.
DOI : 10.1145/514191.514214

B. Nicolae, J. Bresnahan, K. Keahey, and G. Antoniu, Going back and forth, Proceedings of the 20th international symposium on High performance distributed computing, HPDC '11, pp.147-158, 2011.
DOI : 10.1145/1996130.1996152

URL : https://hal.archives-ouvertes.fr/inria-00570682

N. Draper and H. Smith, Applied regression analysis, ser. Probability and mathematical statistics, 1966.
DOI : 10.1002/9781118625590

F. Bellard, QEMU, a fast and portable dynamic translator, ATEC '05: Proceedings of the 2005 USENIX Annual Technical Conference, pp.41-46, 2005.

Y. Bu, B. Howe, M. Balazinska, and M. D. Ernst, The HaLoop approach to large-scale iterative data analysis, The VLDB Journal, vol.21, issue.2, pp.169-190, 2012.
DOI : 10.1007/s00778-012-0269-7

H. Bock, Clustering methods: A history of K-Means algorithms, Selected Contributions in Data Analysis and Classification, ser. Studies in Classification, Data Analysis, and Knowledge Organization, pp.161-172, 2007.

W. Zhao, H. Ma, and Q. He, Parallel K-Means Clustering Based on MapReduce, CloudCom '09: Proceedings of the 1st International Conference on Cloud Computing, pp.674-679, 2009.
DOI : 10.1007/978-3-642-10665-1_71