G. Juve, E. Deelman, G. B. Berriman, B. P. Berman, and P. Maechling, An Evaluation of the Cost and Performance of Scientific Workflows on Amazon EC2, Journal of Grid Computing, vol.3, issue.3-4, pp.5-21, 2012.
DOI : 10.1007/s10723-012-9207-6

S. Sakr, A. Liu, D. M. Batista, and M. Alomari, A Survey of Large Scale Data Management Approaches in Cloud Environments, IEEE Communications Surveys & Tutorials, vol.13, issue.3, pp.311-336, 2011.
DOI : 10.1109/SURV.2011.032211.00087

URL : https://hal.archives-ouvertes.fr/inria-00623093

A. Padmanabhan, S. Wang, G. Cao, M. Hwang, Y. Zhao et al., FluMapper, Proceedings of the Conference on Extreme Science and Engineering Discovery Environment Gateway to Discovery, XSEDE '13, pp.1-33, 2013.
DOI : 10.1145/2484762.2484821

J. Balewski, J. Lauret, D. Olson, I. Sakrejda, D. Arkhipkin et al., Offloading peak processing to virtual farm by STAR experiment at RHIC, Journal of Physics: Conference Series, 2012.
DOI : 10.1088/1742-6596/368/1/012011

K. R. Jackson, L. Ramakrishnan, K. J. Runge, and R. C. Thomas, Seeking supernovae in the clouds, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, HPDC '10, pp.421-429, 2010.
DOI : 10.1145/1851476.1851538

B. E. Calder, Windows Azure Storage, Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles, SOSP '11, pp.143-157, 2011.
DOI : 10.1145/2043556.2043571

S. J. Kazemitabar, F. Banaei-Kashani, and D. McLeod, Geostreaming in cloud, Proceedings of the 2nd ACM SIGSPATIAL International Workshop on GeoStreaming, IWGS '11, pp.3-9, 2011.
DOI : 10.1145/2064959.2064962

A. Costan, R. Tudoran, G. Antoniu, and G. Brasche, TomusBlobs: Scalable Data-intensive Processing on Azure Clouds, Journal of Concurrency, 2013.

M. Dorier, G. Antoniu, F. Cappello, M. Snir, and L. Orf, Damaris: How to Efficiently Leverage Multicore Parallelism to Achieve Scalable, Jitter-free I/O, 2012 IEEE International Conference on Cluster Computing, 2012.
DOI : 10.1109/CLUSTER.2012.26

URL : https://hal.archives-ouvertes.fr/hal-00715252

E. Feller, L. Ramakrishnan, and C. Morin, On the performance and energy efficiency of Hadoop deployment models, 2013 IEEE International Conference on Big Data, 2013.
DOI : 10.1109/BigData.2013.6691564

URL : https://hal.archives-ouvertes.fr/hal-00856330

S. Gamage, R. R. Kompella, D. Xu, and A. Kangarlou, Protocol Responsibility Offloading to Improve TCP Throughput in Virtualized Environments, ACM Transactions on Computer Systems, vol.31, issue.3, pp.1-7, 2013.
DOI : 10.1145/2518037.2491463

E. Yildirim and T. Kosar, Network-aware end-to-end data throughput optimization, Proceedings of the first international workshop on Network-aware data management, NDM '11, pp.21-30, 2011.

G. Khanna, U. Catalyurek, T. Kurc, R. Kettimuthu, P. Sadayappan et al., A dynamic scheduling approach for coordinated wide-area data transfers using GridFTP, Parallel and Distributed Processing, pp.1-12, 2008.

J. Cala, H. Hiden, S. Woodman, and P. Watson, Cloud computing for fast prediction of chemical activity, Future Generation Computer Systems, vol.29, issue.7, pp.1860-1869, 2013.
DOI : 10.1016/j.future.2013.01.011

L. Hodgkinson, J. Rosa, and E. A. Brewer, Parallel Software Architecture for Experimental Workflows in Computational Biology on Clouds, Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part II, PPAM '11, pp.281-291, 2012.
DOI : 10.1007/978-3-642-31500-8_29

N. Edwards, M. Watkins, M. Gates, A. Coles, E. Deliot et al., High-speed Storage Nodes for the Cloud, 2011 Fourth IEEE International Conference on Utility and Cloud Computing, pp.25-32, 2011.
DOI : 10.1109/UCC.2011.14

R. Tudoran, A. Costan, and G. Antoniu, DataSteward: Using Dedicated Compute Nodes for Scalable Data Management on Public Clouds, 2013 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, 2013.
DOI : 10.1109/TrustCom.2013.129

URL : https://hal.archives-ouvertes.fr/hal-00927283

Z. Hill, J. Li, M. Mao, A. Ruiz-Alvarez, and M. Humphrey, Early observations on the performance of Windows Azure, Sci. Program., vol.19, issue.2-3, pp.121-132, 2011.

J. Dean and S. Ghemawat, MapReduce: simplified data processing on large clusters, Communications of the ACM, vol.51, issue.1, pp.107-113, 2008.

Z. Fadika, M. Govindaraju, R. Canon, and L. Ramakrishnan, Evaluating Hadoop for data-intensive scientific operations, Proceedings of the 2012 IEEE Fifth International Conference on Cloud Computing, CLOUD '12, 2012.

D. Alves, P. Bizarro, and P. Marques, Flood, Proceedings of the Fourth ACM International Conference on Distributed Event-Based Systems, DEBS '10, pp.113-114, 2010.
DOI : 10.1145/1827418.1827445

M. Zaharia, T. Das, H. Li, T. Hunter, S. Shenker et al., Discretized streams, Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles, SOSP '13, pp.423-438, 2013.
DOI : 10.1145/2517349.2522737

U. Verner, A. Schuster, M. Silberstein, and A. Mendelson, Scheduling processing of real-time data streams on heterogeneous multi-GPU systems, Proceedings of the 5th Annual International Systems and Storage Conference, SYSTOR '12, pp.1-812, 2012.
DOI : 10.1145/2367589.2367596

Y. Simmhan, C. van Ingen, G. Subramanian, and J. Li, Bridging the Gap between Desktop and the Cloud for eScience Applications, 2010 IEEE 3rd International Conference on Cloud Computing, pp.474-481, 2010.
DOI : 10.1109/CLOUD.2010.72

M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly, Dryad: distributed data-parallel programs from sequential building blocks, Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems 2007, EuroSys '07, pp.59-72, 2007.

A. Baptista, B. Howe, J. Freire, D. Maier, and C. T. Silva, Scientific Exploration in the Era of Ocean Observatories, Computing in Science & Engineering, vol.10, issue.3, pp.53-58, 2008.
DOI : 10.1109/MCSE.2008.83

L. Wang, J. Tao, H. Marten, A. Streit, S. U. Khan et al., MapReduce across Distributed Clusters for Data-intensive Applications, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum, pp.2004-2011, 2012.
DOI : 10.1109/IPDPSW.2012.249

Y. Luo and B. Plale, Hierarchical MapReduce Programming Model and Scheduling Algorithms, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid 2012), pp.769-774, 2012.
DOI : 10.1109/CCGrid.2012.132