S. Tansley and &. K. Tolle, The fourth paradigm: data-intensive scientific discovery, 2009.

B. Lange and &. P. Fortin, Parallel Dual Tree Traversal on Multi-core and Many-core Architectures for Astrophysical N-body Simulations, 2014.
DOI : 10.1007/978-3-319-09873-9_60

URL : https://hal.archives-ouvertes.fr/hal-00947130

W. Dehnen, A Hierarchical (N) Force Calculation Algorithm, Journal of Computational Physics, vol.179, issue.1, pp.27-42, 2002.
DOI : 10.1006/jcph.2002.7026

W. Fan and A. Bifet, Mining big data, ACM SIGKDD Explorations Newsletter, vol.14, issue.2, pp.1-5, 2013.
DOI : 10.1145/2481244.2481246

D. Laney, 3d data management: Controlling data volume, velocity and variety, META Group Research Note, vol.6, 2001.

J. Dean and L. A. Barroso, The tail at scale, Communications of the ACM, vol.56, issue.2, pp.74-80, 2013.
DOI : 10.1145/2408776.2408794

J. Dean and S. Ghemawat, MapReduce, Communications of the ACM, vol.51, issue.1, pp.107-113, 2008.
DOI : 10.1145/1327452.1327492

F. Chang, J. Dean, S. Ghemawat, W. C. Hsieh, D. A. Wallach et al., Bigtable, ACM Transactions on Computer Systems, vol.26, issue.2, p.4, 2008.
DOI : 10.1145/1365815.1365816

S. Ghemawat, H. Gobioff, and S. T. Leung, The Google file system, ACM SIGOPS Operating Systems Review, vol.37, issue.5, pp.29-43, 2003.
DOI : 10.1145/1165389.945450

K. Saroj, Idc report to hadoop leads the big data analytics tool for enterprises -http://cloudtimes.org/2013/11/06/idc-report-hadoop-leads- the-big-data-analytics-tool-for-enterprises

D. Borthakur, The hadoop distributed file system: Architecture and design, Hadoop Project Website, vol.11, issue.21, 2007.

C. Lam, Hadoop in action, 2010.

M. N. Vora, Hadoop-HBase for large-scale data, Computer Science and Network Technology (ICCSNT), 2011 International Conference on, pp.601-605, 2011.

R. C. Taylor, An overview of the Hadoop/MapReduce/HBase framework and its current applications in bioinformatics, BMC Bioinformatics, vol.11, issue.Suppl 12, p.1, 2010.
DOI : 10.1186/1471-2105-11-S12-S1

M. Isard, M. Budiu, Y. Yu, A. Birrell, and D. Fetterly, Dryad, ACM SIGOPS Operating Systems Review, vol.41, issue.3, pp.59-72, 2007.
DOI : 10.1145/1272998.1273005

V. K. Vavilapalli, A. C. Murthy, C. Douglas, S. Agarwal, M. Konar et al., Apache Hadoop YARN, Proceedings of the 4th annual Symposium on Cloud Computing, SOCC '13, p.2012
DOI : 10.1145/2523616.2523633

B. He, W. Fang, Q. Luo, N. K. Govindaraju, and T. Wang, Mars, Proceedings of the 17th international conference on Parallel architectures and compilation techniques, PACT '08, pp.260-269, 2008.
DOI : 10.1145/1454115.1454152

Z. Xiao, H. Chen, and B. Zang, A Hierarchical Approach to Maximizing MapReduce Efficiency, 2011 International Conference on Parallel Architectures and Compilation Techniques, pp.167-168, 2011.
DOI : 10.1109/PACT.2011.22

R. M. Yoo, A. Romano, and C. Kozyrakis, Phoenix rebirth: Scalable MapReduce on a large-scale shared-memory system, 2009 IEEE International Symposium on Workload Characterization (IISWC), pp.198-207, 2009.
DOI : 10.1109/IISWC.2009.5306783

Z. Fadika, E. Dede, M. Govindaraju, and L. Ramakrishnan, MARIANE: MApReduce Implementation Adapted for HPC Environments, 2011 IEEE/ACM 12th International Conference on Grid Computing, pp.82-89, 2011.
DOI : 10.1109/Grid.2011.20

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.636.6107

Z. Fadika, E. Dede, J. Hartog, and M. Govindaraju, MARLA: MapReduce for Heterogeneous Clusters, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012), pp.49-56, 2012.
DOI : 10.1109/CCGrid.2012.135

E. Dede, Z. Fadika, J. Hartog, M. Govindaraju, L. Ramakrishnan et al., Marissa: Mapreduce implementation for streaming science applications myhadoop-hadoop-on-demand on traditional hpc resources Ceph: A scalable, high-performance distributed file system, 2012 IEEE 8th International Conference on Proceedings of the 7th symposium on Operating systems design and implemen-tation, pages 307 to 320. USENIX Association, 2006.

W. Felter, A. Ferreira, R. Rajamony, and J. Rubio, An Updated Performance Comparison of Virtual Machines and Linux Containers, IBM Research Division, 2014.