. Bigsql, http://www.bigsql.org/se/. [Online; accessed 21, 2014.

. Impala, cloudera.com/content/cloudera/en/products-and-services/cdh/ impala.html. [Online; accessed 21, 2014.

S. Benchmark, http://sortbenchmark.org/. [Online; accessed 21, 2014.

. Spark, http://spark.apache.org/. [Online; accessed 21, 2014.

T. Http, Online; accessed 21, 2014.

A. Abouzeid, K. Bajda-pawlikowski, D. Abadi, A. Silberschatz, and A. Rasin, HadoopDB, Proceedings of the VLDB Endowment, pp.922-933, 2009.
DOI : 10.14778/1687627.1687731

F. N. Afrati and J. D. Ullman, Optimizing joins in a map-reduce environment, Proceedings of the 13th International Conference on Extending Database Technology, EDBT '10, pp.99-110, 2010.
DOI : 10.1145/1739041.1739056

F. N. Afrati and J. D. Ullman, Optimizing multiway joins in a map-reduce environment. Knowledge and Data Engineering, IEEE Transactions on, vol.23, issue.9, pp.1282-1298, 2011.

K. Bajda-pawlikowski, D. J. Abadi, A. Silberschatz, and E. Paulson, Efficient processing of data warehousing queries in a split execution environment, Proceedings of the 2011 international conference on Management of data, SIGMOD '11, pp.1165-1176, 2011.
DOI : 10.1145/1989323.1989447

J. Dean and S. Ghemawat, MapReduce, OSDI 14. I. Gartner. Big data challenges, pp.137-150, 2004.
DOI : 10.1145/1327452.1327492

B. He, W. Fang, Q. Luo, N. K. Govindaraju, and T. Wang, Mars, Proceedings of the 17th international conference on Parallel architectures and compilation techniques, PACT '08, pp.260-269, 2008.
DOI : 10.1145/1454115.1454152

Y. He, R. Lee, Y. Huai, Z. Shao, N. Jain et al., RCFile: A fast and space-efficient data placement structure in MapReduce-based warehouse systems, 2011 IEEE 27th International Conference on Data Engineering, pp.1199-1208, 2011.
DOI : 10.1109/ICDE.2011.5767933

S. Huang, J. Huang, Y. Liu, L. Yi, and J. Dai, Hibench: A representative and comprehensive hadoop benchmark suite, Proceedings of the ICDE Workshops, 2010.

Y. Kim and K. Shim, Parallel Top-K Similarity Join Algorithms Using MapReduce, 2012 IEEE 28th International Conference on Data Engineering, pp.510-521, 2012.
DOI : 10.1109/ICDE.2012.87

W. Lu, Y. Shen, S. Chen, and B. C. Ooi, Efficient processing of k nearest neighbor joins using MapReduce, Proceedings of the VLDB Endowment, pp.1016-1027, 2012.
DOI : 10.14778/2336664.2336674

A. Mesmoudi and M. Hacid, A Comparison of Systems to Large-Scale Data Access, Database Systems for Advanced Applications -19th International Conference, pp.161-175, 2014.
DOI : 10.1007/978-3-662-43984-5_12

URL : https://hal.archives-ouvertes.fr/hal-01313176

A. Mesmoudi and M. Hacid, A test framework for large scale declarative queries, Proceedings of the 29th Annual ACM Symposium on Applied Computing, SAC '14, pp.858-859, 2014.
DOI : 10.1145/2554850.2555102

URL : https://hal.archives-ouvertes.fr/hal-01313175

A. Metwally and C. Faloutsos, V-SMART-join, Proceedings of the VLDB Endowment, pp.704-715, 2012.
DOI : 10.14778/2212351.2212353

A. Okcan and M. Riedewald, Processing theta-joins using MapReduce, Proceedings of the 2011 international conference on Management of data, SIGMOD '11, pp.949-960, 2011.
DOI : 10.1145/1989323.1989423

A. Pavlo, E. Paulson, A. Rasin, D. J. Abadi, D. J. Dewitt et al., A comparison of approaches to large-scale data analysis, Proceedings of the 35th SIGMOD international conference on Management of data, SIGMOD '09, pp.165-178, 2009.
DOI : 10.1145/1559845.1559865

S. Rao, R. Ramakrishnan, A. Silberstein, M. Ovsiannikov, and D. Reeves, Sailfish, Proceedings of the Third ACM Symposium on Cloud Computing, SoCC '12, p.4, 2012.
DOI : 10.1145/2391229.2391233

A. Rasmussen, M. Conley, G. Porter, R. Kapoor, and A. Vahdat, Themis, Proceedings of the Third ACM Symposium on Cloud Computing, SoCC '12, p.13, 2012.
DOI : 10.1145/2391229.2391242

R. Shaw, Lsst data challenge handbook. Version2, p.2012

K. Shvachko, H. Kuang, S. Radia, and R. Chansler, The Hadoop Distributed File System, 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST), pp.1-10, 2010.
DOI : 10.1109/MSST.2010.5496972

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=

Y. N. Silva and J. M. Reed, Exploiting MapReduce-based similarity joins, Proceedings of the 2012 international conference on Management of Data, SIGMOD '12, pp.693-696, 2014.
DOI : 10.1145/2213836.2213935

M. Stonebraker, S. Madden, D. J. Abadi, S. Harizopoulos, N. Hachem et al., The end of an architectural era:(it's time for a complete rewrite), Proceedings of the 33rd international conference on Very large data bases, pp.1150-1160, 2007.

A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka et al., Hive - a petabyte scale data warehouse using Hadoop, 2010 IEEE 26th International Conference on Data Engineering (ICDE 2010), pp.996-1005, 2010.
DOI : 10.1109/ICDE.2010.5447738

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=

L. G. Valiant, A bridging model for parallel computation, Communications of the ACM, vol.33, issue.8, pp.103-111, 1990.
DOI : 10.1145/79173.79181

V. K. Vavilapalli, A. C. Murthy, C. Douglas, S. Agarwal, M. Konar et al., Apache Hadoop YARN, Proceedings of the 4th annual Symposium on Cloud Computing, SOCC '13, p.5, 2013.
DOI : 10.1145/2523616.2523633

R. S. Xin, J. Rosen, M. Zaharia, M. J. Franklin, S. Shenker et al., Shark, Proceedings of the 2013 international conference on Management of data, SIGMOD '13, pp.13-24, 2013.
DOI : 10.1145/2463676.2465288

Y. Zhu, J. Zhan, C. Weng, R. Nambiar, J. Zhang et al., BigOP: Generating Comprehensive Big Data Workloads as a Benchmarking Framework, 2014.
DOI : 10.1007/978-3-319-05813-9_32