M. J. Eppler and J. Mengis, The Concept of Information Overload: A Review of Literature from Organization Science, Accounting, Marketing, MIS, and Related Disciplines, The Information Society, vol.45, issue.5, pp.325-344, 2004.
DOI : 10.2307/3151298

D. Cosley, S. K. Lam, I. Albert, J. A. Konstan, and J. Riedl, Is seeing believing?, Proceedings of the conference on Human factors in computing systems , CHI '03, pp.585-592, 2003.
DOI : 10.1145/642611.642713

R. Burke, Hybrid recommender systems: Survey and experiments. User modeling and user-adapted interaction, pp.331-370, 2002.

M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica, Spark: Cluster computing with working sets, pp.10-10, 2010.

R. Burke, Hybrid Systems for Personalized Recommendations, pp.133-152, 2005.
DOI : 10.1145/223904.223931

D. Goldberg, D. Nichols, B. M. Oki, and D. Terry, Using collaborative filtering to weave an information tapestry, Communications of the ACM, vol.35, issue.12, pp.61-70, 1992.
DOI : 10.1145/138859.138867

U. Shardanand and P. Maes, Social information filtering, Proceedings of the SIGCHI conference on Human factors in computing systems, CHI '95, pp.210-217, 1995.
DOI : 10.1145/223904.223931

P. Resnick, N. Iacovou, M. Suchak, P. Bergstrom, and J. Riedl, GroupLens, Proceedings of the 1994 ACM conference on Computer supported cooperative work , CSCW '94, pp.175-186, 1994.
DOI : 10.1145/192844.192905

M. J. Pazzani, A framework for collaborative, content-based and demographic filtering, Artificial Intelligence Review, vol.13, pp.5-6, 1999.

M. Armbrust, R. S. Xin, C. Lian, Y. Huai, D. Liu et al., Spark SQL, Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data, SIGMOD '15, pp.1383-1394, 2015.
DOI : 10.1007/3-540-59451-5_2

M. Zaharia, T. Das, H. Li, S. Shenker, and I. Stoica, Discretized streams: an efficient and fault-tolerant model for stream processing on large clusters, 2012.
DOI : 10.21236/ada575859

URL : http://www.eecs.berkeley.edu/Pubs/TechRpts/2012/EECS-2012-259.pdf

R. S. Xin, J. E. Gonzalez, M. J. Franklin, and I. Stoica, GraphX, First International Workshop on Graph Data Management Experiences and Systems, GRADES '13, 2013.
DOI : 10.1145/2484425.2484427

M. Zaharia, M. Chowdhury, T. Das, A. Dave, J. Ma et al., Resilient Distributed Datasets, Proceedings of the 9th USENIX conference on Networked Systems Design and Implementation, USENIX Association, pp.2-2, 2012.
DOI : 10.1145/2886107.2886110

M. Borneas, On a Generalization of the Lagrange Function, American Journal of Physics, vol.27, issue.4, pp.265-267, 1959.
DOI : 10.1119/1.1934822

N. J. Mitra and A. Nguyen, Estimating surface normals in noisy point cloud data, Proceedings of the nineteenth annual symposium on Computational geometry, pp.322-328, 2003.
DOI : 10.1145/777792.777840

D. Borthakur, The hadoop distributed file system: Architecture and design, Hadoop Project Website, vol.11, issue.21, 2007.

D. W. Zhang, F. Q. Sun, X. Cheng, and C. Liu, Research on hadoop-based enterprise file cloud storage system, 3rd International Conference on, pp.434-437, 2011.

J. Han, E. Haihong, G. Le, and J. Du, Survey on nosql database, Pervasive computing and applications (ICPCA) 6th international conference on, pp.363-366, 2011.

A. Thusoo, J. S. Sarma, N. Jain, Z. Shao, P. Chakka et al., Hive, Proceedings of the VLDB Endowment, pp.1626-1629, 2009.
DOI : 10.14778/1687553.1687609

H. Tsukimoto, Logical regression analysis: from mathematical formulas to linguistic rules. In: Foundations and Advances in Data Mining, pp.21-61, 2005.
DOI : 10.1007/11362197_2

C. J. Willmott and K. Matsuura, Advantages of the mean absolute error (MAE) over the root mean square error (RMSE) in assessing average model performance, Climate Research, vol.30, issue.1, p.79, 2005.
DOI : 10.3354/cr030079