T. Akidau, A. Balikov, K. Bekiroglu, S. Chernyak, J. Haberman et al., MillWheel, Very Large Data Bases, pp.734-746, 2013.
DOI : 10.14778/2536222.2536229

T. Akidau, R. Bradshaw, C. Chambers, S. Chernyak, R. J. Fernández-Moctezuma et al., The dataflow model, Proceedings of the VLDB Endowment, pp.1792-1803, 2015.
DOI : 10.14778/2824032.2824076

L. Cao, M. Wei, D. Yang, and E. A. Rundensteiner, Online Outlier Exploration Over Large Datasets, Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD '15, pp.89-98, 2015.
DOI : 10.1145/2783258.2783387

P. Carbone, J. Traub, A. Katsifodimos, S. Haridi, and V. Markl, Cutty, Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, CIKM '16, pp.1201-1210, 2016.
DOI : 10.1145/2983323.2983807

J. Dean and S. Ghemawat, MapReduce, 6th Conference on Symposium on Operating Systems Design & Implementation, pp.1-10, 2004.
DOI : 10.1145/1327452.1327492

M. A. Hammad, W. G. Aref, and A. K. Elmagarmid, Query processing of multi-way stream window joins, The VLDB Journal, vol.17, issue.3, pp.469-488, 2008.
DOI : 10.1007/s00778-006-0017-y

L. Neumeyer, B. Robbins, A. Kesari, and A. Nair, S4: Distributed Stream Computing Platform, 2010 IEEE International Conference on Data Mining Workshops, pp.170-177, 2010.
DOI : 10.1109/ICDMW.2010.172

B. Nicolae, C. Costa, C. Misale, K. Katrinis, and Y. Park, Leveraging Adaptive I/O to Optimize Collective Data Shuffling Patterns for Big Data Analytics, IEEE Transactions on Parallel and Distributed Systems, 2017.
DOI : 10.1109/TPDS.2016.2627558

B. Nicolae, A. Kochut, and A. Karve, Towards scalable on-demand collective data access in IaaS clouds: An adaptive collaborative content exchange proposal, Journal of Parallel and Distributed Computing, vol.87, pp.67-79, 2016.
DOI : 10.1016/j.jpdc.2015.09.006

URL : https://hal.archives-ouvertes.fr/hal-01355213

T. Hey, S. Tansley, and K. Tolle, The Fourth Paradigm: Data-Intensive Scientific Discovery, 2009.
DOI : 10.1007/978-3-642-33299-9_1

R. Tudoran, A. Costan, O. Nano, I. Santos, H. Soncu et al., JetStream: Enabling high throughput live event streaming on multi-site clouds, Future Generation Computer Systems, vol.54, pp.274-291, 2016.
DOI : 10.1016/j.future.2015.01.016

URL : https://hal.archives-ouvertes.fr/hal-01239124

D. Yang, E. A. Rundensteiner, and M. O. Ward, Shared execution strategy for neighbor-based pattern mining requests over streaming windows, ACM Transactions on Database Systems, vol.37, issue.1, pp.5:1-5:44, 2012.
DOI : 10.1145/2109196.2109201

M. Zaharia, M. Chowdhury, T. Das, A. Dave, J. Ma et al., Resilient Distributed Datasets, NSDI'12: The 9th USENIX Symposium on Networked Systems Design and Implementation, 2012.
DOI : 10.1145/2886107.2886110

M. Zaharia, T. Das, H. Li, S. Shenker, and I. Stoica, Discretized streams: An efficient and fault-tolerant model for stream processing on large clusters, HotCloud'12: 4th USENIX Conference on Hot Topics in Cloud Computing, 2012.