G. Ananthanarayanan, A. Ghodsi, S. Shenker, and I. Stoica, Effective straggler mitigation: Attack of the clones, Proceedings of the 10th USENIX Conference on Networked Systems Design and Implementation, nsdi'13, pp.185-198, 2013.

G. Ananthanarayanan, M. C. Hung, X. Ren, I. Stoica, A. Wierman et al., Grass: Trimming stragglers in approximation analytics, Proceedings of the 11th USENIX Conference on Networked Systems Design and Implementation, NSDI'14, pp.289-302, 2014.

T. J. Hacker, B. D. Noble, and B. D. Athey, Adaptive data block scheduling for parallel TCP streams, HPDC-14. Proceedings. 14th IEEE International Symposium on High Performance Distributed Computing, 2005., pp.265-275, 2005.
DOI : 10.1109/HPDC.2005.1520970

G. Khanna, U. Catalyurek, T. Kurc, R. Kettimuthu, P. Sadayappan et al., Using overlays for efficient data transfer over shared wide-area networks, 2008 SC, International Conference for High Performance Computing, Networking, Storage and Analysis, pp.1-4712, 2008.
DOI : 10.1109/SC.2008.5213292

T. Kosar, E. Arslan, B. Ross, and B. Zhang, StorkCloud, Proceedings of the 4th ACM workshop on Scientific cloud computing, Science Cloud '13, pp.29-36, 2013.
DOI : 10.1145/2465848.2465855

Y. Kwon, M. Balazinska, B. Howe, and J. Rolia, SkewTune, Proceedings of the 2012 international conference on Management of Data, SIGMOD '12, pp.25-36
DOI : 10.1145/2213836.2213840

A. Lakshman and P. Malik, Cassandra, ACM SIGOPS Operating Systems Review, vol.44, issue.2, pp.35-40, 2010.
DOI : 10.1145/1773912.1773922

N. Laoutaris, M. Sirivianos, X. Yang, and P. Rodriguez, Inter-datacenter bulk transfers with netstitcher, ACM SIGCOMM Computer Communication Review, vol.41, issue.4, pp.74-85, 2011.
DOI : 10.1145/2043164.2018446

H. Li, A. Ghodsi, M. Zaharia, S. Shenker, and I. Stoica, Tachyon, Proceedings of the ACM Symposium on Cloud Computing, SOCC '14, pp.1-6, 2014.
DOI : 10.1145/2670979.2670985

W. Liu, B. Tieman, R. Kettimuthu, and I. Foster, A data transfer framework for large-scale science experiments, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, HPDC '10, pp.717-724, 2010.
DOI : 10.1145/1851476.1851582

K. Ousterhout, R. Rasti, S. Ratnasamy, S. Shenker, and B. Chun, Making sense of performance in data analytics frameworks, Proceedings of the 12th USENIX Conference on Networked Systems Design and Implementation, NSDI'15, pp.293-307, 2015.

L. Pineda-morales, A. Costan, and G. Antoniu, Towards Multi-site Metadata Management for Geographically Distributed Cloud Workflows, 2015 IEEE International Conference on Cluster Computing, pp.1-10, 2015.
DOI : 10.1109/CLUSTER.2015.49

URL : https://hal.archives-ouvertes.fr/hal-01239150

C. Raiciu, C. Pluntke, S. Barre, A. Greenhalgh, D. Wischik et al., Data center networking with multipath TCP, Proceedings of the Ninth ACM SIGCOMM Workshop on Hot Topics in Networks, Hotnets '10, pp.1-10, 2010.
DOI : 10.1145/1868447.1868457

A. Thomson and D. J. Abadi, Calvinfs: Consistent wan replication and scalable metadata management for distributed file systems, Proceedings of the 13th USENIX Conference on File and Storage Technologies, FAST'15, pp.1-14

R. Tudoran, A. Costan, G. Antoniu, and H. Soncu, TomusBlobs: Towards Communication-Efficient Storage for MapReduce Applications in Azure, 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (ccgrid 2012), pp.427-434, 2012.
DOI : 10.1109/CCGrid.2012.104

URL : https://hal.archives-ouvertes.fr/hal-00670725

R. Tudoran, O. Nano, I. Santos, A. Costan, H. Soncu et al., JetStream, Proceedings of the 8th ACM International Conference on Distributed Event-Based Systems, DEBS '14, pp.23-34, 2014.
DOI : 10.1145/2611286.2611298

URL : https://hal.archives-ouvertes.fr/hal-01090281

T. White, Hadoop: The Definitive Guide, 2010.

E. Yildirim and T. Kosar, Network-aware end-to-end data throughput optimization, Proceedings of the first international workshop on Network-aware data management, NDM '11, pp.21-30, 2011.
DOI : 10.1145/2110217.2110221

M. Zaharia, M. Chowdhury, T. Das, A. Dave, J. Ma et al., Resilient Distributed Datasets, Proceedings of the 9th USENIX Conference on Networked Systems Design and Implementation, NSDI'12, pp.2-2, 2012.
DOI : 10.1145/2886107.2886110

M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica, Spark: cluster computing with working sets, HotCloud'10, pp.10-10, 2010.