, Linux Traffic Control, 2006.

. Hdfs-raid and . Wiki, , vol.26, 2011.

. Isa-l-performance and . Report, , vol.26, 2017.

A. Flink, , 2019.

A. Hadoop, , 2019.

A. Spark, , 2019.

, Erasure Code Support in OpenStack Swift, 2019.

, Intel Intelligent Storage Acceleration Library Homepage, 2019.

D. Alistarh, H. Ballani, P. Costa, A. Funnell, J. Benjamin et al., A high-radix, low-latency optical switch for data centers, ACM SIGCOMM Computer Communication Review, vol.45, pp.367-368, 2015.

G. Ananthanarayanan, A. Ghodsi, S. Shenker, and I. Stoica, Disklocality in Datacenter Computing Considered Irrelevant, Proceedings of the 13th USENIX Conference on Hot Topics in Operating Systems, HotOS'13, pp.12-12, 2011.

K. Asanovic and D. Patterson, Firebox: A hardware building block for 2020 warehouse-scale computers, 2014.

H. Ballani, P. Costa, T. Karagiannis, and A. Rowstron, Towards Predictable Datacenter Networks, Proceedings of the ACM SIGCOMM 2011 Conference, SIGCOMM'11, pp.242-253, 2011.

F. Bonomi, R. Milito, J. Zhu, and S. Addepalli, Fog Computing and Its Role in the Internet of Things, Proceedings of the First Edition of the MCC Workshop on Mobile Cloud Computing, MCC'12, pp.13-16, 2012.

Y. Chen, S. Alspaugh, and R. Katz, Interactive Analytical Processing in Big Data Systems: A Cross-industry Study of MapReduce Workloads, Proc. VLDB Endow, vol.5, issue.12, pp.1802-1813, 2012.

J. Darrous, T. Lambert, and S. Ibrahim, On the Importance of container images placement for service provisioning in the Edge, Proceedings of the 28th International Conference on Computer Communications and Networks, ICCCN'19, 2019.

J. Dean and S. Ghemawat, MapReduce: Simplified Data Processing on Large Clusters, Proceedings of the 6th Conference on Symposium on Operating Systems Design & Implementation, vol.6, pp.10-10, 2004.

F. Dinu and T. E. Ng, Understanding the Effects and Implications of Compute Node Related Failures in Hadoop, Proceedings of the 21st International Symposium on High-Performance Parallel and Distributed Computing, HPDC'12, pp.187-198, 2012.

B. Fan, W. Tantisiriroj, L. Xiao, and G. Gibson, DiskReduce: RAID for Data-intensive Scalable Computing, Proceedings of the 4th Annual Workshop on Petascale Data Storage, pp.6-10, 2009.

A. Fikes, Colossus, successor to Google File System, 2010.

S. Ghemawat, H. Gobioff, and S. Leung, The Google File System, Proceedings of the Nineteenth ACM Symposium on Operating Systems Principles, SOSP'03, pp.29-43, 2003.

A. Haeberlen, A. Mislove, and P. Druschel, Glacier: Highly Durable, Decentralized Storage Despite Massive Correlated Failures, Proceedings of the 2nd Conference on Symposium on Networked Systems Design & Implementation, vol.2, pp.143-158, 2005.

K. Hsieh, A. Harlap, N. Vijaykumar, D. Konomis, G. R. Ganger et al., Gaia: Geo-Distributed Machine Learning Approaching LAN Speeds, Proceedings of the 14th USENIX Symposium on Networked Systems Design and Implementation, NSDI'17, pp.629-647, 2017.

P. Hu, S. Dhelim, H. Ning, and T. Qiu, Survey on fog computing: architecture, key technologies, applications and open issues, Journal of network and computer applications, vol.98, pp.27-42, 2017.

C. Huang, H. Simitci, Y. Xu, A. Ogus, B. Calder et al., Erasure Coding in Windows Azure Storage, Proceedings of the 2012 USENIX Conference on Annual Technical Conference, ATC'12, pp.2-2, 2012.

S. Huang, J. Huang, J. Dai, T. Xie, and B. Huang, The HiBench benchmark suite: Characterization of the MapReduce-based data analysis, Proceedings of the IEEE 26th International Conference on Data Engineering Workshops, ICDEW'10, pp.41-51, 2010.

C. Hung, G. Ananthanarayanan, L. Golubchik, M. Yu, and M. Zhang, Wide-area Analytics with Multiple Resources, Proceedings of the Thirteenth EuroSys Conference, vol.18, pp.1-12, 2018.

S. Ibrahim, H. Jin, L. Lu, B. He, G. Antoniu et al., Maestro: Replica-Aware Map Scheduling for MapReduce, Proceedings of the 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, CCGrid'12, pp.435-442, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00670813

H. Jin, S. Ibrahim, L. Qi, H. Cao, S. Wu et al., The MapReduce Programming Model and Implementations, Cloud computing: Principles and Paradigms, pp.373-390, 2011.

K. R. Krish, A. Anwar, and A. R. , Butt. hatS: A Heterogeneity-Aware Tiered Storage for Hadoop, Proceedings of the 14th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, pp.502-511, 2014.

J. Kubiatowicz, D. Bindel, Y. Chen, S. Czerwinski, P. Eaton et al., OceanStore: An Architecture for Global-scale Persistent Storage, SIGPLAN Not, vol.35, issue.11, pp.190-201, 2000.

P. Kumar and H. H. Huang, Falcon: Scaling IO Performance in Multi-SSD Volumes, Proceedings of the USENIX Annual Technical Conference, ATC'17, 2017.

J. Li and B. Li, On Data Parallelism of Erasure Coding in Distributed Storage Systems, Proceedings of the IEEE 37th International Conference on Distributed Computing Systems, ICDCS'17, pp.45-56, 2017.

J. Li and B. Li, Parallelism-Aware Locally Repairable Code for Distributed Storage Systems, Proceedings of the IEEE 38th International Conference on Distributed Computing Systems, ICDCS'18, pp.87-98, 2018.

R. Li, P. P. Lee, and Y. Hu, Degraded-First Scheduling for MapReduce in Erasure-Coded Storage Clusters, Proceedings of the 44th Annual IEEE/IFIP International Conference on Dependable Systems and Networks, DSN'14, pp.419-430, 2014.

X. Lu, N. S. Islam, M. Wasi-ur-rahman, J. Jose, H. Subramoni et al., High-Performance Design of Hadoop RPC with RDMA over InfiniBand, Proceedings of the 42nd International Conference on Parallel Processing, ICPP'13, pp.641-650, 2013.

F. J. Macwilliams and N. J. Sloane, The theory of error-correcting codes, vol.16, 1977.

S. Moon, J. Lee, and Y. S. Kee, Introducing SSDs to the Hadoop MapReduce Framework, Proceedings of the IEEE 7th International Conference on Cloud Computing, CLOUD'14, pp.272-279, 2014.

S. Muralidhar, W. Lloyd, S. Roy, C. Hill, E. Lin et al., F4: Facebook's Warm BLOB Storage System, Proceedings of the 11th USENIX Conference on Operating Systems Design and Implementation, OSDI'14, pp.383-398, 2014.

V. Nitu, B. Teabe, A. Tchana, C. Isci, and D. Hagimont, Welcome to Zombieland: Practical and Energy-efficient Memory Disaggregation in a Datacenter, Proceedings of the Thirteenth EuroSys Conference, EuroSys'18, vol.16, p.12, 2018.

J. Ousterhout, A. Gopalan, A. Gupta, A. Kejriwal, C. Lee et al., The RAMCloud Storage System, ACM Trans. Comput. Syst, vol.33, issue.3, 2015.

K. Rashmi, N. B. Shah, D. Gu, H. Kuang, D. Borthakur et al., Hitchhiker's" Guide to Fast and Efficient Data Reconstruction in Erasure-coded Data Centers, Proceedings of the 2014 ACM Conference on SIGCOMM, SIGCOMM'14, pp.331-342, 2014.

K. V. Rashmi, M. Chowdhury, J. Kosaian, I. Stoica, and K. Ramchandran, EC-cache: Load-balanced, Low-latency Cluster Caching with Online Erasure Coding, Proceedings of the 12th USENIX Conference on Operating Systems Design and Implementation, OSDI'16, pp.401-417, 2016.

I. Reed and G. Solomon, Polynomial codes over certain finite fields, Journal of the Society of Industrial and Applied Mathematics, vol.8, issue.2, pp.300-304, 1960.

R. Rodrigues and B. Liskov, High Availability in DHTs: Erasure Coding vs. Replication, Proceedings of the 4th International Conference on Peer-to-Peer Systems, IPTPS'05, pp.226-239, 2005.

M. Sathiamoorthy, M. Asteris, D. Papailiopoulos, A. G. Dimakis, R. Vadali et al., XORing Elephants: Novel Erasure Codes for Big Data, Proceedings of the 39th international conference on Very Large Data Bases, PVLDB'13, pp.325-336, 2013.

J. Shi, Y. Qiu, U. F. Minhas, L. Jiao, C. Wang et al., Clash of the Titans: MapReduce vs. Spark for Large Scale Data Analytics. Proc. VLDB Endow, vol.8, 2015.

W. Shi, J. Cao, Q. Zhang, Y. Li, and L. Xu, Edge computing: Vision and challenges, IEEE Internet of Things Journal, vol.3, issue.5, pp.637-646, 2016.

K. Shvachko, H. Kuang, S. Radia, and R. Chansler, The Hadoop Distributed File System, Proceedings of the IEEE 26th Symposium on Mass Storage Systems and Technologies, MSST'10, pp.1-10, 2010.

V. K. Vavilapalli, A. C. Murthy, C. Douglas, S. Agarwal, M. Konar et al., Apache Hadoop YARN: Yet Another Resource Negotiator, Proceedings of the 4th Annual Symposium on Cloud Computing, SOCC'13, vol.5, pp.1-5, 2013.

Y. Wang, X. Que, W. Yu, D. Goldenberg, and D. Sehgal, Hadoop Acceleration Through Network Levitated Merge, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC'11, vol.57, pp.1-57, 2011.

G. Yadgar and M. Gabel, Avoiding the Streetlight Effect: I/O Workload Analysis with SSDs in Mind, Proceedings of the 8th USENIX Conference on Hot Topics in Storage and File Systems, HotStorage'16, pp.36-40, 2016.

O. Yildiz, S. Ibrahim, T. A. Phuong, and G. Antoniu, Chronos: Failureaware scheduling in shared Hadoop clusters, Proceedings of the IEEE International Conference on Big Data, Big Data'15, pp.313-318, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01203001

Y. Yu, R. Huang, W. Wang, J. Zhang, and K. B. Letaief, SP-cache: Loadbalanced, Redundancy-free Cluster Caching with Selective Partition, Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, SC'18, vol.1, pp.1-1, 2018.

M. Zaharia, D. Borthakur, J. Sarma, K. Elmeleegy, S. Shenker et al., Delay Scheduling: A Simple Technique for Achieving Locality and Fairness in Cluster Scheduling, Proceedings of the 5th European Conference on Computer Systems, EuroSys'10, pp.265-278, 2010.

M. Zaharia, M. Chowdhury, M. J. Franklin, S. Shenker, and I. Stoica, Spark: Cluster Computing with Working Sets, Proceedings of the 2Nd USENIX Conference on Hot Topics in Cloud Computing, HotCloud'10, pp.10-10, 2010.

H. Zhang, M. Dong, and H. Chen, Efficient and Available In-memory KV-Store with Hybrid Erasure Coding and Replication, Proceedings of the 14th USENIX Conference on File and Storage Technologies, FAST'16, 2016.

Z. Zhang, A. Deshpande, X. Ma, E. Thereska, and D. Narayanan, Does erasure coding have a role to play in my data center, 2010.

Z. Zhang, A. Wang, K. Zheng, U. M. , and V. B. , Introduction to HDFS Erasure Coding in Apache Hadoop, 2015.