A. Benoit, L. Canon, and L. Marchal, Non-clairvoyant reduction algorithms for heterogeneous platforms, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00926093

A. Benoit, F. Dufossé, M. Gallet, Y. Robert, and B. Gaujal, Computing the throughput of probabilistic and replicated streaming applications, Proc. of SPAA, Symp. on Parallelism in Algorithms and Architectures, pp.166-175, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00555890

L. Canon and G. Antoniu, Scheduling associative reductions with homogeneous costs when overlapping communications and computations, 20th Annual International Conference on High Performance Computing, 2012.
DOI : 10.1109/HiPC.2013.6799124

URL : https://hal.archives-ouvertes.fr/hal-00675964

L. Canon and E. Jeannot, Evaluation and Optimization of the Robustness of DAG Schedules in Heterogeneous Environments, IEEE Transactions on Parallel and Distributed Systems, vol.21, issue.4, pp.532-546, 2010.
DOI : 10.1109/TPDS.2009.84

URL : https://hal.archives-ouvertes.fr/inria-00430920

L. Canon, E. Jeannot, R. Sakellariou, and W. Zheng, Comparative Evaluation Of The Robustness Of DAG Scheduling Heuristics, Proceedings of CoreGRID Integration Workshop, 2008.
DOI : 10.1007/978-0-387-09457-1_7

URL : https://hal.archives-ouvertes.fr/inria-00333903

T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein, Introduction to Algorithms, 2009.

J. Dean and S. Ghemawat, MapReduce, Communications of the ACM, vol.51, issue.1, pp.107-113, 2008.
DOI : 10.1145/1327452.1327492

D. Feitelson, Workload modeling for computer systems performance evaluation. Book Draft, Version 0, 2013.

A. Legrand, L. Marchal, and Y. Robert, Optimizing the steady-state throughput of scatter and reduce operations on heterogeneous platforms, Journal of Parallel and Distributed Computing, issue.12, pp.651497-1514, 2005.
URL : https://hal.archives-ouvertes.fr/hal-00789425

P. Liu, M. Kuo, and D. Wang, An Approximation Algorithm and Dynamic Programming for Reduction in Heterogeneous Environments, Algorithmica, vol.33, issue.4, pp.425-453, 2009.
DOI : 10.1007/s00453-007-9113-7

U. Lublin and D. G. Feitelson, The workload on parallel supercomputers: modeling the characteristics of rigid jobs, Journal of Parallel and Distributed Computing, vol.63, issue.11, pp.1105-1122, 2003.
DOI : 10.1016/S0743-7315(03)00108-4

J. Pjesivac-grbovic, T. Angskun, G. Bosilca, G. Fagg, E. Gabriel et al., Performance Analysis of MPI Collective Operations, 19th IEEE International Parallel and Distributed Processing Symposium, 2005.
DOI : 10.1109/IPDPS.2005.335

R. Rabenseifner, Optimization of Collective Reduction Operations, Computational Science -ICCS 2004, pp.1-9, 2004.
DOI : 10.1007/978-3-540-24685-5_1

R. Thakur, R. Rabenseifner, and W. Gropp, Optimization of Collective Communication Operations in MPICH, International Journal of High Performance Computing Applications, vol.19, issue.1, pp.49-66, 2005.
DOI : 10.1177/1094342005051521

M. Zaharia, A. Konwinski, A. Joseph, R. Katz, and I. Stoica, Improving mapreduce performance in heterogeneous environments, Proc. of the 8th USENIX conf. on Operating systems design and implementation, pp.29-42, 2008.