N. T. Karonis, B. Supinski, I. Foster, W. Gropp, E. Lusk et al., Exploiting hierarchy in parallel computer networks to optimize collective operation performance, Proceedings 14th International Parallel and Distributed Processing Symposium. IPDPS 2000, pp.377-384, 2000.
DOI : 10.1109/IPDPS.2000.846009

T. Kielmann, R. Hofman, H. Bal, A. Plaat, and R. Bhoedjang, MagPIe: MPI's collective communication operations for clustered wide area systems, Proceedings of the 7th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, pp.131-140, 1999.

T. Kielmann, H. Bal, S. Gorlatch, K. Verstoep, and R. Hofman, Network performance-aware collective communication for clustered wide-area systems, Parallel Computing, vol.27, issue.11, pp.1431-1456, 2001.
DOI : 10.1016/S0167-8191(01)00098-9

O. Hartmann, M. Kühnemann, T. Rauber, and G. Rünger, Adaptive selection of communication methods to optimize collective MPI operations, Proceedings of the 12th Workshop on Compilers for Parallel Computers (CPC'06), 2006.

F. Cappello, P. Fraigniaud, B. Mans, and A. Rosenberg, An Algorithmic Model for Heterogeneous Hyper-Clusters: Rationale and Experience, International Journal of Foundations of Computer Science, vol.16, issue.02, pp.195-215, 2005.
DOI : 10.1142/S0129054105002942

A. Bar-Noy and S. Kipnis, Designing broadcasting algorithms in the postal model for message-passing systems, Mathematical Systems Theory, vol.83, issue.9, pp.431-452, 1994.
DOI : 10.1007/BF01184933

R. Hockney, The communication challenge for MPP: Intel Paragon and Meiko CS-2, Parallel Computing, vol.20, issue.3, pp.389-398, 1994.
DOI : 10.1016/S0167-8191(06)80021-9

M. Clement, M. Steed, and P. Crandall, Network performance modeling for PVM clusters, Proceedings of Supercomputing, 1996.

D. Culler, R. Karp, D. Patterson, A. Sahay, K. E. Schauser et al., LogP: a practical model of parallel computation, Communications of the ACM, vol.39, issue.11, pp.78-85, 1996.
DOI : 10.1145/240455.240477

A. Alexandrov, M. Ionescu, K. Schauser, and C. Scheiman, LogGP, Proceedings of the seventh annual ACM symposium on Parallel algorithms and architectures, SPAA '95, 1995.
DOI : 10.1145/215399.215427

C. A. Moritz and M. I. Frank, LoGPC: Modeling network contention in message-passing programs, IEEE Transactions on Parallel and Distributed Systems, vol.12, issue.4, pp.404-415, 2001.
DOI : 10.1109/71.920589

R. Wolski, N. Spring, and C. Peterson, Implementing a performance forecasting system for metacomputing, Proceedings of the 1997 ACM/IEEE conference on Supercomputing (CDROM), Supercomputing '97, 1997.
DOI : 10.1145/509593.509600

P. Dinda, T. Gross, R. Karrer, B. Lowekamp, N. Miller et al., The architecture of the Remos system, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing, 2001.
DOI : 10.1109/HPDC.2001.945194

M. Burger, T. Kielmann, and H. Bal, TopoMon: A Monitoring Tool for Grid Network Topology, Proceedings of the International Conference on Computational Science'02, pp.558-567, 2002.
DOI : 10.1007/3-540-46080-2_58

O. Beaumont, L. Marchal, and Y. Robert, Broadcast Trees for Heterogeneous Platforms, 19th IEEE International Parallel and Distributed Processing Symposium, 2005.
DOI : 10.1109/IPDPS.2005.131
URL : https://hal.archives-ouvertes.fr/hal-00807277

B. Lowekamp, Discovery and Application of Network Information, 2000.

M. Barnett, D. Payne, R. van de Geijn, and J. Watts, Broadcasting on Meshes with Wormhole Routing, Journal of Parallel and Distributed Computing, vol.35, issue.2, pp.111-122, 1996.
DOI : 10.1006/jpdc.1996.0074

R. Thakur, R. Rabenseifner, and W. Gropp, Optimization of Collective Communication Operations in MPICH, International Journal of High Performance Computing Applications, vol.19, issue.1, pp.49-66, 2005.
DOI : 10.1177/1094342005051521

R. Thakur and W. Gropp, Improving the Performance of Collective Operations in MPICH, Proceedings of the Euro PVM/MPI 2003, pp.257-267, 2003.
DOI : 10.1007/978-3-540-39924-7_38

M. Banikazemi, V. Moorthy, and D. K. Panda, Efficient collective communication on heterogeneous networks of workstations, Proceedings. 1998 International Conference on Parallel Processing (Cat. No.98EX205), pp.460-467, 1998.
DOI : 10.1109/ICPP.1998.708518

P. B. Bhat, C. Raghavendra, and V. Prasanna, Efficient collective communication in distributed heterogeneous systems, Journal of Parallel and Distributed Computing, vol.63, pp.251-279, 2003.

P. Liu, D. Wang, and Y. Guo, An Approximation Algorithm for Broadcast Scheduling in Heterogeneous Clusters, Proceedings of the 9th International Conference on Real-Time and Embedded Computing Systems and Applications, LNCS 2968, pp.38-52, 2003.
DOI : 10.1007/978-3-540-24686-2_3

J. L. Park, H. Choi, N. Nupairoj, and L. M. Ni, Construction of optimal multicast trees based on the parameterised communication model, Proceedings of the International Conference on Parallel Processing, pp.180-187, 1996.

G. Mateescu, A Method for MPI Broadcast in Computational Grids, 19th IEEE International Parallel and Distributed Processing Symposium, 2005.
DOI : 10.1109/IPDPS.2005.45

T. Vorakosit and P. Uthayopas, Generating an efficient dynamic multicast tree under grid environment, Proceedings of the Euro PVM/MPI 2003, pp.636-643, 2003.

B. Lowekamp and A. Beguelin, ECO: Efficient Collective Operations for communication on heterogeneous networks, Proceedings of International Conference on Parallel Processing, pp.399-405, 1996.
DOI : 10.1109/IPPS.1996.508087

N. T. Karonis, I. Foster, B. Supinski, W. Gropp, E. Lusk et al., A multilevel approach to topology-aware collective operations in computational grids, tech. rep., 2002.

S. Lacour, N. T. Karonis, and I. Foster, MPICH-G2 collective operations performance evaluation, optimizations, tech. rep., 2001.

L. Barchet-Steffenel and G. Mounie, Performance characterisation of intra-cluster collective communications, Proceedings of the 16th Symposium on Computer Architecture and High Performance Computing (SBAC-PAD 2004), Foz do Iguacu, pp.254-261, 2004.

L. Barchet-Steffenel and G. Mounie, Identifying logical homogeneous clusters for efficient wide-area communication, Proceedings of the Euro PVM/MPI 2004, pp.319-326, 2004.
