J. Barbosa, J. Tavares, and A. J. Padilha, Linear algebra algorithms in a heterogeneous cluster of personal computers, 9th Heterogeneous Computing Workshop (HCW'2000), pp.147-159, 2000.

O. Beaumont, V. Boudet, A. Petitet, F. Rastello, and Y. Robert, A proposal for a heterogeneous cluster ScaLAPACK (dense linear solvers), IEEE Trans. Computers, vol.50, issue.10, pp.1052-1070, 2001.
URL : https://hal.archives-ouvertes.fr/hal-00808287

O. Beaumont, V. Boudet, F. Rastello, and Y. Robert, Matrix multiplication on heterogeneous platforms, IEEE Trans. Parallel Distributed Systems, vol.12, issue.10, pp.1033-1051, 2001.
URL : https://hal.archives-ouvertes.fr/hal-00808288

A. Bevilacqua, A dynamic load balancing method on a heterogeneous cluster of workstations, Informatica, vol.23, issue.1, pp.49-56, 1999.

L. S. Blackford, J. Choi, A. Cleary, E. Azevedo, J. Demmel et al., ScaLAPACK Users' Guide. SIAM, 1997.

A. Bourgeade and B. Nkonga, Dynamic load balancing computation of pulses propagating in a nonlinear medium, The Journal of Supercomputing, vol.28, issue.3, pp.279-294, 2004.

R. P. Brent, The LINPACK Benchmark on the AP1000: Preliminary Report, CAP Workshop 91. Australian National University, 1991.

L. Brunie, A. Flory, and H. Kosch, New static scheduling and elastic load balancing methods for parallel query processing, Basque International Workshop on Information Technology BIWIT, 1995.

R. Buyya, High Performance Cluster Computing, Architecture and Systems, vol.1, 1999.

K. L. Calvert, M. B. Doar, and E. W. Zegura, Modeling internet topology, IEEE Communications Magazine, vol.35, issue.6, pp.160-163, 1997.

C. H. Hsu, Y. Chung, D. Yang, and C. Dow, A generalized processor mapping technique for array redistribution, IEEE Trans. Parallel Distributed Systems, vol.12, issue.7, pp.743-757, 2001.

P. E. Crandall and M. J. Quinn, Block data decomposition for data-parallel programming on a heterogeneous workstation network, 2nd International Symposium on High Performance Distributed Computing, pp.42-49, 1993.

E. Deelman and B. Szymanski, Dynamic load balancing in parallel discrete event simulation for spatially explicit problems, PADS'98, 12th Workshop on Parallel and Distributed Simulation, pp.46-53, 1998.

F. Desprez, J. Dongarra, A. Petitet, C. Randriamaro, and Y. Robert, Scheduling blockcyclic array redistribution, IEEE Trans. Parallel Distributed Systems, vol.9, issue.2, pp.192-205, 1998.
URL : https://hal.archives-ouvertes.fr/hal-00856854

M. Doar, A better model for generating test networks, Proceedings of Globecom '96, 1996.

A. B. Downey, Using pathchar to estimate internet link characteristics, Measurement and Modeling of Computer Systems, pp.222-223, 1999.

J. E. Flaherty, R. M. Loy, C. Ozturan, M. S. Shephard, B. K. Szymanski et al., Parallel structures and dynamic load balancing for adaptive finite element computation, Applied Numerical Mathematics, vol.26, issue.1-2, pp.241-263, 1997.

J. E. Flaherty, R. M. Loy, M. S. Shephard, B. K. Szymanski, J. D. Teresco et al., Adaptive local refinement with octree load balancing for the parallel solution of three-dimensional conservation laws, J. Parallel and Distributed Computing, vol.47, issue.2, pp.139-152, 1997.

J. Garcia, E. Ayguadé, and J. Labarta, A framework for integrating data alignment, distribution, and redistribution in distributed memory multiprocessors, IEEE Trans. Parallel Distributed Systems, vol.12, issue.4, pp.416-431, 2001.

M. Hamdi and C. Lee, Dynamic load balancing of data parallel applications on a distributed network, 9th International Conference on Supercomputing ICS'95, pp.170-179, 1995.

Y. Hu and R. Blake, Load balancing for unstructured mesh applications, Parallel and Distributed Computing Practices, vol.2, 1999.

M. Kaddoura, S. Ranka, and A. Wang, Array decomposition for nonuniform computational environments, Journal of Parallel and Distributed Computing, vol.36, pp.91-105, 1996.

E. T. Kalns and L. M. Ni, Processor mapping techniques towards efficient data redistribution, IEEE Trans. Parallel Distributed Systems, vol.6, issue.12, p.5207, 1995.

J. Knoop and E. Mehofer, Distribution assignment placement: effective optimization of redistribution costs, IEEE Trans. Parallel Distributed Systems, vol.13, issue.6, pp.628-647, 2002.

C. H. Koelbel, D. B. Loveman, R. S. Schreiber, G. L. , and M. E. Zosel, The High Performance Fortran Handbook, 1994.

U. Kremer, NP-Completeness of dynamic remapping, Proceedings of the Fourth Workshop on Compilers for Parallel Computers, 1993.

Z. Lan, V. Taylor, and G. Bryan, Dynamic load balancing of samr applications on distributed systems, Proceedings of the ACM/IEEE Symposium on Supercomputing (SC'01), 2001.

C. Lee and M. Hamdi, Parallel image processing applications on a network of workstations, Parallel Computing, vol.21, pp.137-160, 1995.

A. Legrand, L. Marchal, and H. Casanova, Scheduling Distributed Applications: The SimGrid Simulation Framework, Proceedings of the Third IEEE International Symposium on Cluster Computing and the Grid (CCGrid'03), 2003.
URL : https://hal.archives-ouvertes.fr/hal-00789451

S. Miguet and Y. Robert, Elastic load balancing for image processing algorithms, Parallel Computation, vol.591, pp.438-451, 1992.
URL : https://hal.archives-ouvertes.fr/hal-00857073

D. Nicol and J. P. Reynolds, Optimal dynamic remapping of data parallel computations, IEEE Trans. Computers, vol.39, issue.2, pp.206-219, 1990.

D. Nicol and J. Saltz, Dynamic remapping of parallel computations with varying resource demands, IEEE Trans. Computers, vol.37, issue.9, pp.1073-1087, 1988.

N. Park, V. Prasanna, and C. Raghavendra, A framework for integrating data alignment, distribution, and redistribution in distributed memory multiprocessors, IEEE Trans. Parallel Distributed Systems, vol.10, issue.12, pp.1217-1240, 1999.

L. Prylli and B. Tourancheau, Fast runtime block-cyclic data redistribution on multiprocessors, J. Parallel Distributed Computing, vol.45, pp.63-72, 1997.

D. Sarrut and S. Miguet, ARAMIS: a remote access medical imaging system, ISCOPE'99, 3rd International Symposium on Computing in Object-Oriented Parallel Environments, vol.1732, 1999.

K. Schloegel, G. Karypis, and V. Kumar, Multilevel diffusion schemes for repartitioning of adaptive meshes, vol.47, pp.109-124, 1997.

K. Schloegel, G. Karypis, and V. Kumar, A unified algorithm for load-balancing adaptive scientific simulations, Proceedings of the ACM/IEEE Symposium on Supercomputing (SC'00), 2000.

B. A. Shirazi, A. R. Hurson, and K. M. Kavi, Scheduling and load balancing in parallel and distributed systems, 1995.

R. Thakur, A. Choudhary, and J. Ramanujam, Efficient algorithms for array redistribution, IEEE Trans. Parallel and Distributed Systems, vol.7, issue.6, pp.587-594, 1996.

J. Watts and S. Taylor, A practical approach to dynamic load balancing, IEEE Trans. Parallel and Distributed Systems, vol.9, issue.93, pp.235-248, 1998.

M. Wu, On runtime parallel scheduling for processor load balancing, IEEE Trans. Parallel and Distributed Systems, vol.8, issue.2, pp.173-186, 1997.

, Unité de recherche INRIA Rhône-Alpes 655, avenue de l'Europe -38334 Montbonnot

. Unité-de-recherche-inria-futurs, Parc Club Orsay Université -ZAC des Vignes 4, rue Jacques Monod -91893 ORSAY Cedex

. Unité-de-recherche-inria-lorraine, LORIA, Technopôle de Nancy-Brabois -Campus scientifique 615, rue du Jardin Botanique -BP 101 -54602