L. Hollermann, T. S. Hsu, D. R. Lopez, and K. Vertanen, Scheduling problems in a practical allocation model, Journal of Combinatorial Optimization, vol.1, issue.2, pp.129-149, 1997.
DOI : 10.1023/A:1009799631608

T. S. Hsu, J. C. Lee, D. R. Lopez, and W. A. Royce, Task allocation on a network of processors, IEEE Trans. Computers, vol.49, issue.12, pp.1339-1353, 2000.

M. G. Norman and P. Thanisch, Models of machines and computation for mapping in multicomputers, ACM Computing Surveys, vol.25, issue.3, pp.103-117, 1993.
DOI : 10.1145/158439.158908

B. A. Shirazi, A. R. Hurson, and K. M. Kavi, Scheduling and load balancing in parallel and distributed systems, 1995.

H. El-rewini, H. H. Ali, and T. G. Lewis, Task scheduling in multiprocessing systems, Computer, vol.28, issue.12, pp.27-37, 1995.
DOI : 10.1109/2.476197

O. Beaumont, V. Boudet, and Y. Robert, A realistic model and an efficient heuristic for scheduling with heterogeneous processors, Proceedings 16th International Parallel and Distributed Processing Symposium, 2002.
DOI : 10.1109/IPDPS.2002.1015663

URL : https://hal.archives-ouvertes.fr/hal-00807411

P. Bhat, C. Raghavendra, and V. Prasanna, Efficient collective communication in distributed heterogeneous systems, Journal of Parallel and Distributed Computing, vol.63, issue.3, pp.251-263, 2003.
DOI : 10.1016/S0743-7315(03)00008-X

P. Rivera-vega, R. Varadarajan, and S. Navathe, Scheduling data redistribution in distributed databases, [1990] Proceedings. Sixth International Conference on Data Engineering, pp.166-173, 1990.
DOI : 10.1109/ICDE.1990.113466

Y. Kim, Data migration to minimize the total completion time, Journal of Algorithms, vol.55, issue.1, pp.42-57, 2005.
DOI : 10.1016/j.jalgor.2004.07.009

E. G. Coffman, M. R. Garey, D. S. Johnson, and A. S. Lapaugh, Scheduling File Transfers, SIAM Journal on Computing, vol.14, issue.3, pp.744-780, 1985.
DOI : 10.1137/0214054

E. Anderson, J. Hall, J. Hartline, M. Hobbes, A. Karlin et al., Algorithms for Data Migration, Algorithmica, vol.3, issue.3, pp.349-380, 2010.
DOI : 10.1007/s00453-008-9214-y

C. H. Koelbel, D. B. Loveman, R. S. Schreiber, G. L. Jr, and M. E. , The High Performance Fortran Handbook, Computers in Physics, vol.8, issue.4, 1994.
DOI : 10.1063/1.4823319

J. J. Dongarra and D. W. Walker, Software Libraries for Linear Algebra Computations on High Performance Computers, SIAM Review, vol.37, issue.2, pp.151-180, 1995.
DOI : 10.1137/1037042

E. T. Kalns and L. M. Ni, Processor mapping techniques toward efficient data redistribution, IEEE Transactions on Parallel and Distributed Systems, vol.6, issue.12, pp.1234-1247, 1995.
DOI : 10.1109/71.476166

D. W. Walker and S. W. Otto, Redistribution of block???cyclic data distributions using MPI, Concurrency: Practice and Experience, pp.707-728, 1996.
DOI : 10.1002/(SICI)1096-9128(199611)8:9<707::AID-CPE269>3.0.CO;2-V

L. Wang, J. M. Stichnoth, and S. Chatterjee, Runtime performance of parallel array assignment, Proceedings of the 1996 ACM/IEEE conference on Supercomputing (CDROM) , Supercomputing '96, 1996.
DOI : 10.1145/369028.369036

R. Thakur, A. Choudhary, and G. Fox, Runtime array redistribution in HPF programs, Proceedings of IEEE Scalable High Performance Computing Conference, pp.309-316, 1994.
DOI : 10.1109/SHPCC.1994.296659

F. Desprez, J. Dongarra, A. Petitet, C. Randriamaro, and Y. Robert, Scheduling block-cyclic array redistribution, IEEE Transactions on Parallel and Distributed Systems, vol.9, issue.2, pp.192-205, 1998.
DOI : 10.1109/71.663945

URL : https://hal.archives-ouvertes.fr/inria-00073573

M. Guo and Y. Pan, Improving communication scheduling for array redistribution, Journal of Parallel and Distributed Computing, vol.65, issue.5, 2005.
DOI : 10.1016/j.jpdc.2004.12.001

L. Prylli and B. Tourancheau, Efficient block cyclic data redistribution, Lectures Notes in Computer Science, vol.1123, pp.155-164, 1996.
DOI : 10.1007/3-540-61626-8_20

URL : https://hal.archives-ouvertes.fr/inria-00073925

A. Schrijver, Combinatorial Optimization: Polyhedra and Efficiency, ser. Algorithms and Combinatorics, 2003.

J. E. Hopcroft and R. M. Karp, An $n^{5/2} $ Algorithm for Maximum Matchings in Bipartite Graphs, SIAM Journal on Computing, vol.2, issue.4, pp.225-231, 1973.
DOI : 10.1137/0202019

G. Smith, Numerical Solutions of Partial Differential Equations: Finite Difference Methods, 1985.

M. R. Garey and D. S. Johnson, Computers and Intractability, a Guide to the Theory of NP- Completeness, 1979.

G. Bosilca, A. Bouteiller, A. Danalis, T. Herault, P. Lemarinier et al., DAGuE: A generic distributed DAG engine for High Performance Computing, Parallel Computing, vol.38, issue.1-2, pp.37-51, 2012.
DOI : 10.1016/j.parco.2011.10.003

A. Buttari, J. Langou, J. Kurzak, and J. Dongarra, A class of parallel tiled linear algebra algorithms for multicore architectures, Parallel Computing, vol.35, issue.1, pp.38-53, 2009.
DOI : 10.1016/j.parco.2008.10.002

G. Quintana-ortí, E. S. Quintana-ortí, R. A. Van-de-geijn, F. G. Van-zee, and E. Chan, Programming matrix algorithms-by-blocks for thread-level parallelism, ACM Transactions on Mathematical Software, vol.36, issue.3, pp.1-26, 2009.
DOI : 10.1145/1527286.1527288

J. Choi, J. Demmel, I. Dhillon, J. Dongarra, S. Ostrouchov et al., ScaLAPACK: a portable linear algebra library for distributed memory computers ??? design issues and performance, Computer Physics Communications, vol.97, issue.1-2, pp.1-15, 1996.
DOI : 10.1016/0010-4655(96)00017-3

G. Bosilca, A. Bouteiller, A. Danalis, M. Faverge, A. Haidar et al., Flexible Development of Dense Linear Algebra Algorithms on Massively Parallel Architectures with DPLASMA, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum, 2011.
DOI : 10.1109/IPDPS.2011.299

T. Herault, J. Herrmann, L. Marchal, and Y. Robert, Determining the Optimal Redistribution for a Given Data Partition, 2014 IEEE 13th International Symposium on Parallel and Distributed Computing, pp.95-102, 2014.
DOI : 10.1109/ISPDC.2014.16

URL : https://hal.archives-ouvertes.fr/hal-01111537