A. Aubum and V. Mclean, Efficient exploitation of concurrency using graph decomposition, 1990.

C. Augonnet, S. Thibault, R. Namyst, and P.-A. Wacrenier, StarPU: A unified platform for task scheduling on heterogeneous multicore architectures, Concurrency and Computation: Practice and Experience, pp.187-198, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00384363

R. D. Blumofe, C. F. Joerg, B. C. Kuszmaul et al., Cilk: An efficient multithreaded runtime system, 1995.

J. Bueno, L. Martinell, A. Duran, M. Farreras, X. Martorell et al., Productive Cluster Programming with OmpSs, Euro-Par 2011 Parallel Processing, pp.555-566, 2011.
DOI : 10.1147/rd.515.0593

L. Dagum and R. Menon, OpenMP: an industry standard API for shared-memory programming, IEEE Computational Science and Engineering, vol.5, issue.1, pp.46-55, 1998.
DOI : 10.1109/99.660313

J. V. F. Lima, T. Gautier, N. Maillard, and V. Danjean, Exploiting Concurrent GPU Operations for Efficient Work Stealing on Multi-GPUs, 24th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), 2012.
URL : https://hal.archives-ouvertes.fr/hal-00735470

M. Frigo, C. E. Leiserson, and K. H. Randall, The implementation of the Cilk-5 multithreaded language, ACM SIGPLAN Notices, vol.33, issue.5, pp.212-223, 1998.
DOI : 10.1145/277652.277725

M. R. Garey and R. L. Graham, Bounds for multiprocessor scheduling with resource constraints, SIAM Journal on Computing, vol.4, issue.2, pp.187-200, 1975.

T. Gautier, X. Besseron, and L. Pigeon, KAAPI, Proceedings of the 2007 international workshop on Parallel symbolic computation, PASCO '07, pp.15-23, 2007.
DOI : 10.1145/1278177.1278182
URL : https://hal.archives-ouvertes.fr/hal-00647474

T. Gautier, J. V. F. Lima, N. Maillard, and B. Raffin, XKaapi: A Runtime System for Data-Flow Task Programming on Heterogeneous Architectures, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing, 2013.
DOI : 10.1109/IPDPS.2013.66
URL : https://hal.archives-ouvertes.fr/hal-00799904

R. L. Graham, Bounds for Certain Multiprocessing Anomalies, Bell System Technical Journal, vol.45, issue.9, pp.1563-1581, 1966.
DOI : 10.1002/j.1538-7305.1966.tb01709.x

R. L. Graham, Bounds on Multiprocessing Timing Anomalies, SIAM Journal on Applied Mathematics, vol.17, issue.2, pp.416-429, 1969.
DOI : 10.1137/0117039

J. J. Hwang, Y. C. Chow, F. D. Anger, and C. Y. Lee, Scheduling Precedence Graphs in Systems with Interprocessor Communication Times, SIAM Journal on Computing, vol.18, issue.2, pp.244-257, 1989.
DOI : 10.1137/0218016

A. Khan, C. L. McCreary, and M. Jones, A Comparison of Multiprocessor Scheduling Heuristics, 1994 International Conference on Parallel Processing (ICPP'94), pp.243-250, 1994.
DOI : 10.1109/ICPP.1994.19

R. Lepère and D. Trystram, A new clustering algorithm for large communication delays, Proceedings 16th International Parallel and Distributed Processing Symposium, pp.68-73, 2002.
DOI : 10.1109/IPDPS.2002.1015571

E. Lusk, S. Huss, B. Saphir, and M. Snir, MPI: A message-passing interface standard, 2009.

C. McCreary, J. Thompson, H. Gill, T. Smith, and Y. Zhu, Partitioning and scheduling using graph decomposition, Department of Computer Science and Engineering, CSE-93-06, 1993.

C. McCreary and A. Reed, A graph parsing algorithm and implementation, Dept. of Comp. Sci. and Eng., 1993.

J. E. Stone, D. Gohara, and G. Shi, OpenCL: A parallel programming standard for heterogeneous computing systems, Computing in Science & Engineering, vol.12, issue.3, p.66, 2010.

H. Topcuoglu, S. Hariri, and M. Wu, Performance-effective and low-complexity task scheduling for heterogeneous computing, IEEE Transactions on Parallel and Distributed Systems, vol.13, issue.3, pp.260-274, 2002.

J. D. Ullman, NP-complete scheduling problems, Journal of Computer and System Sciences, vol.10, issue.3, pp.384-393, 1975.
DOI : 10.1016/S0022-0000(75)80008-0
URL : http://doi.org/10.1016/s0022-0000(75)80008-0

T. Yang and A. Gerasoulis, DSC: Scheduling parallel tasks on an unbounded number of processors, IEEE Transactions on Parallel and Distributed Systems, vol.5, issue.9, pp.951-967, 1994.