P. Brucker and S. Knust, Complexity results for scheduling problems

R. Bleuse, S. Kedad-sidhoum, F. Monna, G. Mounié, and D. Trystram, Scheduling independent tasks on multi-cores with GPU accelerators, Concurrency and Computation: Practice and Experience, pp.1625-1638, 2015.
DOI : 10.1002/cpe.3359

URL : https://hal.archives-ouvertes.fr/hal-01081625

C. Augonnet, S. Thibault, R. Namyst, and P. Wacrenier, StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures Concurrency and Computation: Practice and Experience, Special Issue: Euro- Par, pp.187-198, 2009.

J. Planas, R. M. Badia, E. Ayguadé, and J. Labarta, Hierarchical Task-Based Programming With StarSs, International Journal of High Performance Computing Applications, vol.23, issue.3, pp.284-299, 2009.
DOI : 10.1177/1094342009106195

A. Yarkhan, J. Kurzak, and J. Dongarra, Guide: QUeueing And Runtime for Kernels, 2011.

G. Bosilca, A. Bouteiller, A. Danalis, M. Faverge, T. Hérault et al., PaRSEC: A programming paradigm exploiting heterogeneity for enhancing scalability, Computing in Science and Engineering, vol.15, issue.6, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00930217

H. Topcuoglu, S. Hariri, and M. Wu, Performance-effective and low-complexity task scheduling for heterogeneous computing, IEEE Transactions on Parallel and Distributed Systems, vol.13, issue.3, pp.260-274, 2002.
DOI : 10.1109/71.993206

A. Buttari, J. Langou, J. Kurzak, and J. Dongarra, A class of parallel tiled linear algebra algorithms for multicore architectures, Parallel Computing, vol.35, issue.1, pp.38-53, 2009.
DOI : 10.1016/j.parco.2008.10.002

F. G. Van-zee, E. Chan, R. A. Van-de-geijn, E. S. Quintana-orti, and G. Quintana-orti, The libflame Library for Dense Matrix Computations, Computing in Science and Engineering, vol.11, issue.6, pp.56-63, 2009.

E. Chan, F. G. Zee, P. Bientinesi, E. S. Quintana-ortí, G. Quintana-ortí et al., SuperMatrix, Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming , PPoPP '08, pp.123-132, 2008.
DOI : 10.1145/1345206.1345227

]. E. Agullo, C. Augonnet, J. Dongarra, H. Ltaief, R. Namyst et al., Faster, Cheaper, Better ? a Hybridization Methodology to Develop Linear Algebra Software for GPUs, GPU Computing Gems, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00547847

G. Bosilca, A. Bouteiller, A. Danalis, M. Faverge, H. Haidar et al., Distributed- Memory Task Execution and Dependence Tracking within DAGuE and the DPLASMA Project, 2010.

H. Casanova, A. Legrand, and M. Quinson, SimGrid: A Generic Framework for Large-Scale Distributed Experiments, Tenth International Conference on Computer Modeling and Simulation (uksim 2008), 2008.
DOI : 10.1109/UKSIM.2008.28

URL : https://hal.archives-ouvertes.fr/inria-00260697

R. L. Graham, Bounds on Multiprocessing Timing Anomalies, SIAM Journal on Applied Mathematics, vol.17, issue.2, pp.416-429, 1969.
DOI : 10.1137/0117039

H. Bouwmeester and J. Langou, A critical path approach to analyzing parallelism of algorithmic variants. application to cholesky inversion, 1010.

H. M. Bouwmeester, Tiled algorithms for matrix computations on multicore architectures

R. D. Blumofe and C. E. Leiserson, Scheduling multithreaded computations by work stealing, Journal of the ACM, vol.46, issue.5, pp.720-748, 1999.
DOI : 10.1145/324133.324234

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.113.7695

P. Baptiste, C. L. Pape, and W. Nuijten, Constraint-based scheduling: applying constraint programming to scheduling problems, 2012.
DOI : 10.1007/978-1-4615-1479-4

URL : https://hal.archives-ouvertes.fr/inria-00123562

A. Z. Shahul and O. Sinnen, Scheduling task graphs optimally with a*, The Journal of Supercomputing, vol.51, issue.3, pp.310-332, 2010.

E. Agullo, O. Beaumont, L. Eyraud-dubois, J. Herrmann, S. Kumar et al., Bridging the Gap between Performance and Bounds of Cholesky Factorization on Heterogeneous Platforms, 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015.
DOI : 10.1109/IPDPSW.2015.35

URL : https://hal.archives-ouvertes.fr/hal-01120507

E. Agullo, B. Bramas, O. Coulaud, E. Darve, M. Messner et al., Task-Based FMM for Multicore Architectures, SIAM Journal on Scientific Computing, vol.36, issue.1, 2014.
DOI : 10.1137/130915662

URL : https://hal.archives-ouvertes.fr/hal-00807368

L. Jaulmes, E. Ayguadé, M. Casas, J. Labarta, M. Moretó et al., Exploiting asynchrony from exact forward recovery for DUE in iterative solvers, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis on, SC '15, pp.531-5312, 2015.
DOI : 10.1145/2807591.2807599

L. Stanisic, S. Thibault, A. Legrand, B. Videau, and J. Méhaut, Modeling and Simulation of a Dynamic Task-Based Runtime System for Heterogeneous Multi-core Architectures, Euro-Par 2014, pp.50-62, 2014.
DOI : 10.1007/978-3-319-09873-9_5

URL : https://hal.archives-ouvertes.fr/hal-01011633