E. Agullo, O. Beaumont, L. Eyraud-Dubois, J. Herrmann, S. Kumar et al., Bridging the gap between performance and bounds of Cholesky factorization on heterogeneous platforms, p.15, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01120507

E. Agullo, O. Beaumont, L. Eyraud-Dubois, and S. Kumar, Are static schedules so bad? A case study on Cholesky factorization, 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp.1021-1030, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01223573

E. Agullo, B. Hadri, H. Ltaief, and J. Dongarra, Comparative study of one-sided factorizations with multiple software packages on multi-core hardware, SC'09. ACM/IEEE Conference on Supercomputing, 2009.

C. Augonnet, S. Thibault, R. Namyst, and P. Wacrenier, StarPU: a unified platform for task scheduling on heterogeneous multicore architectures, Concurrency and Computation: Practice and Experience, Special Issue: Euro-Par, pp.187-198, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00384363

R. M. Badia, J. R. Herrero, J. Labarta, J. M. Pérez, E. S. Quintana-Ortí et al., Parallelizing dense and banded linear algebra libraries using SMPSs, Concurrency and Computation: Practice and Experience, vol.21, pp.2438-2456, 2009.

O. Beaumont, L. Canon, L. Eyraud-Dubois, G. Lucarelli, L. Marchal et al., Scheduling on two types of resources: a survey, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02432381

G. Bosilca, A. Bouteiller, A. Danalis, T. Herault, P. Lemarinier et al., DAGuE: A generic distributed DAG engine for high performance computing, Parallel Computing, vol.38, issue.1-2, pp.37-51, 2012.

E. Chan, F. G. Van Zee, P. Bientinesi, E. S. Quintana-Ortí, G. Quintana-Ortí et al., SuperMatrix: a multithreaded runtime scheduling system for algorithms-by-blocks, PPoPP '08, pp.123-132, 2008.

T. Cojean, A. Guermouche, A. Hugo, R. Namyst, and P. Wacrenier, Resource aggregation for task-based Cholesky factorization on top of modern architectures, Parallel Computing, vol.83, pp.73-92, 2019.
URL : https://hal.archives-ouvertes.fr/hal-01409965

M. Cosnard, M. Marrakchi, Y. Robert, and D. Trystram, Parallel Gaussian elimination on an MIMD computer, Parallel Computing, vol.6, pp.275-296, 1988.

T. Gautier, X. Besseron, and L. Pigeon, KAAPI: A thread scheduling runtime system for data flow computations on cluster of multi-processors, PASCO'07, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00684843

F. Gustavson, L. Karlsson, and B. Kågström, Distributed SBP Cholesky factorization algorithms with near-optimal scheduling, ACM T. Math. Software, vol.36, pp.1-25, 2009.

M. Jacquelin, Y. Zheng, E. Ng, and K. Yelick, An asynchronous task-based fan-both sparse Cholesky solver, 2016.

K. Kim, S. Rajamanickam, G. Stelle, H. C. Edwards, and S. L. Olivier, Task parallel incomplete Cholesky factorization using 2D partitioned-block layout, 2016.

J. Kurzak, A. Buttari, and J. Dongarra, Solving systems of linear equations on the CELL processor using Cholesky factorization, IEEE Trans. Parallel Distrib. Syst, vol.19, pp.1175-1186, 2008.
URL : https://hal.archives-ouvertes.fr/hal-02421046

J. Kurzak, H. Ltaief, J. Dongarra, and R. M. Badia, Scheduling dense linear algebra operations on multicore processors, Concurrency and Computation: Practice and Experience, vol.22, pp.15-44, 2010.

M. Marrakchi and Y. Robert, Optimal algorithms for Gaussian elimination on a MIMD computer, Parallel Computing, vol.12, pp.183-194, 1989.
URL : https://hal.archives-ouvertes.fr/hal-00857016

J. M. Pérez, R. M. Badia, and J. Labarta, A flexible and portable programming model for SMP and multi-cores, 2007.

E. S. Quintana-Ortí, G. Quintana-Ortí, R. A. van de Geijn, F. G. Van Zee, and E. Chan, Programming matrix algorithms-by-blocks for thread-level parallelism, ACM Transactions on Mathematical Software, vol.36, 2009.

Y. Robert and D. Trystram, Optimal scheduling algorithms for parallel Gaussian elimination, Theoretical Computer Science, vol.64, pp.159-173, 1989.
URL : https://hal.archives-ouvertes.fr/hal-00857009

F. Song, A. Yarkhan, and J. Dongarra, Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems, p.9, 2009.

A. Yarkhan, J. Kurzak, and J. Dongarra, QUARK users' guide: Queueing and runtime for kernels, 2011.