S. Tomov, R. Nath, P. Du, and J. Dongarra, MAGMA version 0.2 User Guide, 2009.

C. Augonnet, S. Thibault, R. Namyst, and P. Wacrenier, StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures, Concurrency and Computation: Practice and Experience, Euro-Par, 2009.
URL : https://hal.archives-ouvertes.fr/inria-00384363

A. Buttari, J. Langou, J. Kurzak, and J. J. Dongarra, A class of parallel tiled linear algebra algorithms for multicore architectures, Parallel Computing, vol.35, issue.1, pp.38-53, 2009.
DOI : 10.1016/j.parco.2008.10.002

E. S. Quintana-Ortí and R. A. van de Geijn, Updating an LU Factorization with Pivoting, ACM Transactions on Mathematical Software, vol.35, issue.2, p.11, 2008.
DOI : 10.1145/1377612.1377615

H. Topcuoglu, S. Hariri, and M. Wu, Performance-effective and low-complexity task scheduling for heterogeneous computing, IEEE Transactions on Parallel and Distributed Systems, vol.13, issue.3, pp.260-274, 2002.

S. Tomov, R. Nath, H. Ltaief, and J. Dongarra, Dense linear algebra solvers for multicore with GPU accelerators, 2010 IEEE International Symposium on Parallel & Distributed Processing, Workshops and PhD Forum (IPDPSW), 2010.
DOI : 10.1109/IPDPSW.2010.5470941

S. Tomov, R. Nath, and J. Dongarra, Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing, Parallel Computing, vol.36, issue.12, 2010.
DOI : 10.1016/j.parco.2010.06.001

G. Bosilca, A. Bouteiller, M. Danalis, H. Faverge, T. Haidar et al., Distributed-Memory Task Execution and Dependence Tracking within DAGuE and the DPLASMA Project.

B. Hadri, H. Ltaief, E. Agullo, and J. Dongarra, Tile QR factorization with parallel panel processing for multicore architectures, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), 2010.
DOI : 10.1109/IPDPS.2010.5470443

URL : https://hal.archives-ouvertes.fr/inria-00548899