E. Agullo, J. Dongarra, B. Hadri, J. Kurzak, J. Langou et al., PLASMA Users Guide, 2009.

E. Agullo, B. Hadri, H. Ltaief, and J. Dongarra, Comparative study of one-sided factorizations with multiple software packages on multi-core hardware, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, SC '09, 2009.
DOI : 10.1145/1654059.1654080

A. Buttari, J. Langou, J. Kurzak, and J. Dongarra, Parallel tiled QR factorization for multicore architectures. Concurrency and Computation: Practice and Experience, pp.1573-1590, 2008.

A. Buttari, J. Langou, J. Kurzak, and J. Dongarra, A class of parallel tiled linear algebra algorithms for multicore architectures, Parallel Computing, vol.35, issue.1, pp.38-53, 2009.
DOI : 10.1016/j.parco.2008.10.002

J. W. Demmel, L. Grigori, M. F. Hoemmen, and J. Langou, Communication-optimal Parallel and Sequential QR and LU Factorizations, SIAM Journal on Scientific Computing, vol.34, issue.1, 2008.
DOI : 10.1137/080731992

URL : https://hal.archives-ouvertes.fr/hal-00870930

R. W. Freund and M. Malhotra, A block QMR algorithm for non-Hermitian linear systems with multiple right-hand sides, Linear Algebra and its Applications, vol.254, issue.1-3, pp.1-3119, 1997.
DOI : 10.1016/S0024-3795(96)00529-0

G. H. Golub and C. F. Van-loan, Matrix Computation. John Hopkins Studies in the Mathematical Sciences, 1996.

L. Grigori, J. W. Demmel, and H. Xiang, Communication Avoiding Gaussian elimination, 2008 SC, International Conference for High Performance Computing, Networking, Storage and Analysis, pp.1-12, 2008.
DOI : 10.1109/SC.2008.5214287

URL : https://hal.archives-ouvertes.fr/inria-00277901

B. Hadri, H. Ltaief, E. Agullo, and J. Dongarra, Tall and Skinny QR Matrix Factorization Using Tile Algorithms on Multicore Architectures, LAPACK Working Note, vol.222, 2009.

J. Kurzak and J. Dongarra, Fully Dynamic Scheduler for Numerical Computing on Multicore Processors, LAPACK Working Note, vol.220, 2009.

J. Kurzak and J. Dongarra, QR factorization for the, Cell Broadband Engine. Sci. Program, vol.17, issue.12, pp.31-42, 2009.

D. P. Leary, The block conjugate gradient algorithm and related methods, Linear Algebra and its Applications, vol.29, pp.293-322, 1980.
DOI : 10.1016/0024-3795(80)90247-5

A. Pothen and P. Raghavan, Distributed Orthogonal Factorization: Givens and Householder Algorithms, SIAM Journal on Scientific and Statistical Computing, vol.10, issue.6, pp.1113-1134, 1989.
DOI : 10.1137/0910067

R. Schreiber and C. Van-loan, A Storage-Efficient $WY$ Representation for Products of Householder Transformations, SIAM Journal on Scientific and Statistical Computing, vol.10, issue.1, pp.53-57, 1989.
DOI : 10.1137/0910005