C. Lawson, R. Hanson, D. Kincaid, and F. Krogh, Basic Linear Algebra Subprograms for Fortran Usage, ACM Transactions on Mathematical Software, vol.5, issue.3, pp.308-323, 1979.
DOI : 10.1145/355841.355847

J. G. Dumas, T. Gautier, and C. Pernet, Finite field linear algebra subroutines, Proceedings of the 2002 international symposium on Symbolic and algebraic computation, ISSAC '02, 2002.
DOI : 10.1145/780506.780515

E. Elmroth, F. Gustavson, I. Jonsson, and B. Kågström, Recursive Blocked Algorithms and Hybrid Data Structures for Dense Matrix Library Software, SIAM Review, vol.46, issue.1, 2004.
DOI : 10.1137/S0036144503428693

R. A. Chowdhury and V. Ramachandran, The cache-oblivious Gaussian elimination paradigm, Proceedings of the nineteenth annual ACM symposium on Parallel algorithms and architectures, SPAA '07, pp.71-80, 2007.
DOI : 10.1145/1248377.1248392

T. M. Low and R. A. van de Geijn, An API for Manipulating Matrices Stored by Blocks, 2004.

R. C. Whaley, A. Petitet, and J. J. Dongarra, Automated empirical optimizations of software and the ATLAS project, Parallel Computing, vol.27, issue.1-2, pp.3-35, 2001.
DOI : 10.1016/S0167-8191(00)00087-9

J. Demmel, J. Dongarra, V. Eijkhout, and E. Fuentes, Self-Adapting Linear Algebra Algorithms and Software, Proceedings of the IEEE, special issue on "Program Generation, Optimization, and Adaptation", 2005.
DOI : 10.1109/JPROC.2004.840848

K. Goto and R. van de Geijn, On reducing TLB misses in matrix multiplication, 2002.

R. Koenker and P. Ng, SparseM: A sparse matrix package for R, Journal of Statistical Software, vol.8, issue.6, 2003.

N. J. Gu and K. Li, Optimization for BLAS on Loongson 2F architecture, p.38, 2008.