S. Abdulah, H. Ltaief, Y. Sun, M. G. Genton, and D. E. Keyes, Geostatistical modeling and prediction using mixed precision tile cholesky factorization, HIPC, pp.152-162, 2019.

E. Agullo, O. Aumage, M. Faverge, N. Furmento, F. Pruvost et al., Achieving high performance on supercomputers with a sequential task-based programming model, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01618526

E. Agullo, O. Beaumont, L. Eyraud-dubois, J. Herrmann, S. Kumar et al., Bridging the gap between performance and bounds of cholesky factorization on heterogeneous platforms, IPDPSW, pp.34-45, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01120507

E. Agullo, O. Beaumont, L. Eyraud-dubois, and S. Kumar, Are static schedules so bad? a case study on cholesky factorization, IPDPS, pp.1021-1030, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01223573

E. Agullo, B. Hadri, H. Ltaief, and J. Dongarra, Comparative study of one-sided factorizations with multiple software packages on multi-core hardware, SC'09. ACM/IEEE Conference on Supercomputing, 2009.

S. Ambikasaran and E. Darve, An o(n log n) fast direct solver for partial hierarchically semiseparable matrices, Journal of Scientific Computing, vol.57, pp.477-501, 2013.

P. Amestoy, C. Ashcraft, O. Boiteau, A. Buttari, J. L'excellent et al., Improving multifrontal methods by means of block low-rank representations, SIAM Journal on Scientific Computing, vol.37, pp.1451-1474, 2015.
URL : https://hal.archives-ouvertes.fr/hal-00776859

A. Aminfar, S. Ambikasaran, and E. Darve, A fast block low-rank dense solver with applications to finite-element matrices, Journal of Computational Physics, vol.304, pp.170-188, 2016.

B. C. Arnold, N. Balakrishnan, and H. N. Nagaraja, A first course in order statistics, 2008.

C. Augonnet, D. Goudin, M. Kuhn, X. Lacoste, R. Namyst et al., A hierarchical fast direct solver for distributed memory machines with manycore nodes, Research report, 2019.
URL : https://hal.archives-ouvertes.fr/cea-02304706

C. Augonnet, S. Thibault, R. Namyst, and P. Wacrenier, StarPU: a unified platform for task scheduling on heterogeneous multicore architectures. Concurrency and Computation: Practice and Experience, Special Issue: Euro-Par, pp.187-198, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00384363

R. M. Badia, J. R. Herrero, J. Labarta, J. M. Pérez, E. S. Quintana-ortí et al., Parallelizing dense and banded linear algebra libraries using SMPSs. Concurrency and Computation: Practice and Experience, vol.21, pp.2438-2456, 2009.

O. Beaumont, B. A. Becker, A. Deflumere, L. Eyraud-dubois, T. Lambert et al., Recent advances in matrix partitioning for parallel computing on heterogeneous platforms, IEEE Transactions on Parallel and Distributed Systems, vol.30, pp.218-229, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01670672

O. Beaumont, E. Lionel, and M. Verite, 2D Static Resource Allocation strategies for load balancing in for Compressed Linear Algebra and Communication Constraints, 2020.

D. Bergman, C. Cardonha, and S. Mehrani, Binary decision diagrams for bin packing with minimum color fragmentation, CPAIOR, pp.57-66, 2019.

P. Beziau, Data distribution strategies for cholesky decomposition, Compas, 2019.

L. S. Blackford, J. Choi, A. Cleary, E. D'azevedo, J. Demmel et al., ScaLAPACK Users' Guide. Society for Industrial and Applied Mathematics, 1997.

G. Bosilca, A. Bouteiller, A. Danalis, M. Faverge, A. Haidar et al., Flexible development of dense linear algebra algorithms on massively parallel architectures with dplasma, IPDPSW, pp.1432-1441, 2011.

G. Bosilca, A. Bouteiller, A. Danalis, T. Herault, P. Lemarinier et al., DAGuE: A generic distributed dag engine for high performance computing, Parallel Computing, vol.38, issue.1-2, pp.37-51, 2012.

A. Buttari, J. Langou, J. Kurzak, and J. Dongarra, A class of parallel tiled linear algebra algorithms for multicore architectures, Parallel Computing, vol.35, pp.38-53, 2009.
URL : https://hal.archives-ouvertes.fr/hal-02420965

Q. Cao, Y. Pei, T. Herault, K. Akbudak, A. Mikhalev et al., Performance analysis of tile low-rank Cholesky factorization using parsec instrumentation tools, ProTools, 2019.

R. Carratalá-sáez, M. Faverge, G. Pichon, G. Sylvand, and E. Quintana-ortí, Tiled algorithms for efficient task-parallel h-matrix solvers, PDSEC, 2020.

H. Casanova, A. Legrand, and M. Quinson, Simgrid: A generic framework for large-scale distributed experiments, Tenth International Conference on Computer Modeling and Simulation, pp.126-131, 2008.
URL : https://hal.archives-ouvertes.fr/inria-00260697

E. Chan, F. G. Van-zee, P. Bientinesi, E. S. Quintana-ortí, G. Quintana-ortí et al., Supermatrix: a multithreaded runtime scheduling system for algorithms-byblocks, PPoPP '08, pp.123-132, 2008.

J. Choi, J. J. Dongarra, L. S. Ostrouchov, A. P. Petitet, D. W. Walker et al., Design and implementation of the scalapack lu, qr, and cholesky factorization routines, Sci. Program, vol.5, pp.173-184, 1996.

E. Coffman, G. Frederickson, and G. Lueker, Probabilistic analysis of the lpt processor scheduling heuristic, Deterministic and stochastic scheduling, pp.319-331, 1982.

T. Cojean, A. Guermouche, A. Hugo, R. Namyst, and P. Wacrenier, Resource aggregation for task-based cholesky factorization on top of modern architectures, Parallel Computing, vol.83, pp.73-92, 2019.
URL : https://hal.archives-ouvertes.fr/hal-01957086

L. Epstein, C. Imreh, L. , and A. , Class constrained bin packing revisited, Theoretical Computer Science, 2010.

M. Faverge, G. Pichon, P. Ramet, R. , and J. , On the use of h-matrix arithmetic in pastix: a preliminary study, Workshop on Fast Solvers, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01187882

G. H. Golub and C. F. Van-loan, Matrix computations, vol.3, 2012.

L. Grasedyck and W. Hackbusch, Construction and arithmetics of h-matrices, Computing, vol.70, pp.295-334, 2003.

M. Grigni and F. Manne, On the complexity of the generalized block distribution, ternational Workshop on Parallel Algorithms for Irregularly Structured Problems, pp.319-326, 1996.

J. A. Gunnels, F. G. Gustavson, G. M. Henry, and R. A. Van-de-geijn, Flame: Formal linear algebra methods environment, ACM Trans. Math. Softw, vol.27, pp.422-455, 2001.

W. Hackbusch, A sparse matrix arithmetic based on h-matrices, Computing, vol.62, pp.89-108, 1999.

A. Ida, Lattice h-matrices on distributed-memory systems, 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp.389-398, 2018.

D. Irony, S. Toledo, and A. Tiskin, Communication lower bounds for distributed-memory matrix multiplication, J. Parallel Distrib. Comput, vol.64, pp.1017-1026, 2004.

M. Jacquelin, Y. Zheng, E. Ng, Y. , and K. , An asynchronous task-based fan-both sparse cholesky solver, 2016.

K. Jansen, A. Lassota, and M. Maack, Approximation algorithms for scheduling with class constraints, 2019.

A. Kalinov and A. Lastovetsky, Heterogeneous distribution of computations solving linear algebra problems on networks of heterogeneous computers, Journal of Parallel and Distributed Computing, vol.61, pp.520-535, 2001.

K. Kim, S. Rajamanickam, G. Stelle, H. C. Edwards, and S. L. Olivier, Task parallel incomplete cholesky factorization using 2d partitioned-block layout, 2016.

J. Kurzak, H. Ltaief, J. Dongarra, and R. M. Badia, Scheduling dense linear algebra operations on multicore processors, Concurrency and Computation: Practice and Experience, vol.22, pp.15-44, 2010.

T. Mary, Block Low-Rank multifrontal solvers: complexity, performance, and scalability, 2017.
URL : https://hal.archives-ouvertes.fr/tel-01929478

J. M. Pérez, R. M. Badia, and J. Labarta, A flexible and portable programming model for SMP and multi-cores, 2007.

G. Pichon, E. Darve, M. Faverge, P. Ramet, R. et al., Sparse supernodal solver using block low-rank compression: Design, performance and analysis, Journal of computational science, vol.27, pp.255-270, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01660665

E. S. Quintana-ortí, G. Quintana-ortí, R. A. Van-de-geijn, F. G. Van-zee, C. et al., Programming matrix algorithms-by-blocks for thread-level parallelism, vol.36

E. Solomonik and J. Demmel, Communication-optimal parallel 2.5D matrix multiplication and LU factorization algorithms, EuroPar, 2011.

F. Song, A. Yarkhan, and J. Dongarra, Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems, p.9, 2009.

I. Yamazaki, A. Ida, R. Yokota, and J. Dongarra, Distributed-memory lattice h-matrix factorization, The International Journal of High Performance Computing Applications, vol.33, pp.1046-1063, 2019.

A. Ya?ar and Ü. V. , Heuristics for symmetric rectilinear matrix partitioning, 2019.