Geostatistical modeling and prediction using mixed precision tile Cholesky factorization, HiPC, pp.152-162, 2019. ,
Achieving high performance on supercomputers with a sequential task-based programming model, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01618526
Bridging the gap between performance and bounds of Cholesky factorization on heterogeneous platforms, IPDPSW, pp.34-45, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01120507
Are static schedules so bad? A case study on Cholesky factorization, IPDPS, pp.1021-1030, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01223573
Comparative study of one-sided factorizations with multiple software packages on multi-core hardware, SC'09. ACM/IEEE Conference on Supercomputing, 2009. ,
An O(N log N) fast direct solver for partial hierarchically semiseparable matrices, Journal of Scientific Computing, vol.57, pp.477-501, 2013. ,
Improving multifrontal methods by means of block low-rank representations, SIAM Journal on Scientific Computing, vol.37, pp.1451-1474, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-00776859
A fast block low-rank dense solver with applications to finite-element matrices, Journal of Computational Physics, vol.304, pp.170-188, 2016. ,
A first course in order statistics, 2008. ,
A hierarchical fast direct solver for distributed memory machines with manycore nodes, Research report, 2019. ,
URL : https://hal.archives-ouvertes.fr/cea-02304706
StarPU: a unified platform for task scheduling on heterogeneous multicore architectures. Concurrency and Computation: Practice and Experience, Special Issue: Euro-Par, pp.187-198, 2011. ,
URL : https://hal.archives-ouvertes.fr/inria-00384363
Parallelizing dense and banded linear algebra libraries using SMPSs. Concurrency and Computation: Practice and Experience, vol.21, pp.2438-2456, 2009. ,
Recent advances in matrix partitioning for parallel computing on heterogeneous platforms, IEEE Transactions on Parallel and Distributed Systems, vol.30, pp.218-229, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01670672
2D Static Resource Allocation strategies for load balancing for Compressed Linear Algebra and Communication Constraints, 2020. ,
Binary decision diagrams for bin packing with minimum color fragmentation, CPAIOR, pp.57-66, 2019. ,
Data distribution strategies for Cholesky decomposition, Compas, 2019. ,
ScaLAPACK Users' Guide. Society for Industrial and Applied Mathematics, 1997.
Flexible development of dense linear algebra algorithms on massively parallel architectures with DPLASMA, IPDPSW, pp.1432-1441, 2011. ,
DAGuE: A generic distributed DAG engine for high performance computing, Parallel Computing, vol.38, issue.1-2, pp.37-51, 2012. ,
A class of parallel tiled linear algebra algorithms for multicore architectures, Parallel Computing, vol.35, pp.38-53, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-02420965
Performance analysis of tile low-rank Cholesky factorization using PaRSEC instrumentation tools, ProTools, 2019. ,
Tiled algorithms for efficient task-parallel H-matrix solvers, PDSEC, 2020. ,
SimGrid: A generic framework for large-scale distributed experiments, Tenth International Conference on Computer Modeling and Simulation, pp.126-131, 2008. ,
URL : https://hal.archives-ouvertes.fr/inria-00260697
SuperMatrix: a multithreaded runtime scheduling system for algorithms-by-blocks, PPoPP '08, pp.123-132, 2008. ,
Design and implementation of the ScaLAPACK LU, QR, and Cholesky factorization routines, Sci. Program, vol.5, pp.173-184, 1996. ,
Probabilistic analysis of the LPT processor scheduling heuristic, Deterministic and stochastic scheduling, pp.319-331, 1982. ,
Resource aggregation for task-based Cholesky factorization on top of modern architectures, Parallel Computing, vol.83, pp.73-92, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-01957086
Class constrained bin packing revisited, Theoretical Computer Science, 2010. ,
On the use of H-matrix arithmetic in PaStiX: a preliminary study, Workshop on Fast Solvers, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01187882
Matrix computations, vol.3, 2012.
Construction and arithmetics of H-matrices, Computing, vol.70, pp.295-334, 2003. ,
On the complexity of the generalized block distribution, International Workshop on Parallel Algorithms for Irregularly Structured Problems, pp.319-326, 1996. ,
FLAME: Formal linear algebra methods environment, ACM Trans. Math. Softw, vol.27, pp.422-455, 2001. ,
A sparse matrix arithmetic based on H-matrices, Computing, vol.62, pp.89-108, 1999. ,
Lattice H-matrices on distributed-memory systems, 2018 IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp.389-398, 2018. ,
Communication lower bounds for distributed-memory matrix multiplication, J. Parallel Distrib. Comput, vol.64, pp.1017-1026, 2004. ,
An asynchronous task-based fan-both sparse Cholesky solver, 2016. ,
Approximation algorithms for scheduling with class constraints, 2019. ,
Heterogeneous distribution of computations solving linear algebra problems on networks of heterogeneous computers, Journal of Parallel and Distributed Computing, vol.61, pp.520-535, 2001. ,
Task parallel incomplete Cholesky factorization using 2D partitioned-block layout, 2016.
Scheduling dense linear algebra operations on multicore processors, Concurrency and Computation: Practice and Experience, vol.22, pp.15-44, 2010. ,
Block Low-Rank multifrontal solvers: complexity, performance, and scalability, 2017. ,
URL : https://hal.archives-ouvertes.fr/tel-01929478
A flexible and portable programming model for SMP and multi-cores, 2007. ,
Sparse supernodal solver using block low-rank compression: Design, performance and analysis, Journal of computational science, vol.27, pp.255-270, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01660665
Programming matrix algorithms-by-blocks for thread-level parallelism, vol.36 ,
Communication-optimal parallel 2.5D matrix multiplication and LU factorization algorithms, Euro-Par, 2011. ,
Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems, p.9, 2009. ,
Distributed-memory lattice H-matrix factorization, The International Journal of High Performance Computing Applications, vol.33, pp.1046-1063, 2019. ,
Heuristics for symmetric rectilinear matrix partitioning, 2019.