Bridging the gap between performance and bounds of Cholesky factorization on heterogeneous platforms, p.15, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01120507
Are static schedules so bad? a case study on cholesky factorization, 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp.1021-1030, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01223573
Comparative study of one-sided factorizations with multiple software packages on multi-core hardware, SC'09. ACM/IEEE Conference on Supercomputing, 2009. ,
StarPU: a unified platform for task scheduling on heterogeneous multicore architectures. Concurrency and Computation: Practice and Experience, Special Issue: Euro-Par, pp.187-198, 2011. ,
URL : https://hal.archives-ouvertes.fr/inria-00384363
Parallelizing dense and banded linear algebra libraries using SMPSs, Concurrency and Computation: Practice and Experience, vol.21, pp.2438-2456, 2009. ,
, Scheduling on two types of resources: a survey, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02432381
DAGuE: A generic distributed dag engine for high performance computing, Parallel Computing, vol.38, issue.1-2, pp.37-51, 2012. ,
Supermatrix: a multithreaded runtime scheduling system for algorithms-by-blocks, PPoPP '08, pp.123-132, 2008. ,
Resource aggregation for task-based cholesky factorization on top of modern architectures, Parallel Computing, vol.83, pp.73-92, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-01409965
Parallel gaussian elimination on an MIMD computer, Parallel Computing, vol.6, pp.275-296, 1988. ,
KAAPI: A thread scheduling runtime system for data flow computations on cluster of multi-processors, PASCO'07, 2007. ,
URL : https://hal.archives-ouvertes.fr/hal-00684843
Distributed SBP Cholesky factorization algorithms with near-optimal scheduling, ACM T. Math. Software, vol.36, pp.1-25, 2009. ,
An asynchronous task-based fan-both sparse cholesky solver, 2016. ,
, Task parallel incomplete cholesky factorization using 2d partitioned-block layout, 2016.
Solving systems of linear equations on the CELL processor using Cholesky factorization, IEEE Trans. Parallel Distrib. Syst, vol.19, pp.1175-1186, 2008. ,
URL : https://hal.archives-ouvertes.fr/hal-02421046
Scheduling dense linear algebra operations on multicore processors, Concurrency and Computation: Practice and Experience, vol.22, pp.15-44, 2010. ,
Optimal algorithms for gaussian elimination on a MIMD computer, Parallel Computing, vol.12, pp.183-194, 1989. ,
URL : https://hal.archives-ouvertes.fr/hal-00857016
A flexible and portable programming model for SMP and multi-cores, 2007. ,
Programming matrix algorithms-by-blocks for thread-level parallelism, vol.36 ,
Optimal scheduling algorithms for parallel gaussian elimination, Theoretical Computer Science, vol.64, pp.159-173, 1989. ,
URL : https://hal.archives-ouvertes.fr/hal-00857009
Dynamic task scheduling for linear algebra algorithms on distributed-memory multicore systems, p.9, 2009. ,
QUARK users' guide: Queueing and runtime for kernels, 2011. ,