Numerical Methods in Matrix Computations, 2015. ,
More Definite Results from the PluTo Scheduling Algorithm, 1st International Workshop on Polyhedral Compilation Techniques (IMPACT, 2011. ,
Optimization of Triangular and Banded Matrix Operations Using 2d-Packed Layouts, ACM Trans. Archit. Code Optim, vol.14, p.55, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01633724
Efficient Sparse Matrix-Vector Multiplication on CUDA, 2008. ,
Automatic Transformations for Communication-Minimized Parallelization and Locality Optimization in the Polyhedral Model, International Conference on Compiler Construction, 2008. ,
A Practical Automatic Polyhedral Program Optimization System, ACM SIGPLAN Conference on Programming Language Design and Implementation (PLDI), 2008. ,
Challenges and Advances in Parallel Sparse Matrix-Matrix Multiplication, Proceedings of the 2008 37th International Conference on Parallel Processing (ICPP '08), pp.503-510, 2008. ,
A Class of Parallel Tiled Linear Algebra Algorithms for Multicore Architectures, Parallel Comput, vol.35, pp.38-53, 2009. ,
URL : https://hal.archives-ouvertes.fr/hal-02420965
Extendable Pattern-oriented Optimization Directives, ACM Trans. Archit. Code Optim, vol.9, p.14, 2012. ,
Layoutoblivious Compiler Optimization for Matrix Computations, ACM Trans. Archit. Code Optim, vol.9, p.35, 2013. ,
Highperformance Graph Algorithms from Parallel Sparse Matrices, Proceedings of the 8th International Conference on Applied Parallel Computing: State of the Art in Scientific Computing (PARA'07), pp.260-269, 2007. ,
Rectangular Full Packed Format for Cholesky's Algorithm: Factorization, Solution, and Inversion, ACM Trans. Math. Softw, vol.37, issue.18, 2010. ,
Optimization of Dense Matrix Multiplication on IBM Cyclops-64: Challenges and Experiences, pp.134-144, 2006. ,
Automatic Parallelization for a Class of Regular Computations, 1997. ,
Adaptive Multi-level Blocking Optimization for Sparse Matrix Vector Multiplication on GPU, Procedia Computer Science, vol.80, pp.131-142, 2016. ,
Polybench/c 4.1: The polyhedral benchmark suite, 2015. ,
, Iterative Methods for Sparse Linear Systems, 2003.
Integrating Data Layout Transformations with the Polyhedral Model, Proceedings of International Workshop on Polyhedral Compilation Techniques (IMPACT'19), 2019. ,
Benchmarking GPUs to Tune Dense Linear Algebra, Proceedings of the 2008 ACM/IEEE Conference on Supercomputing (SC '08), vol.11, p.31, 2008. ,