The TOP500: History, Trends, and Future Directions in High Performance Computing ,
The design of OpenMP tasks, IEEE Transactions on Parallel and Distributed Systems, vol.20, issue.3, pp.404-418, 2009. ,
OmpSs: a proposal for programming heterogeneous multi-core architectures, Par. Proc. Letters, vol.21, issue.02, 2011. ,
Exploiting heterogeneity to enhance scalability, Computing in Science & Engineering, vol.15, issue.6, pp.36-45, 2013. ,
StarPU: a unified platform for task scheduling on heterogeneous multicore architectures, Conc. and Comp.: Pract. and Exp, vol.23, issue.2, 2011. ,
PARAVER: A Tool to Visualize and Analyze Parallel Code, Proceedings of WoTUG-18: Transputer and occam Developments, pp.17-31, 1995. ,
Combing the Communication Hairball: Visualizing Parallel Execution Traces using Logical Time, IEEE Transactions on Visualization and Computer Graphics, vol.20, issue.12, pp.2349-2358, 2014. ,
DOI : 10.1109/TVCG.2014.2346456
The Vampir performance analysis tool-set, pp.139-155, 2008. ,
A Multi-Language Computing Environment for Literate Programming and Reproducible Research, J. of Stat. Soft, vol.46, issue.3, 2012. ,
Poster: Matrices over Runtime Systems at Exascale, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, pp.1332-1332, 2012. ,
DOI : 10.1109/SC.Companion.2012.168
StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators, In: EuroMPI', vol.12, pp.298-299, 2012. ,
Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects, Journal of Physics: Conference Series, vol.180, issue.1, 2009. ,
DAGuE: A generic distributed {DAG} engine for High Performance Computing, Extensions for Next-Generation Parallel Programming Models, pp.37-51, 2012. ,
Implementation of FEM Application on GPU with StarPU, In: SIAM, 2013. ,
Towards seismic wave modeling on heterogeneous many-core architectures using task-based runtime system, 2015. ,
Task-Based Conjugate Gradient: From Multi-GPU Towards Heterogeneous Architectures, Euro-Par 2016: Parallel Processing Workshops: Euro-Par 2016 International Workshops, pp.69-82, 2016. ,
DOI : 10.1137/1.9780898718003
URL : https://hal.archives-ouvertes.fr/hal-01334734
Taking Advantage of Hybrid Systems for Sparse Direct Solvers via Task-Based Runtimes, 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, pp.29-38, 2014. ,
DOI : 10.1109/IPDPSW.2014.9
URL : https://hal.archives-ouvertes.fr/hal-00925017
Design and analysis of a task-based parallelization over a runtime system of an explicit finite-volume CFD code with adaptive time stepping, Journal of Computational Science, 2017. ,
Earth system modelling on system-level heterogeneous architectures: EMAC (version 2.42) on the Dynamical Exascale Entry Platform (DEEP) Geoscientific Model Development, pp.3483-3491, 2016. ,
Bounds for Certain Multiprocessing Anomalies, Bell System Technical Journal, vol.45, issue.9, pp.1563-1581, 1966. ,
DOI : 10.1002/j.1538-7305.1966.tb01709.x
Performance-effective and low-complexity task scheduling for heterogeneous computing, IEEE Transactions on Parallel and Distributed Systems, vol.13, issue.3, pp.260-274, 2002. ,
DOI : 10.1109/71.993206
Scheduling Multithreaded Computations by Work Stealing, J. ACM, vol.46, issue.5, pp.720-748, 1999. ,
ScaLAPACK user's guide, Society for Industrial and Applied Mathematics, 1997. ,
Using Advanced MPI: Modern Features of the Message-Passing Interface Computer science & intelligent systemsMIT Press, 2014. ,
Gantt charts: A centenary appreciation, European Journal of Operational Research, vol.149, issue.2, pp.430-437, 2003. ,
Kergommeaux Jacques Chassin. The Paje trace file format, 2016. ,
A Case Study: Holistic Performance Analysis on Heterogeneous Architectures using the Vampir Toolchain, pp.793-802, 2013. ,
Edge bundling for visualizing communication behavior, pp.1-8 ,
Visualizing Execution Traces with Task Dependencies, pp.1-2 ,
DAGViz, Proceedings of the 2nd Workshop on Visual Performance Analysis, VPA '15, pp.1-3 ,
DOI : 10.1145/1594835.1504210
Temanejo: Debugging of Thread-Based Task-Parallel Programs in StarSS, Tools for High Performance Computing 2011: Proceedings of the 5th International Workshop on Parallel Tools for High Performance Computing, pp.131-137, 2011. ,
DOI : 10.1007/978-3-642-31476-6_11
A Declarative Framework for Stateful Analysis of Execution Traces, Software Quality Journal, vol.25, issue.1, pp.201-229, 2017. ,
Bridging the Gap Between Performance and Bounds of Cholesky Factorization on Heterogeneous Platforms, Computer Society, vol.15, pp.34-45, 2015. ,
Visualizing More Performance Data Than What Fits on Your Screen, pp.149-162, 2013. ,
R: A Language and Environment for Statistical Computing. R Foundation for Statistical ComputingVienna, 2017. ,
ggplot2: Elegant Graphics for Data Analysis, 2009. ,
tidyverse: Easily Install and Load 'Tidyverse ,
plotly: Create Interactive Web Graphics via 'plotly.js'2016 ,
Implementing Multifrontal Sparse Solvers for Multicore Architectures with Sequential Task Flow Runtime Systems, ACM Trans. Math. Softw, vol.43, issue.2, pp.13-14, 2016. ,
Wacrenier Pierre-André. An Efficient Multi-level Trace Toolkit for Multi-threaded Applications, Euro- Par'05, pp.166-175, 2005. ,
feather: R Bindings to the Feather 'API'2016. R package version 0 ,
The grammar of graphics, pp.2012-375 ,
Analyzing Dynamic Task-Based Applications on Hybrid Platforms: An Agile Scripting Approach, In, vol.2016 ,
The Spack Package Manager: Bringing Order to HPC Software Chaos, In: SC ', vol.1540, pp.1-40 ,
Design and analysis of scheduling strategies for multi-CPU and multi-GPU architectures, Parallel Computing, vol.44, pp.37-52, 2015. ,
A Visual Performance Analysis Framework for Task-based Parallel Applications running on Hybrid Clusters, Concurrency and Computation: Practice and Experience, 2017. ,