H. Werner, S. Erich, D. Jack, S. Horst, and D. , The TOP500: History, Trends, and Future Directions in High Performance Computing

A. Eduard, C. Nawal, and . Duran-alejandro, The design of OpenMP tasks, IEEE Transactions on Parallel and Distributed Systems, vol.20, issue.3, pp.404-418, 2009.

A. Duran-alejandro, B. Eduard, and M. Rosa, OmpSs: a proposal for programming heterogeneous multi-core architectures, Par. Proc. Letters, vol.21, issue.02, 2011.

B. George, B. Aurelien, D. Anthony, F. Mathieu, H. Thomas et al., Exploiting heterogeneity to enhance scalability, Computing in Science & Engineering, vol.15, issue.6, pp.36-45, 2013.

A. Cédric, T. Samuel, N. Raymond, and W. Pierre-andré, StarPU: a unified platform for task scheduling on heterogeneous multicore architectures, Conc. and Comp.: Pract. and Exp, vol.23, issue.2, 2011.

V. Pillet, J. Labarta, T. Cortes, and S. Girona, PARAVER: A Tool to Visualize and Analyze Parallel Code, Proceedings of WoTUG-18: Transputer and occam Developments, pp.17-31, 1995.

E. Isaacs-katherine, B. Peer-timo, and J. Ilir, Combing the Communication Hairball: Visualizing Parallel Execution Traces using Logical Time, IEEE Transactions on Visualization and Computer Graphics, vol.20, issue.12, pp.2349-2358, 2014.
DOI : 10.1109/TVCG.2014.2346456

K. Andreas, B. Holger, and . Jens, The Vampir performance analysis tool-set, pp.139-155, 2008.

S. Eric, D. Dan, D. Thomas, and D. Carsten, A Multi-Language Computing Environment for Literate Programming and Reproducible Research, J. of Stat. Soft, vol.46, issue.3, 2012.

E. Agullo, G. Bosilca, and B. Bramas, Poster: Matrices over Runtime Systems at Exascale, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, pp.1332-1332, 2012.
DOI : 10.1109/SC.Companion.2012.168

A. Cédric, A. Olivier, F. Nathalie, N. Raymond, and T. Samuel, StarPU-MPI: Task Programming over Clusters of Machines Enhanced with Accelerators, In: EuroMPI', vol.12, pp.298-299, 2012.

A. Emmanuel, D. Jim, and D. Jack, Numerical linear algebra on emerging architectures: The PLASMA and MAGMA projects, Journal of Physics: Conference Series, vol.180, issue.1, 2009.

B. George, B. Aurelien, D. Anthony, H. Thomas, L. Pierre et al., DAGuE: A generic distributed {DAG} engine for High Performance Computing, Extensions for Next-Generation Parallel Programming Models, pp.37-51, 2012.

O. Satoshi, K. Satoshi, N. Kengo, T. Samuel, and N. Raymond, Implementation of FEM Application on GPU with StarPU, In: SIAM, 2013.

M. Víctor, M. David, and D. Fabrice, Towards seismic wave modeling on heterogeneous many-core architectures using task-based runtime system, 2015.

E. Agullo, L. Giraud, A. Guermouche, S. Nakov, and J. Roman, Task-Based Conjugate Gradient: From Multi-GPU Towards Heterogeneous Architectures, Euro-Par 2016: Parallel Processing Workshops: Euro-Par 2016 International Workshops, pp.69-82, 2016.
DOI : 10.1137/1.9780898718003

URL : https://hal.archives-ouvertes.fr/hal-01334734

X. Lacoste, M. Faverge, G. Bosilca, P. Ramet, and S. Thibault, Taking Advantage of Hybrid Systems for Sparse Direct Solvers via Task-Based Runtimes, 2014 IEEE International Parallel & Distributed Processing Symposium Workshops, pp.29-38, 2014.
DOI : 10.1109/IPDPSW.2014.9

URL : https://hal.archives-ouvertes.fr/hal-00925017

C. Jean, M. Couteyen, R. Jean, and B. Pierre, Design and analysis of a task-based parallelization over a runtime system of an explicit finite-volume CFD code with adaptive time stepping, Journal of Computational Science, 2017.

C. Michalis, C. Theodoros, M. Julián, A. Damian, and M. Hendrik, Earth system modelling on system-level heterogeneous architectures: EMAC (version 2.42) on the Dynamical Exascale Entry Platform (DEEP) Geoscientific Model Development, pp.3483-3491, 2016.

R. L. Graham, Bounds for Certain Multiprocessing Anomalies, Bell System Technical Journal, vol.45, issue.9, pp.1563-1581, 1966.
DOI : 10.1002/j.1538-7305.1966.tb01709.x

H. Topcuoglu, S. Hariri, and . Wu-min-you, Performance-effective and low-complexity task scheduling for heterogeneous computing, IEEE Transactions on Parallel and Distributed Systems, vol.13, issue.3, pp.260-274, 2002.
DOI : 10.1109/71.993206

B. Robert, D. , L. Charles, and E. , Scheduling Multithreaded Computations by Work Stealing, J. ACM, vol.46, issue.5, pp.720-748, 1999.

L. S. Blackford, J. Choi, and A. Cleary, ScaLAPACK user's guide, Society for Industrial and Applied Mathematics, 1997.

W. Gropp, T. Hoefler, R. Thakur, and E. Lusk, Using Advanced MPI: Modern Features of the Message-Passing Interface Computer science & intelligent systemsMIT Press, 2014.

W. James and M. , Gantt charts: A centenary appreciation, European Journal of Operational Research, vol.149, issue.2, pp.430-437, 2003.

F. Schnorr-lucas-mello, T. Mathieu, . François, and . Oliveira-stein-benhur, Kergommeaux Jacques Chassin. The Paje trace file format, 2016.

D. Robert, W. Frank, W. Thomas, S. Jonas, H. Robert et al., A Case Study: Holistic Performance Analysis on Heterogeneous Architectures using the Vampir Toolchain, pp.793-802, 2013.

B. Ronny, H. Michael, B. Holger, H. Tobias, and W. Matthias, Edge bundling for visualizing communication behavior, pp.1-8

H. Blake, R. Stephen, K. Jakub, S. Chad, A. et al., Visualizing Execution Traces with Task Dependencies, pp.1-2

T. Huynh-an, P. Douglas, T. Miquel, and . Kenjiro, DAGViz, Proceedings of the 2nd Workshop on Visual Performance Analysis, VPA '15, pp.1-3
DOI : 10.1145/1594835.1504210

B. Keller-rainer, G. Steffen, N. José, and . Christoph, Temanejo: Debugging of Thread-Based Task-Parallel Programs in StarSS, Tools for High Performance Computing 2011: Proceedings of the 5th International Workshop on Parallel Tools for High Performance Computing, pp.131-137, 2011.
DOI : 10.1007/978-3-642-31476-6_11

W. Florian, E. Naser, D. Michel, and R. , A Declarative Framework for Stateful Analysis of Execution Traces, Software Quality Journal, vol.25, issue.1, pp.201-229, 2017.

A. Emmanuel, B. Olivier, and E. Lionel, Bridging the Gap Between Performance and Bounds of Cholesky Factorization on Heterogeneous Platforms, Computer Society, vol.15, pp.34-45, 2015.

S. Lucas, M. , and L. Arnaud, Visualizing More Performance Data Than What Fits on Your Screen, pp.149-162, 2013.

R. Team, R: A Language and Environment for Statistical Computing. R Foundation for Statistical ComputingVienna, 2017.

W. Hadley, ggplot2: Elegant Graphics for Data Analysis, 2009.

W. Hadley, tidyverse: Easily Install and Load 'Tidyverse

S. Carson, P. Chris, and H. Toby, plotly: Create Interactive Web Graphics via 'plotly.js'2016

A. Emmanuel, B. Alfredo, G. Abdou, and L. Florent, Implementing Multifrontal Sparse Solvers for Multicore Architectures with Sequential Task Flow Runtime Systems, ACM Trans. Math. Softw, vol.43, issue.2, pp.13-14, 2016.

D. Vincent and N. Raymond, Wacrenier Pierre-André. An Efficient Multi-level Trace Toolkit for Multi-threaded Applications, Euro- Par'05, pp.166-175, 2005.

W. Hadley, feather: R Bindings to the Feather 'API'2016. R package version 0

W. Leland, The grammar of graphics, pp.2012-375

S. Garcia-pinto-vinicius, L. Luka, . Arnaud, T. Mello-schnorr-lucas, D. Samuel et al., Analyzing Dynamic Task-Based Applications on Hybrid Platforms: An Agile Scripting Approach, In, vol.2016

G. Todd, L. Matthew, C. Michael, and R. , The Spack Package Manager: Bringing Order to HPC Software Chaos, In: SC ', vol.1540, pp.1-40

L. João, V. F. Gautier-thierry, D. Vincent, R. Bruno, and M. Nicolas, Design and analysis of scheduling strategies for multi-CPU and multi-GPU architectures, Parallel Computing, vol.44, pp.37-52, 2015.

. V. How, L. Garcia-pinto, L. Mello-schnorr, A. Stanisic, S. Legrand et al., A Visual Performance Analysis Framework for Task-based Parallel Applications running on Hybrid Clusters, Concurrency and Computation: Practice and Experience, 2017.