G. Bosilca, A. Bouteiller, A. Danalis, M. Faverge, T. Herault et al., PaRSEC: Exploiting Heterogeneity to Enhance Scalability, Computing in Science & Engineering, vol.15, issue.6, pp.36-45, 2013.

A. Yarkhan, Dynamic task execution on shared and distributed memory architectures, 2012.

C. Augonnet, S. Thibault, R. Namyst, and P. Wacrenier, StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures, Lecture Notes in Computer Science, vol.5704, pp.863-874, 2009.
URL : https://hal.archives-ouvertes.fr/inria-00384363

J. Planas, R. M. Badia, E. Ayguadé, and J. Labarta, Hierarchical Task-Based Programming With StarSs, The International Journal of High Performance Computing Applications, vol.23, issue.3, pp.284-299, 2009.

E. Agullo, B. Bramas, O. Coulaud, E. Darve, M. Messner et al., Task-Based FMM for Multicore Architectures, SIAM Journal on Scientific Computing, vol.36, issue.1, pp.C66-C93, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00911856

A. Podobas, M. Brorsson, and K. Faxén, A comparative performance study of common and popular task-centric programming frameworks, Concurrency and Computation: Practice and Experience, vol.27, issue.1, pp.1-28, 2013.

D. Chasapis, M. Casas, M. Moretó, R. Vidal, E. Ayguadé et al., PARSECSs, ACM Transactions on Architecture and Code Optimization, vol.12, issue.4, pp.1-22, 2016.

A. Leist and A. Gilman, A comparative analysis of parallel programming models for c++, ICCGI, 2014.

H. Kaiser, T. Heller, B. Adelstein-lelbach, A. Serio, and D. Fey, HPX, Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models - PGAS '14, pp.1-11, 2014.

, Figure 8: Execution details for StarPU-Stencil on K40 or P100 configurations for a locality coefficient l = (2, 1).

C. Augonnet, J. Clet-ortega, S. Thibault, and R. Namyst, Data-Aware Task Scheduling on Multi-accelerator Based Platforms, 2010 IEEE 16th International Conference on Parallel and Distributed Systems, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00523937

A. Alonazi, H. Ltaief, D. Keyes, I. Said, and S. Thibault, Asynchronous Task-Based Execution of the Reverse Time Migration for the Oil and Gas Industry, 2019 IEEE International Conference on Cluster Computing (CLUSTER), pp.1-11, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02403109

E. Agullo, O. Aumage, M. Faverge, N. Furmento, F. Pruvost et al., Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model, IEEE Transactions on Parallel and Distributed Systems, pp.1-1, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01618526

, Figure 8: Execution details for StarPU-Stencil on K40 or P100 configurations for a locality coefficient l = (2, 1).

L. Papadopoulos, D. Soudris, I. Walulya, and P. Tsigas, Customization methodology for implementation of streaming aggregation in embedded systems, Journal of Systems Architecture, vol.66-67, pp.48-60, 2016.

J. Karimov, T. Rabl, and V. Markl, PolyBench: The First Benchmark for Polystores, Performance Evaluation and Benchmarking for the Era of Artificial Intelligence, pp.24-41, 2019.

A. Thiess, R. Zeller, M. Bolten, P. H. Dederichs, and S. Blügel, Massively parallel density functional calculations for thousands of atoms: KKRnano, Physical Review B, vol.85, issue.23, p.235103, 2012.

D. Soudris, R. Namyst, D. Pleiter, G. Gaydadjiev, T. Becker et al., EXA2PRO programming environment, Proceedings of the 18th International Conference on Embedded Computer Systems Architectures, Modeling, and Simulation - SAMOS '18, pp.202-209, 2018.

C. Merlet, B. Rotenberg, P. A. Madden, P. Taberna, P. Simon et al., On the molecular origin of supercapacitance in nanoporous carbon electrodes, Nature Materials, vol.11, issue.4, pp.306-310, 2012.
URL : https://hal.archives-ouvertes.fr/hal-01153072