PaRSEC: Exploiting Heterogeneity to Enhance Scalability, Computing in Science & Engineering, vol.15, issue.6, pp.36-45, 2013. ,
Dynamic task execution on shared and distributed memory architectures, 2012. ,
StarPU: A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures, Lecture Notes in Computer Science, vol.5704, pp.863-874, 2009. ,
URL : https://hal.archives-ouvertes.fr/inria-00384363
Hierarchical Task-Based Programming With StarSs, The International Journal of High Performance Computing Applications, vol.23, issue.3, pp.284-299, 2009. ,
Task-Based FMM for Multicore Architectures, SIAM Journal on Scientific Computing, vol.36, issue.1, pp.C66-C93, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-00911856
A comparative performance study of common and popular task-centric programming frameworks, Concurrency and Computation: Practice and Experience, vol.27, issue.1, pp.1-28, 2013. ,
PARSECSs, ACM Transactions on Architecture and Code Optimization, vol.12, issue.4, pp.1-22, 2016. ,
A comparative analysis of parallel programming models for c++, ICCGI, 2014. ,
HPX, Proceedings of the 8th International Conference on Partitioned Global Address Space Programming Models - PGAS '14, pp.1-11, 2014. ,
, Figure 8: Execution details for StarPU-Stencil on K40 or P100 configurations for a locality coefficient l = (2, 1).
Data-Aware Task Scheduling on Multi-accelerator Based Platforms, 2010 IEEE 16th International Conference on Parallel and Distributed Systems, 2010. ,
URL : https://hal.archives-ouvertes.fr/inria-00523937
Asynchronous Task-Based Execution of the Reverse Time Migration for the Oil and Gas Industry, 2019 IEEE International Conference on Cluster Computing (CLUSTER), pp.1-11, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-02403109
Achieving High Performance on Supercomputers with a Sequential Task-based Programming Model, IEEE Transactions on Parallel and Distributed Systems, pp.1-1, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01618526
, Figure 8: Execution details for StarPU-Stencil on K40 or P100 configurations for a locality coefficient l = (2, 1).
Customization methodology for implementation of streaming aggregation in embedded systems, Journal of Systems Architecture, vol.66-67, pp.48-60, 2016. ,
PolyBench: The First Benchmark for Polystores, Performance Evaluation and Benchmarking for the Era of Artificial Intelligence, pp.24-41, 2019. ,
Massively parallel density functional calculations for thousands of atoms: KKRnano, Physical Review B, vol.85, issue.23, p.235103, 2012. ,
EXA2PRO programming environment, Proceedings of the 18th International Conference on Embedded Computer Systems Architectures, Modeling, and Simulation - SAMOS '18, pp.202-209, 2018. ,
On the molecular origin of supercapacitance in nanoporous carbon electrodes, Nature Materials, vol.11, issue.4, pp.306-310, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-01153072