StarPU: a unied platform for task scheduling on heterogeneous multicore architectures, Concurrency and Computation: Practice and Experience, p.187198, 2011. ,
A low level component model easing performance portability of HPC applications, Computing, vol.4, issue.5, pp.961115-1130, 2014. ,
DOI : 10.1007/s00607-013-0368-3
URL : https://hal.archives-ouvertes.fr/hal-00911231
Improving performance portability and exascale software productivity with the ∇ numerical programming language, Proceedings of the 3rd International Conference on Exascale Applications and Software, EASC '15, pp.126131-2015 ,
PATUS: A Code Generation and Autotuning Framework for Parallel Iterative Stencil Computations on Modern Microarchitectures, 2011 IEEE International Parallel & Distributed Processing Symposium, p.676687, 2011. ,
DOI : 10.1109/IPDPS.2011.70
The SIPSim implicit parallelism model and the SkelGIS library. Concurrency and Computation: Practice and Experience, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01216019
Algorithmic skeleton library for scientic simulations ,
Implementation and Performance Analysis of SkelGIS for Network Mesh-Based Simulations, Euro-Par 2014 Parallel Processing -20th International Conference. Proceedings, p.439450, 2014. ,
DOI : 10.1007/978-3-319-09873-9_37
URL : https://hal.archives-ouvertes.fr/hal-01094340
OpenMP: an industry standard API for shared-memory programming, IEEE Computational Science and Engineering, vol.5, issue.1, p.4655, 1998. ,
DOI : 10.1109/99.660313
Liszt, Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis on, SC '11, p.12, 2011. ,
DOI : 10.1145/2063384.2063396
A new two-dimensional Shallow Water model including pressure effects and slow varying bottom topography, ESAIM: Mathematical Modelling and Numerical Analysis, vol.38, issue.2, pp.211-234, 2004. ,
DOI : 10.1051/m2an:2004010
XKaapi: A Runtime System for Data-Flow Task Programming on Heterogeneous Architectures, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing, p.12991308, 2013. ,
DOI : 10.1109/IPDPS.2013.66
URL : https://hal.archives-ouvertes.fr/hal-00799904
The MPI 2.2 Standard and the Emerging MPI 3 Standard, Proceedings of the 16th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface, p.22, 2009. ,
DOI : 10.1007/978-3-642-03770-2_2
PaMPA: Parallel Mesh Partitioning and Adaptation, 21st International Conference on Domain Decomposition Methods (DD21), 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00879382
Scotch: A software package for static mapping by dual recursive bipartitioning of process and architecture graphs, Proceedings of the International Conference and Exhibition on High-Performance Computing and Networking, p.493498, 1996. ,
DOI : 10.1007/3-540-61142-8_588
Halide: A language and compiler for optimizing parallelism, locality, and recomputation in image processing pipelines, Proceedings of the 34th ACM SIGPLAN Conference on Programming Language Design and Implementation, PLDI '13, p.519530, 2013. ,
ExaSlang: A Domain-Specific Language for Highly Scalable Multigrid Solvers, 2014 Fourth International Workshop on Domain-Specific Languages and High-Level Frameworks for High Performance Computing, p.4251, 2014. ,
DOI : 10.1109/WOLFHPC.2014.11
OpenCL: A Parallel Programming Standard for Heterogeneous Computing Systems, Computing in Science & Engineering, vol.12, issue.3, p.6673, 2010. ,
DOI : 10.1109/MCSE.2010.69
Composition and reuse with compiled domain-specic languages, Proceedings of the 27th European Conference on Object-Oriented Programming, pp.52-78, 2013. ,
The pochoir stencil compiler, Proceedings of the 23rd ACM symposium on Parallelism in algorithms and architectures, SPAA '11, p.117128 ,
DOI : 10.1145/1989493.1989508
The recognition of series parallel digraphs, Proceedings of the Eleventh Annual ACM Symposium on Theory of Computing, STOC '79, p.112, 1979. ,
Hierarchical DAG Scheduling for Hybrid Distributed Systems, 2015 IEEE International Parallel and Distributed Processing Symposium, pp.249-6399, 2015. ,
DOI : 10.1109/IPDPS.2015.56
URL : https://hal.archives-ouvertes.fr/hal-01078359