D. A. Mallon, N. Eicker, M. E. Innocenti, G. Lapenta, T. Lippert et al., On the scalability of the clusters-booster concept, Proceedings of the Future HPC Systems on the Challenges of Power-Constrained Performance, FutureHPC '12, pp.1-3, 2012.
DOI : 10.1145/2322156.2322159

A. Duran, E. Ayguadé, M. Rosa, J. Badia, L. Labarta et al., OmpSs: a proposal for programming heterogeneous multi-core architectures. Parallel Processing Letters, pp.173-193, 2011.

J. Heichler, Picking the right number of targets per server for BeeGFS R, 2015.

W. Frings, F. Wolf, and V. Petkov, Scalable massively parallel I/O to task-local files, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, SC '09, pp.1-17, 2009.
DOI : 10.1145/1654059.1654077

J. Viquerat, M. Klemm, S. Lanteri, and C. Scheid, Theoretical and numerical analysis of local dispersion models coupled to a Discontinuous Galerkin Time-Domain method for Maxwell's equations, 2013.

L. Fezoui, S. Lanteri, S. Lohrengel, and S. Piperno, Convergence and stability of a discontinuous Galerkin time-domain method for the 3D heterogeneous Maxwell equations on unstructured meshes, ESAIM: Mathematical Modelling and Numerical Analysis, vol.39, issue.6, pp.1149-1176, 2005.
DOI : 10.1051/m2an:2005049

URL : https://hal.archives-ouvertes.fr/hal-00210500

S. Jan, T. Hesthaven, and . Warburton, Nodal Discontinuous Galerkin methods: algorithms, analysis, and applications, 2008.

C. Durochat, S. Lanteri, and R. Léger, A non-conforming multi-element DGTD method for the simulation of human exposure to electromagnetic waves, Modelling: Electronic Networks, Devices and Fields, pp.614-625, 2014.
DOI : 10.1002/jnm.1943

URL : https://hal.archives-ouvertes.fr/hal-00915353

T. Cabel, J. Charles, and S. Lanteri, Performance Evaluation of a Multi-GPU Enabled Finite Element Method for Computational Electromagnetics, Lecture Notes in Computer Science, vol.7156, pp.355-364, 2011.
DOI : 10.1007/978-3-642-29740-3_40

V. Pillet, J. Labarta, T. Cortes, and S. Girona, Paraver: A tool to visualize and analyze parallel code, Proceedings of WoTUG-18: Transputer and occam Developments, pp.17-31, 1995.

G. Karypis and V. Kumar, A Fast and High Quality Multilevel Scheme for Partitioning Irregular Graphs, SIAM Journal on Scientific Computing, vol.20, issue.1, pp.359-392, 1998.
DOI : 10.1137/S1064827595287997

W. Liu and A. Sherman, Comparative Analysis of the Cuthill???McKee and the Reverse Cuthill???McKee Ordering Algorithms for Sparse Matrices, SIAM Journal on Numerical Analysis, vol.13, issue.2, pp.198-213, 1976.
DOI : 10.1137/0713020