C. Hasan-metin-aktulga, . Yang, G. Esmond, P. Ng, . Maris et al., Improving the scalability of a symmetric iterative eigensolver for multi-core platforms, Concurrency and Computation: Practice and Experience, vol.81, issue.2-3, pp.2631-2651, 2014.
DOI : 10.1103/PhysRevC.81.021301

F. Richard, C. T. Barrett, . Vaughan, A. Michael, and . Heroux, Minighost: a miniapp for exploring boundary exchange strategies using stencil computations in scientific parallel computing. Sandia National Laboratories, 2011.

G. Bosilca, C. Foyer, E. Jeannot, G. Mercier, and G. Papauré, Online dynamic monitoring of mpi communication Extended version in https, 23rd International European Conference on Parallel and Distributed Computing (EuroPar), p.12, 2017.

H. Casanova, A. Legrand, and M. Quinson, SimGrid: A Generic Framework for Large-Scale Distributed Experiments, Tenth International Conference on Computer Modeling and Simulation (uksim 2008), pp.126-131, 2008.
DOI : 10.1109/UKSIM.2008.28
URL : https://hal.archives-ouvertes.fr/inria-00260697

H. Chen, W. Chen, J. Huang, B. Robert, and H. Kuhn, MPIPP, Proceedings of the 20th annual international conference on Supercomputing , ICS '06, pp.353-360, 2006.
DOI : 10.1145/1183401.1183451

M. Deveci, S. Rajamanickam, J. Vitus, K. Leung, . Pedretti et al., Exploiting Geometric Partitioning in Task Mapping for Parallel Computers, 2014 IEEE 28th International Parallel and Distributed Processing Symposium, pp.27-36, 2014.
DOI : 10.1109/IPDPS.2014.15
URL : http://bmi.osu.edu/hpc/papers/Deveci14-IPDPS.pdf

M. Diener, H. Eduardo, . Cruz, A. Marco, . Alves et al., Locality and Balance for Communication-Aware Thread Mapping in Multicore Systems, European Conference on Parallel Processing, pp.196-208, 2015.
DOI : 10.1007/978-3-662-48096-0_16

M. Diener, H. Eduardo, . Cruz, L. Laércio, F. Pilla et al., Characterizing communication and page usage of parallel applications for thread and data mapping, Performance Evaluation, vol.88, issue.89, pp.18-36, 2015.
DOI : 10.1016/j.peva.2015.03.001
URL : https://hal.archives-ouvertes.fr/hal-01146859

E. Gabriel, E. Graham, G. Fagg, T. Bosilca, . Angskun et al., Open MPI: Goals, Concept, and Design of a Next Generation MPI Implementation, European Parallel Virtual Machine/Message Passing Interface Users' Group Meeting, pp.97-104, 2004.
DOI : 10.1007/978-3-540-30218-6_19

B. Goglin, Managing the topology of heterogeneous cluster nodes with hardware locality (hwloc), 2014 International Conference on High Performance Computing & Simulation (HPCS), pp.74-81, 2014.
DOI : 10.1109/HPCSim.2014.6903671
URL : https://hal.archives-ouvertes.fr/hal-00985096

W. Gropp, E. Lusk, and A. Skjellum, Using MPI: portable parallel programming with the message-passing interface, 1999.

J. Gustedt, E. Jeannot, and F. Mansouri, Optimizing locality by topologyaware placement for a task based programming model, Cluster Computing (CLUSTER), 2016 IEEE International Conference on, pp.164-165, 2016.
DOI : 10.1109/cluster.2016.87
URL : https://hal.archives-ouvertes.fr/hal-01416284

T. Hoefler, E. Jeannot, and G. Mercier, An overview of process mapping techniques and algorithms in high-performance computing, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00921626

T. Hoefler and M. Snir, Generic topology mapping strategies for large-scale parallel architectures, Proceedings of the international conference on Supercomputing, ICS '11, pp.75-84, 2011.
DOI : 10.1145/1995896.1995909

E. Jeannot, G. Mercier, and F. Tessier, Process Placement in Multicore Clusters:Algorithmic Issues and Practical Techniques, IEEE Transactions on Parallel and Distributed Systems, vol.25, issue.4, pp.993-1002, 2014.
DOI : 10.1109/TPDS.2013.104
URL : https://hal.archives-ouvertes.fr/hal-00803548

V. Laxmikant, S. Kale, and . Krishnan, Charm++: a portable concurrent object oriented system based on c++, ACM Sigplan Notices, pp.91-108, 1993.

G. Mercier and J. Clet-ortega, Towards an Efficient Process Placement Policy for MPI Applications in Multicore Environments, European Parallel Virtual Machine/Message Passing Interface Users' Group Meeting, pp.104-115, 2009.
DOI : 10.1109/PDP.2009.43
URL : https://hal.archives-ouvertes.fr/inria-00392581

T. Jesper-larsson, Implementing the mpi process topology mechanism, In Supercomputing , ACM/IEEE, pp.28-28, 2002.

H. Yu, I. Chung, and J. Moreira, Topology Mapping for Blue Gene/L Supercomputer, ACM/IEEE SC 2006 Conference (SC'06), p.116, 2006.
DOI : 10.1109/SC.2006.63
URL : http://www.civ.cvut.cz/others/konference_supercomputing/Proceedings_SC_2006/sc06/schedule/pdf/pap273.pdf