Improving the scalability of a symmetric iterative eigensolver for multi-core platforms, Concurrency and Computation: Practice and Experience, vol.81, issue.2-3, pp.2631-2651, 2014. ,
DOI : 10.1103/PhysRevC.81.021301
Minighost: a miniapp for exploring boundary exchange strategies using stencil computations in scientific parallel computing. Sandia National Laboratories, 2011. ,
Online dynamic monitoring of mpi communication Extended version in https, 23rd International European Conference on Parallel and Distributed Computing (EuroPar), p.12, 2017. ,
SimGrid: A Generic Framework for Large-Scale Distributed Experiments, Tenth International Conference on Computer Modeling and Simulation (uksim 2008), pp.126-131, 2008. ,
DOI : 10.1109/UKSIM.2008.28
URL : https://hal.archives-ouvertes.fr/inria-00260697
MPIPP, Proceedings of the 20th annual international conference on Supercomputing , ICS '06, pp.353-360, 2006. ,
DOI : 10.1145/1183401.1183451
Exploiting Geometric Partitioning in Task Mapping for Parallel Computers, 2014 IEEE 28th International Parallel and Distributed Processing Symposium, pp.27-36, 2014. ,
DOI : 10.1109/IPDPS.2014.15
URL : http://bmi.osu.edu/hpc/papers/Deveci14-IPDPS.pdf
Locality and Balance for Communication-Aware Thread Mapping in Multicore Systems, European Conference on Parallel Processing, pp.196-208, 2015. ,
DOI : 10.1007/978-3-662-48096-0_16
Characterizing communication and page usage of parallel applications for thread and data mapping, Performance Evaluation, vol.88, issue.89, pp.18-36, 2015. ,
DOI : 10.1016/j.peva.2015.03.001
URL : https://hal.archives-ouvertes.fr/hal-01146859
Open MPI: Goals, Concept, and Design of a Next Generation MPI Implementation, European Parallel Virtual Machine/Message Passing Interface Users' Group Meeting, pp.97-104, 2004. ,
DOI : 10.1007/978-3-540-30218-6_19
Managing the topology of heterogeneous cluster nodes with hardware locality (hwloc), 2014 International Conference on High Performance Computing & Simulation (HPCS), pp.74-81, 2014. ,
DOI : 10.1109/HPCSim.2014.6903671
URL : https://hal.archives-ouvertes.fr/hal-00985096
Using MPI: portable parallel programming with the message-passing interface, 1999. ,
Optimizing locality by topologyaware placement for a task based programming model, Cluster Computing (CLUSTER), 2016 IEEE International Conference on, pp.164-165, 2016. ,
DOI : 10.1109/cluster.2016.87
URL : https://hal.archives-ouvertes.fr/hal-01416284
An overview of process mapping techniques and algorithms in high-performance computing, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-00921626
Generic topology mapping strategies for large-scale parallel architectures, Proceedings of the international conference on Supercomputing, ICS '11, pp.75-84, 2011. ,
DOI : 10.1145/1995896.1995909
Process Placement in Multicore Clusters:Algorithmic Issues and Practical Techniques, IEEE Transactions on Parallel and Distributed Systems, vol.25, issue.4, pp.993-1002, 2014. ,
DOI : 10.1109/TPDS.2013.104
URL : https://hal.archives-ouvertes.fr/hal-00803548
Charm++: a portable concurrent object oriented system based on c++, ACM Sigplan Notices, pp.91-108, 1993. ,
Towards an Efficient Process Placement Policy for MPI Applications in Multicore Environments, European Parallel Virtual Machine/Message Passing Interface Users' Group Meeting, pp.104-115, 2009. ,
DOI : 10.1109/PDP.2009.43
URL : https://hal.archives-ouvertes.fr/inria-00392581
Implementing the mpi process topology mechanism, In Supercomputing , ACM/IEEE, pp.28-28, 2002. ,
Topology Mapping for Blue Gene/L Supercomputer, ACM/IEEE SC 2006 Conference (SC'06), p.116, 2006. ,
DOI : 10.1109/SC.2006.63
URL : http://www.civ.cvut.cz/others/konference_supercomputing/Proceedings_SC_2006/sc06/schedule/pdf/pap273.pdf