Rank reordering for MPI communication optimization, Computers & Fluids, vol.80, 2012.
DOI : 10.1016/j.compfluid.2012.01.019
hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing, 2010.
DOI : 10.1109/PDP.2010.67
URL : https://hal.archives-ouvertes.fr/inria-00429889
LogP: Towards a realistic model of parallel computation, SIGPLAN Not., vol.28, issue.7, pp.1-12, 1993.
DOI : 10.1145/155332.155333
URL : http://www.crhc.uiuc.edu/ece412/papers/logp.pdf
Netloc: Towards a Comprehensive View of the HPC System Topology, 2014 43rd International Conference on Parallel Processing Workshops, pp.216-225, 2014.
DOI : 10.1109/ICPPW.2014.38
URL : https://hal.archives-ouvertes.fr/hal-01010599
Rank reordering strategy for MPI topology creation functions, Recent Advances in Parallel Virtual Machine and Message Passing Interface, pp.188-195.
DOI : 10.1007/BFb0056575
The communication challenge for MPP: Intel Paragon and Meiko CS-2, Parallel Computing, vol.20, issue.3.
DOI : 10.1016/S0167-8191(06)80021-9
Locality-Aware Parallel Process Mapping for Multi-core HPC Systems, 2011 IEEE International Conference on Cluster Computing, pp.527-531, 2011.
DOI : 10.1109/CLUSTER.2011.59
Implementing the MPI Process Topology Mechanism, Supercomputing '02: Proceedings of the 2002 ACM/IEEE conference on Supercomputing, 2002.
Process Placement in Multicore Clusters: Algorithmic Issues and Practical Techniques, IEEE Trans. Parallel Distrib. Syst., vol.25, issue.4, pp.993-1002, 2014.
DOI : 10.1109/tpds.2013.104
URL : https://hal.archives-ouvertes.fr/hal-00803548
Fast Measurement of LogP Parameters for Message Passing Platforms, pp.1176-1183, 2000.
DOI : 10.1007/3-540-45591-4_162
URL : http://www.cs.vu.nl/~kielmann/papers/rtspp00.ps.gz
Towards an Efficient Process Placement Policy for MPI Applications in Multicore Environments, EuroPVM/MPI, pp.104-115, 2009.
DOI : 10.1007/978-3-642-03770-2_17
URL : https://hal.inria.fr/inria-00392581/document/
Improving MPI Applications Performance on Multicore Clusters with Rank Reordering, EuroMPI, pp.39-49, 2011.
DOI : 10.1145/1183401.1183451
URL : https://hal.archives-ouvertes.fr/hal-00643151
Plate-forme fédérative pour la recherche en informatique et mathématiques.
Hierarchical Parallel Matrix Multiplication on Large-Scale Distributed Memory Platforms, 2013 42nd International Conference on Parallel Processing, pp.754-762, 2013.
DOI : 10.1109/ICPP.2013.89
URL : http://arxiv.org/pdf/1306.4161.pdf
Multi-core and Network Aware MPI Topology Functions, EuroMPI 2011. Recent Advances in the Message Passing Interface - 18th European MPI Users' Group Meeting, pp.50-60.
DOI : 10.1109/PDP.2010.67
URL : http://post.queensu.ca/~afsahi/PPRL/papers/EuroMPI-2011.pdf
High Performance Parallelism Pearls, 2015.
SUMMA: scalable universal matrix multiplication algorithm, Concurrency: Practice and Experience, vol.9, issue.4, pp.255-274, 1997.
DOI : 10.1002/(SICI)1096-9128(199704)9:4<255::AID-CPE250>3.0.CO;2-2
Process Mapping for MPI Collective Communications, Euro-Par, pp.81-92, 2009.
DOI : 10.1109/ICPP.2005.62
URL : http://hpc.cs.tsinghua.edu.cn/research/cluster/papers_cwg/europar_zhang.pdf
Hierarchical Collectives in MPICH2, Proceedings of the 16th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface, pp.325-326, 2009.
DOI : 10.1109/JSSC.2007.910957
URL : http://www.mcs.anl.gov/uploads/cels/papers/P1622.pdf