R. M. Badia, J. Labarta, J. Giménez, and F. Escalé, Dimemas: Predicting MPI Applications Behaviour in Grid Environments, Proc. of the Workshop on Grid Applications and Programming Tools, 2003.

G. Zheng, G. Kakulapati, and L. Kale, BigSim: A Parallel Simulator for Performance Prediction of Extremely Large Parallel Machines, Proc. of the 18th IPDPS, 2004.

T. Hoefler, C. Siebert, and A. Lumsdaine, LogGOPSim -Simulating Large-Scale Applications in the LogGOPS Model, Proc. of the LSAP Workshop, pp.597-604, 2010.

C. L. Janssen, H. Adalsteinsson, S. Cranford, J. P. Kenny, A. Pinar et al., A simulator for large-scale parallel architectures, International Journal of Parallel and Distributed Systems, vol.1, issue.2, pp.57-73, 2010.

C. Engelmann, Scaling to a million cores and beyond: Using light-weight simulation to understand the challenges ahead on the road to exascale, Future Generation Computer Systems, vol.30, pp.59-65, 2014.
DOI : 10.1016/j.future.2013.04.014

M. Mubarak, C. D. Carothers, R. B. Ross, and P. H. Carns, Enabling Parallel Simulation of Large-Scale HPC Network Systems, IEEE Transactions on Parallel and Distributed Systems, vol.28, issue.1, 2016.
DOI : 10.1109/TPDS.2016.2543725

P. Velho, L. Schnorr, H. Casanova, and A. Legrand, On the validity of flow-level tcp network models for grid and cloud simulations, ACM Transactions on Modeling and Computer Simulation, vol.23, issue.4, p.23, 2013.
DOI : 10.1145/2517448

URL : https://hal.archives-ouvertes.fr/hal-00872476

A. Faraj, X. Yuan, and D. Lowenthal, STAR-MPI, Proceedings of the 20th annual international conference on Supercomputing , ICS '06, pp.199-208, 2006.
DOI : 10.1145/1183401.1183431

M. Tikir, M. Laurenzano, L. Carrington, and A. Snavely, PSINS: An Open Source Event Tracer and Execution Simulator for MPI Applications, Proc. of the 15th International EuroPar Conference, ser. LNCS, pp.135-148, 2009.
DOI : 10.1007/BFb0052218

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=

A. Núñez, J. Fernández, J. Garcia, F. Garcia, and J. Carretero, New techniques for simulating high performance MPI applications on large storage networks, 2008 IEEE International Conference on Cluster Computing, pp.40-57, 2010.
DOI : 10.1109/CLUSTR.2008.4663806

J. Zhai, W. Chen, and W. Zheng, PHANTOM: Predicting Performance of Parallel Applications on Large-Scale Parallel Machines Using a Single Node, Proc. of the 15th ACM SIGPLAN PPoPP Symp, pp.305-314, 2010.

M. Hermanns, M. Geimer, F. Wolf, and B. Wylie, Verifying Causality between Distant Performance Phenomena in Large-Scale MPI Applications, 2009 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing, pp.78-84, 2009.
DOI : 10.1109/PDP.2009.50

G. Zheng, S. Negara, C. L. Mendes, E. R. Rodrigues, and L. Kale, Automatic Handling of Global Variables for Multi-threaded MPI Programs, 2011 IEEE 17th International Conference on Parallel and Distributed Systems, pp.220-227, 2011.
DOI : 10.1109/ICPADS.2011.33

A. Snavely, L. Carrington, N. Wolter, J. Labarta, R. Badia et al., A Framework for Performance Modeling and Prediction, ACM/IEEE SC 2002 Conference (SC'02), 2002.
DOI : 10.1109/SC.2002.10004

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=

L. Carrington, M. Laurenzano, and A. Tiwari, Inferring largescale computation behavior via trace extrapolation, Proc. of the Workshop on Large-Scale Parallel Processing, 2013.
DOI : 10.1109/ipdpsw.2013.137

H. Casanova, F. Desprez, G. S. Markomanolis, and F. Suter, Simulation of MPI applications with time-independent traces, Concurrency and Computation: Practice and Experience, vol.129, issue.014109, pp.1145-1168, 2015.
DOI : 10.1002/cpe.3278

URL : https://hal.archives-ouvertes.fr/hal-01232776

X. Wu and F. Mueller, Elastic and scalable tracing and accurate replay of non-deterministic events, Proceedings of the 27th international ACM conference on International conference on supercomputing, ICS '13, pp.59-68, 2013.
DOI : 10.1145/2464996.2465001

H. Casanova, A. Gupta, and F. Suter, Toward More Scalable Off-Line Simulations of MPI Applications, Parallel Processing Letters, vol.25, issue.03, p.1541002, 2015.
DOI : 10.1142/S0129626415410029

URL : https://hal.archives-ouvertes.fr/hal-01232787

F. Ino, N. Fujimoto, and K. Hagihara, LogGPS: a Parallel Computational Model for Synchronization Analysis, Proc. of the 8th ACM SIGPLAN PPoPP Symp, pp.133-142, 2001.

B. Penoff, A. Wagner, M. Tüxen, and I. Rüngeler, MPI-NeTSim: A Network Simulation Module for MPI, 2009 15th International Conference on Parallel and Distributed Systems, 2009.
DOI : 10.1109/ICPADS.2009.116

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=

G. F. Lucio, M. Paredes-farrera, E. Jammeh, M. Fleury, and M. J. Reed, OPNET Modeler and ns-2: Comparing the Accuracy of Network Simulators for Packet-Level Analysis Using a Network Testbed, Proc. of the 3rd WEAS International Conference on Simulation, Modelling and Optimization, pp.700-707, 2003.

D. Culler, R. Karp, D. Patterson, A. Sahay, K. E. Schauser et al., LogP: Towards a Realistic Model of Parallel Computation, Proc. of the 4th ACM SIGPLAN PPoPP Symp, pp.1-12, 1993.

C. A. Moritz and M. I. Frank, LoGPG: Modeling network contention in message-passing programs, IEEE Transactions on Parallel and Distributed Systems, vol.12, issue.4, pp.404-415, 2001.
DOI : 10.1109/71.920589

L. Yuan, Y. Zhang, Y. Tang, L. Rao, and X. Sun, LogGPH: A Parallel Computational Model with Hierarchical Communication Awareness, 2010 13th IEEE International Conference on Computational Science and Engineering, pp.268-274, 2010.
DOI : 10.1109/CSE.2010.40

J. Rico-gallego and J. Díaz-martín, ? -Lop: Modeling Performance of Shared Memory MPI, Parallel Computing, vol.46, 2015.
DOI : 10.1007/978-3-642-24449-0_42

P. Bedaride, A. Degomme, S. Genaud, A. Legrand, G. Markomanolis et al., Toward Better Simulation of MPI Applications on Ethernet/TCP Networks, Proc. of the 4th Intl. Workshop on Performance Modeling , Benchmarking and Simulation, ser. LNCS, pp.158-181, 2013.
DOI : 10.1007/978-3-319-10214-6_8

URL : https://hal.archives-ouvertes.fr/hal-00919507

R. Bolze, F. Cappello, E. Caron, M. Daydé, F. Desprez et al., Grid'5000: A Large Scale And Highly Reconfigurable Experimental Grid Testbed, International Journal of High Performance Computing Applications, vol.20, issue.4, pp.481-494, 2006.
DOI : 10.1177/1094342006070078

URL : https://hal.archives-ouvertes.fr/hal-00684943

E. León, R. Riesen, and A. Maccabe, Instruction-level simulation of a cluster at scale, Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, SC '09, 2009.
DOI : 10.1145/1654059.1654063

A. Rodrigues, K. Hemmert, B. Barrett, C. Kersey, R. Oldfield et al., The structural simulation toolkit, ACM SIGMETRICS Performance Evaluation Review, vol.38, issue.4, pp.37-42, 2011.
DOI : 10.1145/1964218.1964225

URL : http://www.osti.gov/scitech/servlets/purl/1088057

L. Stanisic, E. Agullo, A. Buttari, A. Guermouche, A. Legrand et al., Fast and Accurate Simulation of Multithreaded Sparse Linear Algebra Solvers, 2015 IEEE 21st International Conference on Parallel and Distributed Systems (ICPADS), 2015.
DOI : 10.1109/ICPADS.2015.67

URL : https://hal.archives-ouvertes.fr/hal-01180272

D. Grove and P. Coddington, Communication Benchmarking and Performance Modelling of MPI Programs on Cluster Computers, The Journal of Supercomputing, vol.23, issue.1/2, pp.201-217, 2005.
DOI : 10.1007/s11227-005-2340-2

T. Hoefler, C. Siebert, and A. Lumsdaine, Group Operation Assembly Language - A Flexible Way to Express Collective Communication, 2009 International Conference on Parallel Processing, 2009.
DOI : 10.1109/ICPP.2009.70

R. Thakur, R. Rabenseifner, and W. Gropp, Optimization of Collective Communication Operations in MPICH, International Journal of High Performance Computing Applications, vol.19, issue.1, pp.49-66, 2005.
DOI : 10.1177/1094342005051521

D. Panda, K. Tomko, K. Schulz, and A. Majumdar, The MVAPICH Project: Evolution and Sustainability of an Open Source Production Quality MPI Library for HPC, Intl. Workshop on Sustainable Software for Science: Practice and Experiences, 2013.

R. Fujimoto, Parallel and distributed simulation systems, Proceeding of the 2001 Winter Simulation Conference (Cat. No.01CH37304), 2000.
DOI : 10.1109/WSC.2001.977259

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=

M. Mubarak, C. D. Carothers, R. Ross, and P. Carns, Modeling a Million-Node Dragonfly Network Using Massively Parallel Discrete-Event Simulation, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, pp.366-376, 2012.
DOI : 10.1109/SC.Companion.2012.56

W. T. Tang, R. S. Goh, and I. L. Thng, Ladder queue, ACM Transactions on Modeling and Computer Simulation, vol.15, issue.3, pp.175-204, 2005.
DOI : 10.1145/1103323.1103324

M. Quinson, C. Rosa, and C. Thiéry, Parallel simulation of peerto-peer systems, Proc. of the 12th IEEE/ACM Intl. Symposium on Cluster, Cloud and Grid Computing, 2012.

B. Donassolo, H. Casanova, A. Legrand, and P. Velho, Fast and Scalable Simulation of Volunteer Computing Systems Using Sim- Grid, Proc. of the Workshop on Large-Scale System and Application Performance, 2010.

H. Casanova, A. Giersch, A. Legrand, M. Quinson, and F. Suter, Versatile, scalable, and accurate simulation of distributed applications and platforms, Journal of Parallel and Distributed Computing, vol.74, issue.10, pp.2899-2917, 2014.
DOI : 10.1016/j.jpdc.2014.06.008

URL : https://hal.archives-ouvertes.fr/hal-01017319

S. Liang, R. Noronha, and D. K. Panda, Swapping to Remote Memory over InfiniBand: An Approach using a High Performance Network Block Device, 2005 IEEE International Conference on Cluster Computing, 2005.
DOI : 10.1109/CLUSTR.2005.347050

L. Genovese, A. Neelov, S. Goedecker, T. Deutsch, S. A. Ghasemi et al., Daubechies wavelets as a basis set for density functional pseudopotential calculations, The Journal of Chemical Physics, vol.129, issue.1, 2008.
DOI : 10.1063/1.2949547

P. Clauss, M. Stillwell, S. Genaud, F. Suter, H. Casanova et al., Single Node On-Line Simulation of MPI Applications with SMPI, 2011 IEEE International Parallel & Distributed Processing Symposium, 2011.
DOI : 10.1109/IPDPS.2011.69

URL : https://hal.archives-ouvertes.fr/inria-00527150

M. Banikazemi, J. Sampathkumar, S. Prabhu, D. Panda, and P. Sadayappan, Communication modeling of heterogeneous networks of workstations for performance characterization of collective operations, Proceedings. Eighth Heterogeneous Computing Workshop (HCW'99), pp.125-133, 1999.
DOI : 10.1109/HCW.1999.765117

G. Zarza, D. Lugones, D. Franco, and E. Luque, An Innovative Teaching Strategy to Understand High-Performance Systems through Performance Evaluation, Proc. of the International Conference on Computational Science, pp.1733-1742, 2012.
DOI : 10.1016/j.procs.2012.04.191

M. Guthmuller, M. Quinson, and G. Corona, System-level State Equality Detection for the Formal Dynamic Verification of Legacy Distributed Applications, " in Formal Approaches to Parallel and Distributed Systems -Special Session of Parallel, 2015.

L. Stanisic, S. Thibault, A. Legrand, B. Videau, and J. Méhaut, Faithful performance prediction of a dynamic task-based runtime system for heterogeneous multi-core architectures, Concurrency and Computation: Practice and Experience, 2015.
DOI : 10.1002/cpe.3555

URL : https://hal.archives-ouvertes.fr/hal-01147997