E. Agullo, O. Beaumont, L. Eyraud-Dubois, and S. Kumar, Are Static Schedules so Bad? A Case Study on Cholesky Factorization, Proceedings of the 30th IEEE International Parallel & Distributed Processing Symposium, IPDPS'16, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01223573

R. Aversa, B. Di Martino, M. Rak, S. Venticinque, and U. Villano, Performance prediction through simulation of a hybrid MPI/OpenMP application, Parallel Computing, vol.31, issue.10, pp.1013-1033, 2005.

OpenMP Architecture Review Board, OpenMP application programming interface, version 5, 2018.

R. N. Calheiros, R. Ranjan, A. Beloglazov, C. A. De Rose, and R. Buyya, CloudSim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms, Software: Practice and Experience, vol.41, issue.1, pp.23-50, 2011.

H. Casanova, SimGrid: a toolkit for the simulation of application scheduling, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid, pp.430-437, 2001.

P. Czarnul, J. Kuchta, M. Matuszek, J. Proficz, P. Rościszewski et al., MERPSYS: An environment for simulation of parallel application execution on large scale HPC systems, Simulation Modelling Practice and Theory, vol.77, pp.124-140, 2017.

A. Daumen, P. Carribault, F. Trahay, and G. Thomas, ScalOMP: analyzing the Scalability of OpenMP applications, IWOMP 2019: 15th International Workshop on OpenMP, Programming and Software Engineering book sub series, vol.11718, pp.36-49, Springer, 2019.
URL : https://hal.archives-ouvertes.fr/hal-02179726

N. Denoyelle, B. Goglin, A. Ilic, E. Jeannot, and L. Sousa, Modeling non-uniform memory access on large compute nodes with the cache-aware roofline model, IEEE Transactions on Parallel and Distributed Systems, vol.30, issue.6, pp.1374-1389, 2019.
URL : https://hal.archives-ouvertes.fr/hal-01924951

A. E. Eichenberger, J. Mellor-Crummey, M. Schulz, M. Wong, N. Copty et al., OMPT: An OpenMP tools application programming interface for performance analysis, OpenMP in the Era of Low Power Devices and Accelerators, pp.171-185, 2013.

C. Engelmann, Scaling to a million cores and beyond: Using light-weight simulation to understand the challenges ahead on the road to exascale, Cryptography in Cloud Computing and Recent Advances in Parallel and Distributed Systems, ICPADS 2012 Selected Papers, vol.30, pp.59-65, 2014.

C. Feld, S. Convent, M. A. Hermanns, J. Protze, M. Geimer et al., Score-P and OMPT: Navigating the perils of callback-driven parallel runtime introspection, OpenMP: Conquering the Full Hardware Spectrum, pp.21-35, 2019.

T. Gautier, C. Pérez, and J. Richard, On the Impact of OpenMP Task Granularity, IWOMP 2018 - 14th International Workshop on OpenMP for Evolving Architectures, pp.205-221, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01901806

S. Girona and J. Labarta, Sensitivity of performance prediction of message passing programs, The Journal of Supercomputing, vol.17, 2000.

B. Haugen, Performance analysis and modeling of task-based runtimes, 2016.

B. Haugen, J. Kurzak, A. Yarkhan, P. Luszczek, and J. Dongarra, Parallel simulation of superscalar scheduling, 2014 43rd International Conference on Parallel Processing, pp.121-130, 2014.

F. Heinrich, Modeling, Prediction and Optimization of Energy Consumption of MPI Applications using SimGrid. Theses, 2019.
URL : https://hal.archives-ouvertes.fr/tel-02269894

J. Chassin de Kergommeaux, C. Guilloud, and B. de Oliveira Stein, Flexible performance debugging of parallel and distributed applications, Euro-Par 2003. Parallel Processing, 9th International Euro-Par Conference, vol.2790, pp.38-46, 2003.

D. Kliazovich, P. Bouvry, and S. U. Khan, GreenCloud: a packet-level simulator of energy-aware cloud computing data centers, The Journal of Supercomputing, 2012.

Y. Liu, Y. Zhu, X. Li, Z. Ni, T. Liu et al., SimNUMA: Simulating NUMA-architecture multiprocessor systems efficiently, 2013 International Conference on Parallel and Distributed Systems, pp.341-348, 2013.

A. Mohammed, A. Eleliemy, F. M. Ciorba, F. Kasielke, and I. Banicescu, Experimental verification and analysis of dynamic loop scheduling in scientific applications, 2018 17th International Symposium on Parallel and Distributed Computing (ISPDC), pp.141-148, 2018.

A. Porterfield, R. Fowler, A. Mandal, and M. Y. Lim, Empirical evaluation of multicore memory concurrency, 2009.

A. Rico, A. Duran, F. Cabarcas, Y. Etsion, A. Ramirez et al., Trace-driven simulation of multithreaded applications, (IEEE ISPASS) IEEE International Symposium on Performance Analysis of Systems and Software, pp.87-96, 2011.

M. Slimane and L. Sekhri, HLSMN: High level multicore NUMA simulator, Electrotehnica, Electronica, Automatica, vol.65, issue.3, 2017.

L. Stanisic, E. Agullo, A. Buttari, A. Guermouche, A. Legrand et al., Fast and Accurate Simulation of Multithreaded Sparse Linear Algebra Solvers, The 21st IEEE International Conference on Parallel and Distributed Systems, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01180272

L. Stanisic, S. Thibault, A. Legrand, B. Videau, and J. F. Méhaut, Faithful performance prediction of a dynamic task-based runtime system for heterogeneous multi-core architectures, Concurrency and Computation: Practice and Experience, vol.27, issue.16, pp.4075-4090, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01147997

J. Tao, M. Schulz, and W. Karl, Simulation as a tool for optimizing memory accesses on NUMA machines, Performance Evaluation, vol.60, issue.1-4, pp.31-50, 2005.

P. Virouleau, P. Brunet, F. Broquedis, N. Furmento, S. Thibault et al., Evaluation of OpenMP dependent tasks with the KASTORS benchmark suite, International Workshop on OpenMP, pp.16-29, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01081974

P. Virouleau, A. Roussel, F. Broquedis, T. Gautier, F. Rastello et al., Description, implementation and evaluation of an affinity clause for task directives, International Workshop on OpenMP, pp.61-73, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01343442

G. Zheng, G. Kakulapati, and L. V. Kalé, BigSim: A parallel simulator for performance prediction of extremely large parallel machines, 18th International Parallel and Distributed Processing Symposium, vol.78, 2004.