Are Static Schedules so Bad ? A Case Study on Cholesky Factorization, IPDPS'16. Proceedings of the 30th IEEE International Parallel & Distributed Processing Symposium, IPDPS'16, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01223573
Performance prediction through simulation of a hybrid mpi/openmp application, Parallel Computing, vol.31, issue.10, pp.1013-1033, 2005. ,
Openmp application programming interface -version 5, 2018. ,
Cloudsim: a toolkit for modeling and simulation of cloud computing environments and evaluation of resource provisioning algorithms, Software: Practice and Experience, vol.41, issue.1, pp.23-50, 2011. ,
Simgrid: a toolkit for the simulation of application scheduling, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid, pp.430-437, 2001. ,
Merpsys: An environment for simulation of parallel application execution on large scale hpc systems, Simulation Modelling Practice and Theory, vol.77, pp.124-140, 2017. ,
ScalOMP: analyzing the Scalability of OpenMP applications, IWOMP 2019: 15th International Workshop on OpenMP, 11718. ,
URL : https://hal.archives-ouvertes.fr/hal-02179726
, Programming and Software Engineering book sub series, vol.11718, pp.36-49
, , 2019.
Modeling non-uniform memory access on large compute nodes with the cache-aware roofline model, IEEE Transactions on Parallel and Distributed Systems, vol.30, issue.6, pp.1374-1389, 2019. ,
URL : https://hal.archives-ouvertes.fr/hal-01924951
Ompt: An openmp tools application programming interface for performance analysis, OpenMP in the Era of Low Power Devices and Accelerators, pp.171-185, 2013. ,
Scaling to a million cores and beyond: Using light-weight simulation to understand the challenges ahead on the road to exascale, Cryptography in Cloud Computing and Recent Advances in Parallel and Distributed Systems, ICPADS 2012 Selected Papers, vol.30, pp.59-65, 2014. ,
Score-p and ompt: Navigating the perils of callback-driven parallel runtime introspection, OpenMP: Conquering the Full Hardware Spectrum, pp.21-35, 2019. ,
On the Impact of OpenMP Task Granularity, IWOMP 2018 -14th International Workshop on OpenMP for Evolving Architectures, pp.205-221, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01901806
Sensitivity of performance prediction of message passing programs, The Journal of Supercomputing, vol.17, 2000. ,
Performance analysis and modeling of task-based runtimes, 2016. ,
Parallel simulation of superscalar scheduling, 2014 43rd International Conference on Parallel Processing, pp.121-130, 2014. ,
Modeling, Prediction and Optimization of Energy Consumption of MPI Applications using SimGrid. Theses, 2019. ,
URL : https://hal.archives-ouvertes.fr/tel-02269894
Flexible performance debugging of parallel and distributed applications, Euro-Par 2003. Parallel Processing, 9th International Euro-Par Conference, vol.2790, pp.38-46, 2003. ,
Greencloud: a packet-level simulator of energy-aware cloud computing data centers, The Journal of Supercomputing, 2012. ,
Simnuma: Simulating numa-architecture multiprocessor systems efficiently, 2013 International Conference on Parallel and Distributed Systems, pp.341-348, 2013. ,
Experimental verification and analysis of dynamic loop scheduling in scientific applications, 2018 17th International Symposium on Parallel and Distributed Computing (ISPDC), pp.141-148, 2018. ,
Empirical evaluation of multicore memory concurrency, 2009. ,
Trace-driven simulation of multithreaded applications, IEEE ISPASS) IEEE International Symposium on Performance Analysis of Systems and Software, pp.87-96, 2011. ,
Hlsmn: High level multicore numa simulator. Electrotehnica, Electronica, Automatica, vol.65, issue.3, 2017. ,
Fast and Accurate Simulation of Multithreaded Sparse Linear Algebra Solvers, The 21st IEEE International Conference on Parallel and Distributed Systems, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01180272
Faithful performance prediction of a dynamic task-based runtime system for heterogeneous multi-core architectures, Concurrency and Computation: Practice and Experience, vol.27, issue.16, pp.4075-4090, 2015. ,
URL : https://hal.archives-ouvertes.fr/hal-01147997
Simulation as a tool for optimizing memory accesses on numa machines, Performance Evaluation, vol.60, issue.1-4, pp.31-50, 2005. ,
Evaluation of openmp dependent tasks with the kastors benchmark suite, International Workshop on OpenMP, pp.16-29, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01081974
Evaluation of openmp dependent tasks with the kastors benchmark suite, Using and Improving OpenMP for Devices, Tasks, and More, pp.16-29, 2014. ,
URL : https://hal.archives-ouvertes.fr/hal-01081974
Description, implementation and evaluation of an affinity clause for task directives, International Workshop on OpenMP, pp.61-73, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01343442
Bigsim: A parallel simulator for performance prediction of extremely large parallel machines, 18th International Parallel and Distributed Processing Symposium, vol.78, 2004. ,