M. Deveci, K. Kaya, B. Uçar, and U. V. Catalyurek, Fast and high quality topology-aware task mapping, Proceedings of International Parallel and Distributed Processing Symposium (IPDPS), 2015.
URL : https://hal.archives-ouvertes.fr/hal-01159677

J. Mair, Z. Huang, D. Eyers, and Y. Chen, Quantifying the Energy Efficiency Challenges of Achieving Exascale Computing, 2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing, 2015.
DOI : 10.1109/CCGrid.2015.130

E. L. Padoin, V. Martínez, P. O. Navaux, and J. Méhaut, Using Power Demand and Residual Load Imbalance in the Load Balancing to Save Energy of Parallel Systems, Procedia Computer Science, vol.108, 2017.
DOI : 10.1016/j.procs.2017.05.215
URL : https://hal.archives-ouvertes.fr/hal-01516645

C. Mei, Y. Sun, G. Zheng, E. J. Bohm, L. V. Kalé et al., Enabling and scaling biomolecular simulations of 100 million atoms on petascale machines with a multicore-optimized message-driven runtime, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC), 2011.

I. Karlin, A. Bhatele, J. Keasler, B. L. Chamberlain, J. Cohen et al., Exploring Traditional and Emerging Parallel Programming Models Using a Proxy Application, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing, 2013.
DOI : 10.1109/IPDPS.2013.115

M. R. Garey and D. S. Johnson, Computers and Intractability: A Guide to the Theory of NP-Completeness, 1979.

F. Trahay and A. Denis, A scalable and generic task scheduling system for communication libraries, 2009 IEEE International Conference on Cluster Computing and Workshops, 2009.
DOI : 10.1109/CLUSTR.2009.5289169
URL : https://hal.archives-ouvertes.fr/inria-00408521

G. Zheng, A. Bhatelé, E. Meneses, and L. V. Kalé, Periodic hierarchical load balancing for large supercomputers, The International Journal of High Performance Computing Applications, vol.16, issue.4, 2011.

M. H. Willebeek-LeMair and A. P. Reeves, Strategies for dynamic load balancing on highly parallel computers, IEEE Transactions on Parallel and Distributed Systems, vol.4, issue.9, 1993.
DOI : 10.1109/71.243526

H. Menon and L. Kalé, A distributed dynamic load balancer for iterative applications, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis (SC '13), 2013.
DOI : 10.1145/2503210.2503284

J. Paudel, O. Tardieu, and J. N. Amaral, On the Merits of Distributed Work-Stealing on Selective Locality-Aware Tasks, 2013 42nd International Conference on Parallel Processing, 2013.
DOI : 10.1109/ICPP.2013.19

L. L. Pilla, P. O. Navaux, C. P. Ribeiro, P. Coucheney, F. Broquedis et al., Asymptotically Optimal Load Balancing for Hierarchical Multi-Core Systems, 2012 IEEE 18th International Conference on Parallel and Distributed Systems, 2012.
DOI : 10.1109/ICPADS.2012.41
URL : https://hal.archives-ouvertes.fr/hal-00788008

U. V. Catalyurek, E. G. Boman, K. D. Devine, D. Bozdag, R. T. Heaphy et al., Hypergraph-based Dynamic Load Balancing for Adaptive Scientific Computations, 2007 IEEE International Parallel and Distributed Processing Symposium, 2007.
DOI : 10.1109/IPDPS.2007.370258

E. Jeannot, G. Mercier, and F. Tessier, Topology and Affinity Aware Hierarchical and Distributed Load-Balancing in Charm++, 2016 First International Workshop on Communication Optimizations in HPC (COMHPC), 2016.
DOI : 10.1109/COMHPC.2016.012
URL : https://hal.archives-ouvertes.fr/hal-01394748

J. Benson, T. Estrada, A. L. Rosenberg, and M. Taufer, Scheduling matters: Area-oriented heuristic for resource management, Proceedings of International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), 2016.

D. Unat et al., Trends in data locality abstractions for HPC systems, IEEE Transactions on Parallel and Distributed Systems (TPDS).

J. Yang and Q. He, Scheduling Parallel Computations by Work Stealing: A Survey, International Journal of Parallel Programming, vol.31, issue.11, 2018.
DOI : 10.3724/SP.J.1016.2008.01975

R. Al-Omairy, G. Miranda, H. Ltaief, R. Badia, X. Martorell et al., Dense matrix computations on NUMA architectures with distance-aware work stealing, Supercomputing Frontiers and Innovations (SuperFRI), vol.2, issue.1, 2015.

V. Janjic and K. Hammond, How to be a Successful Thief, Proceedings of International Conference on Parallel Processing (EuroPar), 2013.
DOI : 10.1007/978-3-642-40047-6_14

T. Beri, S. Bansal, and S. Kumar, ProSteal: A Proactive Work Stealer for Bulk Synchronous Tasks Distributed on a Cluster of Heterogeneous Machines with Multiple Accelerators, 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015.
DOI : 10.1109/IPDPSW.2015.7

P. H. Penna, M. Castro, P. D. Plentz, H. C. Freitas, F. Broquedis et al., BinLPT: A workload-aware parallel loop scheduler for large-scale multicore platforms, Proceedings of Brazilian Symposium on High Performance Computing (WSCAD), 2017.
URL : https://hal.archives-ouvertes.fr/tel-02112723

A. Demers, D. Greene, C. Hauser, W. Irish, J. Larson et al., Epidemic algorithms for replicated database maintenance, Proceedings of Symposium on Principles of Distributed Computing (PODC), 1987.

B. Acun, A. Langer, E. Meneses, H. Menon, O. Sarood et al., Power, Reliability, and Performance: One System to Rule them All, Computer, vol.49, issue.10, 2016.
DOI : 10.1109/MC.2016.310

H. Menon, Adaptive load balancing for HPC applications, 2016.

T. Hoefler, E. Jeannot, and G. Mercier, An Overview of Topology Mapping Algorithms and Techniques in High-Performance Computing
DOI : 10.1142/S0129626408003569

I. Z. Reguly, G. R. Mudalige, and M. B. Giles, Loop tiling in large-scale stencil codes at runtime with OPS, IEEE Transactions on Parallel and Distributed Systems (TPDS), vol.29, issue.4, 2018.

N. Cheriere and E. Saule, Considerations on Distributed Load Balancing for Fully Heterogeneous Machines: Two Particular Cases, 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, 2015.
DOI : 10.1109/IPDPSW.2015.36