S. Benkner and T. Brandes, Efficient parallel programming on scalable shared memory systems with High Performance Fortran, Concurrency: Practice and Experience, pp.789-803, 2002.
DOI : 10.1002/cpe.649

F. Broquedis, F. Diakhaté, S. Thibault, O. Aumage, R. Namyst et al., Scheduling Dynamic OpenMP Applications over Multicore Architectures, International Workshop on OpenMP (IWOMP), 2008.
DOI : 10.1007/978-3-540-79561-2_15

URL : https://hal.archives-ouvertes.fr/inria-00329934

W. Carlson, J. Draper, D. Culler, K. Yelick, E. Brooks et al., Introduction to UPC and Language Specification, 1999.

B. M. Chapman and F. Bregier, Amit Patil, and Achal Prabhakar. Achieving performance under OpenMP on ccNUMA and software distributed shared memory systems

R. Dolbeau, S. Bihan, and F. Bodin, HMPP TM : A Hybrid Multi-core Parallel Programming Environment, 2007.

A. Duran, J. M. Perez, E. Ayguade, R. Badia, and J. Labarta, Extending the OpenMP Tasking Model to Allow Dependant Tasks, International Workshop on OpenMP (IWOMP), 2008.

M. Frigo, C. E. Leiserson, and K. H. Randall, The Implementation of the Cilk-5

B. Goglin and N. Furmento, Enabling high-performance memory migration for multithreaded applications on LINUX, 2009 IEEE International Symposium on Parallel & Distributed Processing, 2009.
DOI : 10.1109/IPDPS.2009.5161101

URL : https://hal.archives-ouvertes.fr/inria-00358172

. Intel, Thread Building Blocks

C. Koelbel, D. Loveman, R. Schreiber, G. Steele, and M. Zosel, The High Performance Fortran Handbook, Computers in Physics, vol.8, issue.4, 1994.
DOI : 10.1063/1.4823319

H. Löf and S. Holmgren, affinity-on-next-touch: increasing the performance of an industrial PDE solver on a cc-NUMA system, 19th ACM International Conference on Supercomputing, pp.387-392, 2005.

J. D. Mccalpin, Memory bandwidth and machine balance in current high performance computers, IEEE Computer Society Technical Committee on Computer Architecture (TCCA) Newsletter, pp.19-25, 1995.

D. S. Nikolopoulos, T. S. Papatheodorou, C. D. Polychronopoulos, J. Labarta, and E. Ayguadé, User-level dynamic page migration for multiprogrammed shared-memory multiprocessors, Proceedings 2000 International Conference on Parallel Processing, pp.95-103, 2000.
DOI : 10.1109/ICPP.2000.876083

D. S. Nikolopoulos, C. D. Polychronopoulos, T. S. Papatheodorou, J. Labarta, and E. Ayguadé, Scheduler-Activated Dynamic Page Migration for Multiprogrammed DSM Multiprocessors, Journal of Parallel and Distributed Computing, vol.62, issue.6, pp.1069-1103, 2002.
DOI : 10.1006/jpdc.2001.1817

M. Nordén, H. Löf, J. Rantakokko, and S. Holmgren, Geographical Locality and Dynamic Data Migration for OpenMP Implementations of Adaptive PDE Solvers, Second International Workshop on OpenMP (IWOMP), 2006.
DOI : 10.1007/978-3-540-68555-5_31

C. Terboven, D. Dieter-an-mey, H. Schmidl, T. Jin, and . Reichstein, Data and thread affinity in openmp programs, Proceedings of the 2008 workshop on Memory access on future processors a solved problem?, MAW '08, pp.377-384, 2008.
DOI : 10.1145/1366219.1366222

S. Thibault, F. Broquedis, B. Goglin, R. Namyst, and P. Wacrenier, An Efficient OpenMP Runtime System for Hierarchical Architectures, International Workshop on OpenMP (IWOMP), pp.148-159, 2007.
DOI : 10.1007/978-3-540-69303-1_19

URL : https://hal.archives-ouvertes.fr/inria-00154502

S. Thibault, R. Namyst, and P. Wacrenier, Building Portable Thread Schedulers for Hierarchical Multiprocessors: The BubbleSched Framework, European Conference on Parallel Computing (EuroPar), 2007.
DOI : 10.1007/978-3-540-74466-5_6

URL : https://hal.archives-ouvertes.fr/inria-00154506