Achieving performance under OpenMP on ccNUMA and software distributed shared memory systems, Concurrency : Practice and Experience, pp.713-739, 2002. ,
DOI : 10.1002/cpe.646
« Extending the OpenMP Tasking Model to Allow Dependant Tasks, OpenMP (IWOMP), 2008. ,
« affinity-on-next-touch : increasing the performance of an industrial PDE solver on a cc-NUMA system, 19th ACM International Conference on Supercomputing, pp.387-392, 2005. ,
« Memory Bandwidth and Machine Balance in Current High Performance Computers, IEEE Computer Society Technical Committee on Computer Architecture (TCCA), 1995. ,
User-level dynamic page migration for multiprogrammed shared-memory multiprocessors, Proceedings 2000 International Conference on Parallel Processing, pp.95-103, 2000. ,
DOI : 10.1109/ICPP.2000.876083
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.106.8580
« Geographical Locality and Dynamic Data Migration for OpenMP Implementations of Adaptive PDE Solvers, Second International Workshop on OpenMP (IWOMP), 2006. ,
« Data and Thread Affinity in OpenMP Programs, MAW '08 : Proceedings of the 2008 workshop on Memory access on future processors, pp.377-384, 2008. ,
« An Efficient OpenMP Runtime System for Hierarchical Architectures, OpenMP, pp.148-159, 2007. ,
« Building Portable Thread Schedulers for Hierarchical Multiprocessors : the BubbleSched Framework, European Conference on Parallel Computing (EuroPar), 2007. ,