Is the Schedule Clause Really Necessary in OpenMP?, Proceedings of the OpenMP applications and tools 2003 international conference on OpenMP shared memory parallel programming, WOMPAT'03, pp.147-159, 2003. ,
DOI : 10.1007/3-540-45009-2_12
Structuring the execution of OpenMP applications for multicore architectures, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), 2010. ,
DOI : 10.1109/IPDPS.2010.5470442
URL : https://hal.archives-ouvertes.fr/inria-00441472
libKOMP, an Efficient OpenMP Runtime System for Both Fork-Join and Data Flow Paradigms, Proceedings of the 8th international conference on OpenMP in a Heterogeneous World, 2012. ,
DOI : 10.1007/978-3-642-30961-8_8
URL : https://hal.archives-ouvertes.fr/hal-00796253
Dynamic Task and Data Placement over NUMA Architectures: An OpenMP Runtime Perspective, International Workshop on OpenMP (IWOMP), 2009. ,
DOI : 10.1007/978-3-540-74466-5_6
URL : https://hal.archives-ouvertes.fr/inria-00367570
Measuring synchronisation and scheduling overheads in openmp, Proceedings of First European Workshop on OpenMP, pp.99-105, 1999. ,
Rodinia: A benchmark suite for heterogeneous computing, 2009 IEEE International Symposium on Workload Characterization (IISWC), pp.44-54, 2009. ,
DOI : 10.1109/IISWC.2009.5306797
A Packed Memory Array to Keep Moving Particles Sorted, 9th Workshop on Virtual Reality Interaction and Physical Simulation, 2012. ,
URL : https://hal.archives-ouvertes.fr/hal-00762593
The implementation of the Cilk-5 multithreaded language, ACM SIGPLAN Notices, vol.33, issue.5, pp.212-223, 1998. ,
DOI : 10.1145/277652.277725
Fluids v2.0, open source, fluid simulator, 2008. ,
Enabling Locality-Aware Computations in OpenMP, Scientific Programming, vol.18, issue.3-4, pp.3-4169, 2010. ,
DOI : 10.1155/2010/185421
A Parallel SPH Implementation on Multi-Core CPUs, Computer Graphics Forum, vol.87, issue.1-2, pp.99-112, 2011. ,
DOI : 10.1111/j.1467-8659.2010.01832.x
Adaptive OpenMP for Large NUMA Nodes, Proceedings of the 8th international conference on OpenMP in a Heterogeneous World, pp.254-257, 2012. ,
DOI : 10.1007/978-3-642-30961-8_20
OpenMP-oriented applications for distributed shared memory architectures, Concurrency and Computation: Practice and Experience, vol.16, issue.4, 2004. ,
DOI : 10.1002/cpe.752
Memory bandwidth and machine balance in current high performance computers, IEEE Computer Society Technical Committee on Computer Architecture (TCCA) Newsletter, pp.19-25, 1995. ,
Characterizing and mitigating work time inflation in task parallel programs, Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, SC '12, pp.1-65, 2012. ,
OpenMP task scheduling strategies for multicore NUMA systems, International Journal of High Performance Computing Applications, vol.26, issue.2, pp.110-124, 2012. ,
DOI : 10.1177/1094342011434065
Affinity scheduling of unbalanced workloads, Proceedings of the 1994 ACM/IEEE conference on Supercomputing, Supercomputing '94, pp.214-226, 1994. ,
A Work Stealing Scheduler for Parallel Loops on Shared Cache Multicores, Proceedings of the 2010 conference on Parallel processing, pp.99-107, 2010. ,
DOI : 10.1007/978-3-540-85451-7_95
Deque-Free Work-Optimal Parallel STL Algorithms, Proceedings of the 14th international Euro-Par conference on Parallel Processing, Euro-Par '08, pp.887-897, 2008. ,
DOI : 10.1007/978-3-540-85451-7_95
Adaptively scheduling parallel loops in distributed sharedmemory systems, IEEE Trans. on Parallel and Distributed Systems, vol.8, issue.1, 1997. ,