Intel threading building blocks, 2007. ,
Cilk: an efficient multithreaded runtime system, Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming , ser. PPOPP '95, pp.207-216, 1995. ,
Iterative Methods for Sparse Linear Systems, 1996. ,
DOI : 10.1137/1.9780898718003
A comparison of some recent task-based parallel programming models, 3rd Workshop on Programmability Issues for Multi-Core Computers, 2010. ,
The Design of OpenMP Tasks, IEEE Transactions on Parallel and Distributed Systems, vol.20, issue.3, pp.404-418, 2009. ,
DOI : 10.1109/TPDS.2008.105
Productive Cluster Programming with OmpSs, Proceedings of the 17th international conference on Parallel processing -Volume Part I, ser. Euro-Par '11, pp.555-566, 2011. ,
DOI : 10.1147/rd.515.0593
StarPU: A unified platform for task scheduling on heterogeneous multicore architectures Concurrency and Computation: Practice and Experience, Special Issue: Euro-Par, pp.187-198, 2009. ,
X-kaapi: A Multi Paradigm Runtime for Multicore Architectures, 2013 42nd International Conference on Parallel Processing, 2012. ,
DOI : 10.1109/ICPP.2013.86
URL : https://hal.archives-ouvertes.fr/hal-00727827
A Unified Scheduler for Recursive and Task Dataflow Parallelism, 2011 International Conference on Parallel Architectures and Compilation Techniques, pp.1-11, 2011. ,
DOI : 10.1109/PACT.2011.7
Dynamic grain-size adaptation on object oriented parallel programming. The SCOOPP approach, Proceedings 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing. IPPS/SPDP 1999, pp.728-732, 1999. ,
DOI : 10.1109/IPPS.1999.760556
A Comparison of Multiprocessor Scheduling Heuristics, 1994 International Conference on Parallel Processing (ICPP'94), pp.243-250, 1994. ,
DOI : 10.1109/ICPP.1994.19
Task scheduling algorithms for heterogeneous processors, Proceedings. Eighth Heterogeneous Computing Workshop (HCW'99), p.3, 1999. ,
DOI : 10.1109/HCW.1999.765092
A method that determines optimal grain size and inherent parallelism concurrently, International Symposium on Parallel Architectures, Algorithms and Networks, ser. ISPAN '96, pp.200-206, 1996. ,
Triplet: A clustering scheduling algorithm for heterogeneous systems, Proceedings International Conference on Parallel Processing Workshops, pp.231-236, 2001. ,
DOI : 10.1109/ICPPW.2001.951956
URL : https://hal.archives-ouvertes.fr/inria-00100488
Capsules: Expressing Composable Computations in a Parallel Programming Model, Languages and Compilers for Parallel Computing, pp.276-291, 2008. ,
DOI : 10.1007/978-3-540-85261-2_19
affinity-on-next-touch: increasing the performance of an industrial PDE solver on a cc-NUMA system, Proceedings of the 19th annual international conference on Supercomputing, ser. ICS '05, pp.387-392, 2005. ,
Building Portable Thread Schedulers for Hierarchical Multiprocessors: The BubbleSched Framework, Euro-Par 2007 Parallel Processing, pp.42-51, 2007. ,
DOI : 10.1007/978-3-540-74466-5_6
URL : https://hal.archives-ouvertes.fr/inria-00154506