J. Reinders, Intel threading building blocks, 2007.

R. D. Blumofe, C. F. Joerg, B. C. Kuszmaul, C. E. Leiserson, K. H. Randall et al., Cilk: an efficient multithreaded runtime system, Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming , ser. PPOPP '95, pp.207-216, 1995.

Y. Saad, Iterative Methods for Sparse Linear Systems, 1996.
DOI : 10.1137/1.9780898718003

A. Podobas, M. Brorsson, and K. Faxén, A comparison of some recent task-based parallel programming models, 3rd Workshop on Programmability Issues for Multi-Core Computers, 2010.

E. Ayguade, N. Copty, A. Duran, J. Hoeflinger, Y. Lin et al., The Design of OpenMP Tasks, IEEE Transactions on Parallel and Distributed Systems, vol.20, issue.3, pp.404-418, 2009.
DOI : 10.1109/TPDS.2008.105

J. Bueno, L. Martinell, A. Duran, M. Farreras, X. Martorell et al., Productive Cluster Programming with OmpSs, Proceedings of the 17th international conference on Parallel processing -Volume Part I, ser. Euro-Par '11, pp.555-566, 2011.
DOI : 10.1147/rd.515.0593

C. Augonnet, S. Thibault, R. Namyst, and P. Wacrenier, StarPU: A unified platform for task scheduling on heterogeneous multicore architectures Concurrency and Computation: Practice and Experience, Special Issue: Euro-Par, pp.187-198, 2009.

T. Gautier, F. Lementec, V. Faucher, and B. Raffin, X-kaapi: A Multi Paradigm Runtime for Multicore Architectures, 2013 42nd International Conference on Parallel Processing, 2012.
DOI : 10.1109/ICPP.2013.86

URL : https://hal.archives-ouvertes.fr/hal-00727827

H. Vandierendonck, G. Tzenakis, and D. S. Nikolopoulos, A Unified Scheduler for Recursive and Task Dataflow Parallelism, 2011 International Conference on Parallel Architectures and Compilation Techniques, pp.1-11, 2011.
DOI : 10.1109/PACT.2011.7

J. L. Sobral and A. J. Proença, Dynamic grain-size adaptation on object oriented parallel programming. The SCOOPP approach, Proceedings 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing. IPPS/SPDP 1999, pp.728-732, 1999.
DOI : 10.1109/IPPS.1999.760556

A. A. Khan, C. L. Mccreary, and M. S. Jones, A Comparison of Multiprocessor Scheduling Heuristics, 1994 International Conference on Parallel Processing (ICPP'94), pp.243-250, 1994.
DOI : 10.1109/ICPP.1994.19

H. Topcuoglu, S. Hariri, and M. Wu, Task scheduling algorithms for heterogeneous processors, Proceedings. Eighth Heterogeneous Computing Workshop (HCW'99), p.3, 1999.
DOI : 10.1109/HCW.1999.765092

Y. Ge and D. Y. Yun, A method that determines optimal grain size and inherent parallelism concurrently, International Symposium on Parallel Architectures, Algorithms and Networks, ser. ISPAN '96, pp.200-206, 1996.

B. Cirou and E. Jeannot, Triplet: A clustering scheduling algorithm for heterogeneous systems, Proceedings International Conference on Parallel Processing Workshops, pp.231-236, 2001.
DOI : 10.1109/ICPPW.2001.951956

URL : https://hal.archives-ouvertes.fr/inria-00100488

H. Mandviwala, U. Ramachandran, and K. Knobe, Capsules: Expressing Composable Computations in a Parallel Programming Model, Languages and Compilers for Parallel Computing, pp.276-291, 2008.
DOI : 10.1007/978-3-540-85261-2_19

H. Löf and S. Holmgren, affinity-on-next-touch: increasing the performance of an industrial PDE solver on a cc-NUMA system, Proceedings of the 19th annual international conference on Supercomputing, ser. ICS '05, pp.387-392, 2005.

S. Thibault, R. Namyst, and P. Wacrenier, Building Portable Thread Schedulers for Hierarchical Multiprocessors: The BubbleSched Framework, Euro-Par 2007 Parallel Processing, pp.42-51, 2007.
DOI : 10.1007/978-3-540-74466-5_6

URL : https://hal.archives-ouvertes.fr/inria-00154506