N. S. Arora, R. D. Blumofe, and C. G. Plaxton, Thread Scheduling for Multiprogrammed Multiprocessors, Theory of Computing Systems, vol.34, issue.2, pp.115-144, 2001.
DOI : 10.1007/s00224-001-0004-z

G. E. Blelloch and P. B. Gibbons, Effectively sharing a cache among threads, Proceedings of the sixteenth annual ACM symposium on Parallelism in algorithms and architectures , SPAA '04, 2004.
DOI : 10.1145/1007912.1007948

R. D. Blumofe, C. F. Joerg, B. C. Kuszmaul, C. E. Leiserson, K. H. Randall et al., Cilk: An Efficient Multithreaded Runtime System, Journal of Parallel and Distributed Computing, vol.37, issue.1, pp.55-69, 1996.
DOI : 10.1006/jpdc.1996.0107

T. Gautier, X. Besseron, and L. Pigeon, KAAPI, Proceedings of the 2007 international workshop on Parallel symbolic computation, PASCO '07, pp.15-23, 2007.
DOI : 10.1145/1278177.1278182

URL : https://hal.archives-ouvertes.fr/hal-00647474

A. Hassidim, Cache replacement policies for multicore processors, ICS, 2010.

A. Jaleel, M. Mattina, and B. Jacob, Last Level Cache (LLC) Performance of Data Mining Workloads On a CMP - A Case Study of Parallel Bioinformatics Workloads, The Twelfth International Symposium on High-Performance Computer Architecture, 2006.
DOI : 10.1109/HPCA.2006.1598115

A. Robison, M. Voss, and A. Kukanov, Optimization via Reflection on Work Stealing in TBB, 2008 IEEE International Symposium on Parallel and Distributed Processing, 2008.
DOI : 10.1109/IPDPS.2008.4536188

W. Schroeder, K. Martin, and B. Lorensen, The Visualization Toolkit, An Object-Oriented Approach To 3D Graphics, 2004.

M. Tchiboukdjian, V. Danjean, and B. Raffin, Binary Mesh Partitioning for Cache-Efficient Visualization, IEEE Transactions on Visualization and Computer Graphics, vol.16, issue.5, 2010.
DOI : 10.1109/TVCG.2010.19

URL : https://hal.archives-ouvertes.fr/hal-00685930

M. Tchiboukdjian, D. Trystram, J. Roch, and J. Bernard, List scheduling: The price of distribution, 2010.
URL : https://hal.archives-ouvertes.fr/inria-00458133

D. Traoré, J. Roch, N. Maillard, T. Gautier, and J. Bernard, Deque-Free Work-Optimal Parallel STL Algorithms, Euro-Par, 2008.
DOI : 10.1007/978-3-540-85451-7_95

E. Z. Zhang, Y. Jiang, and X. Shen, Does cache sharing on modern cmp matter to the performance of contemporary multithreaded programs, PPoPP, 2010.

H. Zhang, T. S. Newman, and X. Zhang, Case study of multithreaded in-core isosurface extraction algorithms, EGPGV, 2004.