J. Ahrens, K. Brislawn, K. Martin, B. Geveci, C. C. Law et al., Large-scale data visualization using parallel data streaming, IEEE Computer Graphics and Applications, vol.21, issue.4, pp.34-41, 2001.
DOI : 10.1109/38.933522

J. Ahrens, C. Law, W. Schroeder, and K. Martin, Kitware Inc, and Michael Papka. A parallel approach for efficiently visualizing extremely large, time-varying datasets, 2000.

J. Broquedis, S. Clet-ortega, N. Moreaud, B. Furmento, G. Goglin et al., hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing, 2010.
DOI : 10.1109/PDP.2010.67
URL : https://hal.archives-ouvertes.fr/inria-00429889

[. Broquedis, T. Gautier, and V. Danjean, libKOMP, an Efficient OpenMP Runtime System for Both Fork-Join and Data Flow Paradigms, Proceedings of the 8th international conference on OpenMP in a Heterogeneous World, pp.102-115, 2012.
DOI : 10.1007/978-3-642-30961-8_8
URL : https://hal.archives-ouvertes.fr/hal-00796253

E. Guy, P. B. Blelloch, Y. Gibbons, and . Matias, Provably efficient scheduling for languages with fine-grained parallelism, Proceedings of the Seventh Annual ACM Symposium on Parallel Algorithms and Architectures, SPAA '95, pp.1-12, 1995.

D. Robert, C. E. Blumofe, and . Leiserson, Scheduling multithreaded computations by work stealing, J. ACM, vol.46, issue.5, pp.720-748, 1999.

E. Guy and . Blelloch, Vector Models for Data-Parallel Computing, 1990.

. H. Inria, B. Childs, W. Geveci, J. Schroeder, K. Meredith et al., Research challenges for visualization software, Computer, vol.46, issue.5, pp.34-42, 2013.

D. Cederman and P. Tsigas, On dynamic load balancing on graphics processors, Proceedings of the 23rd ACM SIGGRAPH/EUROGRAPHICS symposium on Graphics hardware, GH '08, pp.57-64, 2008.

M. Durand, F. Broquedis, T. Gautier, and B. Raffin, An Efficient OpenMP Loop Scheduler for Irregular Applications on Large-Scale NUMA Machines, International Workshop on OpenMP, pp.141-155
DOI : 10.1007/978-3-642-40698-0_11
URL : https://hal.archives-ouvertes.fr/hal-00867438

M. Ettinger, F. Broquedis, T. Gautier, S. Ploix, and B. Raffin, VtkSMP: Task-based Parallel Operators for VTK Filters, Eurographics 2013 Symposium on Parallel Graphics and Visualization (EGPGV'13), 2013.
URL : https://hal.archives-ouvertes.fr/hal-00926457

M. Frigo, C. E. Leiserson, and K. H. Randall, The implementation of the Cilk-5 multithreaded language, ACM SIGPLAN Notices, vol.33, issue.5, pp.212-223, 1998.
DOI : 10.1145/277652.277725

[. Gautier, X. Besseron, and L. Pigeon, KAAPI, Proceedings of the 2007 international workshop on Parallel symbolic computation, PASCO '07, 2007.
DOI : 10.1145/1278177.1278182
URL : https://hal.archives-ouvertes.fr/hal-00647474

[. Hendler, I. Incze, N. Shavit, and M. Tzafrir, Flat combining and the synchronization-parallelism tradeoff, Proceedings of the 22nd ACM symposium on Parallelism in algorithms and architectures, SPAA '10, pp.355-364, 2010.
DOI : 10.1145/1810479.1810540
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.186.939

A. Edward and . Lee, The problem with threads, Computer, vol.39, pp.33-42, 2006.

F. Le-mentec, V. Danjean, and T. Gautier, X-Kaapi C programming interface, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00647474

U. [. Moreland, B. Ayachit, K. Geveci, and . Ma, Dax Toolkit: A proposed framework for data analysis and visualization at Extreme Scale, 2011 IEEE Symposium on Large Data Analysis and Visualization, pp.97-104, 2011.
DOI : 10.1109/LDAV.2011.6092323

J. S. Meredith, S. Ahern, D. Pugmire, and R. Sisneros, Eavl: The extremescale analysis and visualization library, Eurographics Symposium on Parallel Graphics and Visualization (EGPGV) The Eurographics Association, pp.21-30, 2012.

K. Moreland, B. Geveci, K. Ma, and R. Maynard, A classification of scientific visualization algorithms for massive threading, Proceedings of the 8th International Workshop on Ultrascale Visualization, UltraVis '13, pp.1-210, 2013.
DOI : 10.1145/2535571.2535591

Y. Victor, . Pan, P. Franco, and . Preparata, Work-preserving speed-up of parallel matrix computations, SIAM J. Comput, 1995.

. Prg-+-11-]-t, R. Peterka, A. Ross, V. Gyulassy, W. Pascucci et al., Scalable parallel building blocks for custom data analysis, Large Data Analysis and Visualization (LDAV), 2011 IEEE Symposium on, pp.105-112, 2011.

J. Reinders, Intel threading building blocks, 2007.

C. Sewell, J. Meredith, K. Moreland, T. Peterka, D. Demarle et al., The SDAV Software Frameworks for Visualization and Analysis on Next-Generation Multi-Core and Many-Core Architectures, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, pp.206-214, 2012.
DOI : 10.1109/SC.Companion.2012.36

[. Sewell, L. Ta-lo, and J. Ahrens, Piston: A portable cross-platform framework for data-parallel visualization operators, Eurographics Symposium on Parallel Graphics ans Visualization, 2012.
DOI : 10.2172/1113789

V. Marc-tchiboukdjian, T. Danjean, F. L. Gautier, B. Mentec, and . Raffin, A Work Stealing Algorithm for Parallel Loops on Shared Cache Multicores, 4th Workshop on Highly Parallel Processing on a Chip, 2010.

[. Tchiboukdjian, V. Danjean, and B. Raffin, Cache-efficient parallel isosurface extraction for shared cache multicores, Eurographics Symposium on Parallel Graphics ans Visualization, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00798445

J. Toss and T. Gautier, A New Programming Paradigm for GPGPU, Euro-Par, pp.895-907, 2012.
DOI : 10.1007/978-3-642-32820-6_88
URL : https://hal.archives-ouvertes.fr/hal-00796257

P. Virouleau, P. Brunet, F. Broquedis, N. Furmento, S. Thibault et al., Evaluation of OpenMP Dependent Tasks with the KASTORS Benchmark Suite, Using and Improving OpenMP for Devices, Tasks, and More, pp.16-29, 2014.
DOI : 10.1007/978-3-319-11454-5_2
URL : https://hal.archives-ouvertes.fr/hal-01081974

. Voc-+-12-]-huy, D. Vo, J. Osmari, P. Comba, C. Lindstrom et al., Hyperflow: A heterogeneous dataflow architecture, Eurographics Symposium on Parallel Graphics ans Visualization, 2012.

. Vos-+-10-]-huy, D. Vo, B. Osmari, J. Summa, V. Comba et al., Streaming-enabled parallel dataflow architecture for multicore systems, Eurographics/IEEE-VGTC Symposium on Visualization, pp.1073-1082, 2010.

I. Wald, Fast construction of sah bvhs on the intel many integrated core (mic) architecture . Visualization and Computer Graphics, IEEE Transactions on, vol.18, issue.1, pp.47-57, 2012.

J. M. Wozniak, T. G. Armstrong, M. Wilde, D. S. Katz, E. Lusk et al., Swift/t: Scalable data flow programming for many-task applications, Proceedings of the 18th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPoPP '13, pp.309-310, 2013.

R. N°-8245 and R. Centre-grenoble-?-rhône-alpes, Inovallée 655 avenue de l'Europe Montbonnot 38334 Saint Ismier Cedex Publisher Inria Domaine de Voluceau -Rocquencourt BP 105 -78153 Le Chesnay Cedex inria, pp.249-6399