Hitting the memory wall : Implications of the obvious, Computer Architecture News, vol.23, issue.1, pp.20-24, 1995. ,
Chip makers turn to multicore processors, Computer, vol.38, issue.5, pp.11-13, 2005. ,
DOI : 10.1109/MC.2005.160
Introduction to the Cell multiprocessor, IBM Journal of Research and Development, vol.49, issue.4.5, pp.589-604, 2005. ,
DOI : 10.1147/rd.494.0589
Microprocessors in the Era of Terascale Integration, 2007 Design, Automation & Test in Europe Conference & Exhibition, pp.237-242, 2007. ,
DOI : 10.1109/DATE.2007.364597
MPI versus MPI+OpenMP on the IBM SP for the NAS Benchmarks, ACM/IEEE SC 2000 Conference (SC'00), p.12, 2000. ,
DOI : 10.1109/SC.2000.10001
The implementation of the cilk- 5 multithreaded language, PLDI '98 : Proceedings of the ACM SIGPLAN 1998 conference on Programming language design and implementation, pp.212-223, 1998. ,
A Survey of General-Purpose Computation on Graphics Hardware, Computer Graphics Forum, vol.7, issue.4, pp.80-113, 2007. ,
DOI : 10.1016/j.rti.2005.04.002
Cg : a system for programming graphics hardware in a c-like language, SIGGRAPH '03 : ACM SIG- GRAPH 2003 Papers, pp.896-907, 2003. ,
OpenGL(R) Shading Language, 2005. ,
Ashli -advanced shading langage interface, 2003. ,
Shader algebra, SIGGRAPH '04 : ACM SIGGRAPH 2004 Papers, pp.787-795, 2004. ,
Brook for gpus : stream computing on graphics hardware, SIG- GRAPH '04 : ACM SIGGRAPH 2004 Papers, pp.777-786, 2004. ,
Scout: a data-parallel programming language for graphics processors, Parallel Computing, vol.33, issue.10-11, pp.10-11648, 2007. ,
DOI : 10.1016/j.parco.2007.09.001
Glift, ACM Transactions on Graphics, vol.25, issue.1, pp.60-99, 2006. ,
DOI : 10.1145/1122501.1122505
Lattice Boltzmann simulation optimization on leading multicore platforms, 2008 IEEE International Symposium on Parallel and Distributed Processing, 2008. ,
DOI : 10.1109/IPDPS.2008.4536295
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.139.2646
Accelerating computing with the cell broadband engine processor, Proceedings of the 2008 conference on Computing frontiers , CF '08, pp.3-12, 2008. ,
DOI : 10.1145/1366230.1366234
Charm++, Offload API, and the Cell Processor, Proceedings of the Workshop on Programming Models for Ubiquitous Parallelism, 2006. ,
Multicore framework : An api for programming heterogeneous multicore processors, Proc. of First Workshop on Software Tools for Multi-Core Systems, 2006. ,
Design and implementation of stream processing system and library for cell broadband engine processors, Proceeding (590) Parallel and Distributed Computing and Systems, 2007. ,
CellSs: a Programming Model for the Cell BE Architecture, ACM/IEEE SC 2006 Conference (SC'06), p.86, 2006. ,
DOI : 10.1109/SC.2006.17
Data-parallel programming on the cell be and the gpu using the rapidmind development platform, 2006. ,
Merge : a programming model for heterogeneous multi-core systems, ASPLOS XIII, 2008. ,
A library-based compiler to execute matlab programs on a heterogeneous platform ,
Sequoia: Programming the Memory Hierarchy, ACM/IEEE SC 2006 Conference (SC'06), 2006. ,
DOI : 10.1109/SC.2006.55
R-stream : A parametric high level compiler ,
Hmpp : A hybrid multi-core parallel programming environment ,
MCUDA: An Efficient Implementation of CUDA Kernels for Multi-core CPUs, 2008. ,
DOI : 10.1007/978-3-540-89740-8_2
Receiver-initiated message passing over RDMA Networks, 2008 IEEE International Symposium on Parallel and Distributed Processing, 2008. ,
DOI : 10.1109/IPDPS.2008.4536262
Mpi microtask for programming the cell broadband enginetm processor, IBM Syst. J, vol.45, issue.1, 2006. ,
Automated empirical optimizations of software and the ATLAS project, Parallel Computing, vol.27, issue.12, pp.3-35, 2001. ,
Impact of NUMA Effects on High-Speed Networking with Multi-Opteron Machines, The 19th IASTED International Conference on Parallel and Distributed Computing and Systems, 2007. ,
URL : https://hal.archives-ouvertes.fr/inria-00175747
Impact des architectures multiprocesseurs sur les communications dans les grappes de calcul : de l'exploration des effets numa au placement automatique, 2007. ,
URL : https://hal.archives-ouvertes.fr/inria-00177495
Building Portable Thread Schedulers for Hierarchical Multiprocessors: The BubbleSched Framework, EuroPar, 2007. ,
DOI : 10.1007/978-3-540-74466-5_6
URL : https://hal.archives-ouvertes.fr/inria-00154506
Newmadeleine : a fast communication scheduling engine for high performance networks, CAC 2007 : Workshop on Communication Architecture for Clusters, held in conjunction with IPDPS 2007, 2007. ,
URL : https://hal.archives-ouvertes.fr/inria-00122723
Analysis of double buffering on two different multicore architectures: Quad-core Opteron and the Cell-BE, 2008 IEEE International Symposium on Parallel and Distributed Processing, 2008. ,
DOI : 10.1109/IPDPS.2008.4536316
Scheduling multithreaded computations by work stealing, J. ACM, vol.46, issue.5, pp.720-748, 1999. ,
An Efficient Multi-level Trace Toolkit for Multi-threaded Applications, EuroPar, 2005. ,
DOI : 10.1007/11549468_21
URL : https://hal.archives-ouvertes.fr/hal-00360309
InitiationàInitiation`Initiationà la méthode deséléméntsdes´deséléménts finis, 2002. ,
Analyse numérique et optimisation. ´ Editions de l' ´ Ecole Polytechnique, 2005. ,
Finite element solution of the poisson equation with dirichlet boundary conditions in a rectangular domain ,