A. Munshi, B. Gaster, T. Mattson, and D. Ginsburg, OpenCL Programming Guide, ser. OpenGL. Pearson Education, 2011.

J. Meng and K. Skadron, A Performance Study for Iterative Stencil Loops on GPUs with Ghost Zone Optimizations, International Journal of Parallel Programming, vol.39, issue.1, pp.115-142, 2011.
DOI : 10.1007/s10766-010-0142-5

F. Irigoin, P. Jouvelot, and R. Triolet, Semantical interprocedural parallelization: an overview of the PIPS project, ICS, pp.244-251, 1991.
URL : https://hal.archives-ouvertes.fr/hal-00984684

J. Meng and K. Skadron, Performance modeling and automatic ghost zone optimization for iterative stencil loops on GPUs, Proceedings of the 23rd international conference on Supercomputing, ICS '09, pp.256-265, 2009.
DOI : 10.1145/1542275.1542313

J. Kim, H. Kim, J. H. Lee, and J. Lee, Achieving a single compute device image in OpenCL for multiple GPUs, PPOPP, pp.277-288, 2011.

S. Henry, A. Denis, and D. Barthou, Programmation unifiée multiaccélérateur OpenCL, Techniques et Sciences Informatiques, pp.1233-1249, 2012.
DOI : 10.3166/tsi.31.1233-1249

C. Augonnet, S. Thibault, R. Namyst, and P. Wacrenier, StarPU: a unified platform for task scheduling on heterogeneous multicore architectures, Concurrency and Computation: Practice and Experience, vol.23, issue.2, pp.187-198, 2011.
DOI : 10.1002/cpe.1631
URL : https://hal.archives-ouvertes.fr/inria-00384363

I. Gelado, J. H. Kelm, S. Ryoo, S. S. Lumetta, N. Navarro et al., CUBA, Proceedings of the 22nd annual international conference on Supercomputing, ICS '08, pp.299-308, 2008.
DOI : 10.1145/1375527.1375571