Altera Corporation, Implementing FPGA Design with the OpenCL Standard, 2011.

O. Chafik, ScalaCL, 2011.

S. Craven and P. Athanas, Examining the viability of FPGA supercomputing, EURASIP Journal on Embedded Systems, vol.8, issue.1, 2007.

P. Jääskeläinen, C. de La Lama, P. Huerta, and J. Takala, OpenCL-based design methodology for application-specific processors, 2010 International Conference on Embedded Computer Systems: Architectures, Modeling and Simulation, pp.223-230, 2010.
DOI : 10.1109/ICSAMOS.2010.5642061

Khronos OpenCL Working Group, The OpenCL Specification 1.2, Khronos Group, 2011.

A. Klöckner, N. Pinto, Y. Lee, B. Catanzaro, P. Ivanov et al., PyCUDA and PyOpenCL: A scripting-based approach to GPU run-time code generation, Parallel Computing, vol.38, issue.3, pp.157-174, 2012.
DOI : 10.1016/j.parco.2011.09.001

C. Lattner and V. Adve, LLVM: A compilation framework for lifelong program analysis & transformation, International Symposium on Code Generation and Optimization (CGO 2004), pp.75-86, 2004.
DOI : 10.1109/CGO.2004.1281665

M. Owaida, N. Bellas, K. Daloukas, and C. Antonopoulos, Synthesis of Platform Architectures from OpenCL Programs, 2011 IEEE 19th Annual International Symposium on Field-Programmable Custom Computing Machines, pp.186-193, 2011.
DOI : 10.1109/FCCM.2011.19

A. Papakonstantinou, K. Gururaj, J. Stratton, D. Chen, J. Cong et al., FCUDA: Enabling efficient compilation of CUDA kernels onto FPGAs, 2009 IEEE 7th Symposium on Application Specific Processors, pp.35-42, 2009.
DOI : 10.1109/SASP.2009.5226333

J. Passerat-Palmbach, J. Caux, P. Siregar, and D. Hill, Warp-Level Parallelism: Enabling Multiple Replications In Parallel on GPU, Proceedings of the European Simulation and Modeling Conference 2011, pp.76-83, 2011.
URL : https://hal.archives-ouvertes.fr/hal-01083163

M. Scarpino, OpenCL in Action, 2011.

J. Stratton, S. Stone, and W. Hwu, MCUDA: An Efficient Implementation of CUDA Kernels for Multi-core CPUs, Languages and Compilers for Parallel Computing, pp.16-30, 2008.
DOI : 10.1007/978-3-540-89740-8_2

J. Williams, A. George, J. Richardson, K. Gosrani, and S. Suresh, Computational density of fixed and reconfigurable multi-core devices for application acceleration, 2008.