S. Ryoo, C. I. Rodrigues, S. S. Stone, J. A. Stratton, S. Ueng et al., Program optimization carving for GPU computing, Journal of Parallel and Distributed Computing, vol.68, issue.10, pp.1389-1401, 2008.
DOI : 10.1016/j.jpdc.2008.05.011

S. Che, M. Boyer, J. Meng, D. Tarjan, J. W. Sheaffer et al., A performance study of general-purpose applications on graphics processors using CUDA, Journal of Parallel and Distributed Computing, vol.68, issue.10, pp.1370-1380, 2008.
DOI : 10.1016/j.jpdc.2008.05.014

J. Nickolls, I. Buck, M. Garland, and K. Skadron, Scalable parallel programming with CUDA, Queue, vol.6, issue.2, pp.40-53, 2008.
DOI : 10.1145/1365490.1365500

C. Tenllado, J. Setoain, M. Prieto, L. Piuel, and F. Tirado, Parallel Implementation of the 2D Discrete Wavelet Transform on Graphics Processing Units: Filter Bank versus Lifting, IEEE Transactions on Parallel and Distributed Systems, vol.19, issue.3, pp.299-310, 2008.
DOI : 10.1109/TPDS.2007.70716

J. Li, X. Wang, R. He, and Z. Chi, An efficient finegrained parallel genetic algorithm based on gpu-accelerated, Network and Parallel Computing Workshops NPC Workshops. IFIP International Conference, pp.855-862, 2007.

D. M. Chitty, A data parallel approach to genetic programming using programmable graphics hardware, Proceedings of the 9th annual conference on Genetic and evolutionary computation , GECCO '07, pp.1566-1573, 2007.
DOI : 10.1145/1276958.1277274

T. Wong, M. L. Wong, and ´. D. Taillard, Parallel evolutionary algorithms on consumer-level graphics processing unit Robust taboo search for the quadratic assignment problem, Parallel Evolutionary Computations, pp.133-155, 1991.

M. Dorigo and L. M. Gambardella, Ant colony system: a cooperative learning approach to the traveling salesman problem, IEEE Transactions on Evolutionary Computation, vol.1, issue.1, pp.53-66, 1997.
DOI : 10.1109/4235.585892

E. Lutton and J. L. Véhel, Holder functions and deception of genetic algorithms, IEEE Transactions on Evolutionary Computation, vol.2, issue.2, pp.56-71, 1998.
DOI : 10.1109/4235.728208

URL : https://hal.archives-ouvertes.fr/inria-00592413

E. Talbi, Metaheuristics: From design to implementation, 2009.
DOI : 10.1002/9780470496916

URL : https://hal.archives-ouvertes.fr/hal-00750681

J. Chakrapani and J. Skorin-kapov, Massively parallel tabu search for the quadratic assignment problem, Annals of Operations Research, vol.17, issue.4, pp.327-341, 1993.
DOI : 10.1007/BF02022999

T. Crainic, M. Toulouse, and M. Gendreau, Parallel asynchronous tabu search for multicommodity location-allocation with balancing requirements, Annals of Operations Research, vol.6, issue.2, pp.277-299, 1995.
DOI : 10.1007/BF02125458

B. Garcia, J. Potvin, and J. Rousseau, A parallel implementation of the Tabu search heuristic for vehicle routing problems with time window constraints, Computers & Operations Research, vol.21, issue.9, pp.1025-1033, 1994.
DOI : 10.1016/0305-0548(94)90073-6

J. P. Reuther, M. D. Robertson, B. Theys, D. A. Yao, R. F. Hensgen et al., A comparison of eleven static heuristics for mapping a class of independent tasks onto heterogeneous distributed computing systems, J. Parallel Distrib. Comput, vol.61, issue.6, pp.810-837, 2001.

T. James, C. Rego, and F. Glover, A cooperative parallel tabu search algorithm for the quadratic assignment problem, European Journal of Operational Research, vol.195, issue.3, pp.810-826, 2009.
DOI : 10.1016/j.ejor.2007.06.061

A. Bevilacqua, A Methodological Approach to Parallel Simulated Annealing on an SMP System, Journal of Parallel and Distributed Computing, vol.62, issue.10, pp.1548-1570, 2002.
DOI : 10.1016/S0743-7315(02)91863-0

A. Tantar, N. Melab, and E. Talbi, A Comparative Study of Parallel Metaheuristics for Protein Structure Prediction on the Computational Grid, 2007 IEEE International Parallel and Distributed Processing Symposium, pp.1-10, 2007.
DOI : 10.1109/IPDPS.2007.370439

URL : https://hal.archives-ouvertes.fr/hal-00683904

J. Nickolls and W. J. Dally, The GPU Computing Era, IEEE Micro, vol.30, issue.2, pp.56-69, 2010.
DOI : 10.1109/MM.2010.41

J. W. Choi, A. Singh, and R. W. Vuduc, Model-driven autotuning of sparse matrix-vector multiply on GPUs, ACM SIGPLAN Notices, vol.45, issue.5, pp.115-126, 2010.
DOI : 10.1145/1837853.1693471

A. Nukada and S. Matsuoka, Auto-tuning 3-d fft library for cuda gpus Storage and Analysis, ser. SC '09, Proceedings of the Conference on High Performance Computing Networking, pp.301-3010, 2009.

R. Chelouah and P. Siarry, Tabu Search applied to global optimization, European Journal of Operational Research, vol.123, issue.2, pp.256-270, 2000.
DOI : 10.1016/S0377-2217(99)00255-6

G. Jost, H. Jin, D. A. Mey, and F. F. Hatay, Comparing the openmp, mpi, and hybrid programming paradigm on an smp cluster, 2003.

N. Melab, S. Cahon, and E. Talbi, Grid computing for parallel bioinspired algorithms, Journal of Parallel and Distributed Computing, vol.66, issue.8, pp.1052-1061, 2006.
DOI : 10.1016/j.jpdc.2005.11.006

URL : https://hal.archives-ouvertes.fr/hal-00684951

K. Group, OpenCL 1.1 Quick Reference Card, 2011.

S. Cahon, N. Melab, and E. Talbi, ParadisEO: A Framework for the Reusable Design of Parallel and Distributed Metaheuristics, Journal of Heuristics, vol.10, issue.3, pp.357-380, 2004.
DOI : 10.1023/B:HEUR.0000026900.92269.ec