N. Nextio, Flexible and manageable I/O expansion and virtualization, 2012.

. Mellanox, Connect-IB Single and Dual QSFP+ Port PCI Express Gen3 x16 Adapter Card User Manual. http://www.mellanox.com/related-docs, Connect-IB_Single_and_Dual_QSFP+_Port_PCI_Express_Gen3_x16_ Adapter_Card_User_Manual.pdf, 2013.

N. Popular and G. , Accelerated Applications Catalog, 2015.

A. Barak, T. Ben-nun, E. Levy, and A. Shiloh, A package for OpenCL based heterogeneous computing on clusters with many GPU devices, 2010 IEEE International Conference On Cluster Computing Workshops and Posters (CLUSTER WORKSHOPS), pp.1-7, 2010.
DOI : 10.1109/CLUSTERWKSP.2010.5613086

S. Barrachina, M. Castillo, F. D. Igual, R. Mayo, E. S. Quintana-ortí et al., Exploiting the capabilities of modern GPUs for dense matrix computations, Concurrency and Computation: Practice and Experience, vol.21, issue.4, pp.2457-2477, 2009.
DOI : 10.1002/cpe.1472

J. Duato, F. D. Igual, R. Mayo, A. J. Peña, E. S. Quintana-ortí et al., An Efficient Implementation of GPU Virtualization in High Performance Clusters, Proceedings of the 2009 International Conference on Parallel Processing Euro-Par'09, pp.385-394, 2010.
DOI : 10.1007/978-3-642-14122-5_44

W. Felter, An updated performance comparison of virtual machines and Linux containers, 2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2014.
DOI : 10.1109/ISPASS.2015.7095802

A. Gaikwad and I. M. Toke, GPU based sparse grid technique for solving multidimensional options pricing PDEs, Proceedings of the 2nd Workshop on High Performance Computational Finance, WHPCF '09, pp.1-6, 2009.
DOI : 10.1145/1645413.1645419

G. Giunta, R. Montella, G. Agrillo, and G. Coviello, A GPGPU Transparent Virtualization Component for High Performance Computing Clouds, 2010.
DOI : 10.1007/978-3-642-15277-1_37

V. Gupta, A. Gavrilovska, K. Schwan, H. Kharche, N. Tolia et al., GViM, Proceedings of the 3rd ACM Workshop on System-level Virtualization for High Performance Computing, HPCVirt '09, pp.17-24, 2009.
DOI : 10.1145/1519138.1519141

S. Iserte, A. C. Gimeno, R. Mayo, E. S. Quintana-ortí, F. Silla et al., SLURM Support for Remote GPU Virtualization: Implementation and Performance Study, 2014 IEEE 26th International Symposium on Computer Architecture and High Performance Computing, pp.318-325, 2014.
DOI : 10.1109/SBAC-PAD.2014.49

URL : http://hdl.handle.net/10251/66693

H. Jo, J. Jeong, M. Lee, and D. H. Choi, Exploiting GPUs in Virtual Machine for BioCloud, BioMed Research International, vol.30, issue.2, 2013.
DOI : 10.1016/j.patrec.2009.10.009

URL : http://doi.org/10.1155/2013/939460

P. Kegel, M. Steuwer, and S. Gorlatch, dOpenCL: Towards a Uniform Programming Approach for Distributed Heterogeneous Multi-/Many-Core Systems, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum, pp.174-186, 2012.
DOI : 10.1109/IPDPSW.2012.16

J. Kim, S. Seo, J. Lee, J. Nah, G. Jo et al., SnuCL, Proceedings of the 26th ACM international conference on Supercomputing, ICS '12, pp.341-352, 2012.
DOI : 10.1145/2304576.2304623

V. Krishnan, Towards an integrated IO and clustering solution using PCI express, 2007 IEEE International Conference on Cluster Computing, pp.259-266, 2007.
DOI : 10.1109/CLUSTR.2007.4629239

S. N. Laboratories, LAMMPS Molecular Dynamics Simulator, 2013.

T. Y. Liang and Y. W. Chang, GridCuda: A Grid-Enabled CUDA Programming Toolkit, 2011 IEEE Workshops of International Conference on Advanced Information Networking and Applications, pp.141-146, 2011.
DOI : 10.1109/WAINA.2011.82

Y. Liu, A. Wirawan, and B. Schmidt, CUDASW++ 3.0: accelerating Smith-Waterman protein database search by coupling CPU and GPU SIMD instructions, BMC Bioinformatics, vol.14, issue.1, pp.1-10, 2013.
DOI : 10.1186/1471-2105-14-117

A. M. Merritt, V. Gupta, A. Verma, A. Gavrilovska, and K. Schwan, Shadowfax, Proceedings of the 5th international workshop on Virtualization technologies in distributed computing, VTDC '11, pp.3-10, 2011.
DOI : 10.1145/1996121.1996124

M. Oikawa, A. Kawai, K. Nomura, K. Yasuoka, K. Yoshikawa et al., DS-CUDA: A Middleware to Use Many GPUs in the Cloud Environment, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, pp.1207-1214, 2012.
DOI : 10.1109/SC.Companion.2012.146

A. J. Peña, C. Reaño, F. Silla, R. Mayo, E. S. Quintana-ortí et al., A complete and efficient CUDA-sharing solution for HPC clusters, Parallel Computing, vol.40, issue.10, pp.574-588, 2014.
DOI : 10.1016/j.parco.2014.09.011

D. P. Playne and K. A. Hawick, Data Parallel Three-Dimensional Cahn-Hilliard Field Equation Simulation on GPUs with CUDA, pp.104-110, 2009.

L. Shi, H. Chen, and J. Sun, vCUDA: GPU-Accelerated High-Performance Computing in Virtual Machines, Parallel and Distributed Processing IPDPS 2009. IEEE International Symposium on, pp.1-11, 2009.
DOI : 10.1109/TC.2011.112

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.302.3933

P. D. Vouzis and N. V. Sahinidis, GPU-BLAST: using graphics processors to accelerate protein sequence alignment, Bioinformatics, vol.27, issue.2, 2010.
DOI : 10.1093/bioinformatics/btq644

URL : http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3018811

J. P. Walters, A. J. Younge, D. I. Kang, K. T. Yao, M. Kang et al., GPU Passthrough Performance: A Comparison of KVM, Xen, VMWare ESXi, and LXC for CUDA and OpenCL Applications, 2014 IEEE 7th International Conference on Cloud Computing, p.7, 2014.
DOI : 10.1109/CLOUD.2014.90

H. Wu, G. Diamos, T. Sheard, M. Aref, S. Baxter et al., Red Fox, Proceedings of Annual IEEE/ACM International Symposium on Code Generation and Optimization, CGO '14, pp.44-44, 2014.
DOI : 10.1145/2581122.2544166

S. Xiao, P. Balaji, Q. Zhu, R. Thakur, S. Coghlan et al., VOCL: An optimized environment for transparent virtualization of graphics processing units, 2012 Innovative Parallel Computing (InPar), 2012.
DOI : 10.1109/InPar.2012.6339609

C. T. Yang, H. Y. Wang, W. S. Ou, Y. T. Liu, and C. H. Hsu, On implementation of GPU virtualization using PCI pass-through, 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings, pp.711-716, 2012.
DOI : 10.1109/CloudCom.2012.6427531