Flexible and manageable I/O expansion and virtualization, 2012. ,
Connect-IB Single and Dual QSFP+ Port PCI Express Gen3 x16 Adapter Card User Manual. http://www.mellanox.com/related-docs, Connect-IB_Single_and_Dual_QSFP+_Port_PCI_Express_Gen3_x16_ Adapter_Card_User_Manual.pdf, 2013. ,
Accelerated Applications Catalog, 2015. ,
A package for OpenCL based heterogeneous computing on clusters with many GPU devices, 2010 IEEE International Conference On Cluster Computing Workshops and Posters (CLUSTER WORKSHOPS), pp.1-7, 2010. ,
DOI : 10.1109/CLUSTERWKSP.2010.5613086
Exploiting the capabilities of modern GPUs for dense matrix computations, Concurrency and Computation: Practice and Experience, vol.21, issue.4, pp.2457-2477, 2009. ,
DOI : 10.1002/cpe.1472
An Efficient Implementation of GPU Virtualization in High Performance Clusters, Proceedings of the 2009 International Conference on Parallel Processing Euro-Par'09, pp.385-394, 2010. ,
DOI : 10.1007/978-3-642-14122-5_44
An updated performance comparison of virtual machines and Linux containers, 2015 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), 2014. ,
DOI : 10.1109/ISPASS.2015.7095802
GPU based sparse grid technique for solving multidimensional options pricing PDEs, Proceedings of the 2nd Workshop on High Performance Computational Finance, WHPCF '09, pp.1-6, 2009. ,
DOI : 10.1145/1645413.1645419
A GPGPU Transparent Virtualization Component for High Performance Computing Clouds, 2010. ,
DOI : 10.1007/978-3-642-15277-1_37
GViM, Proceedings of the 3rd ACM Workshop on System-level Virtualization for High Performance Computing, HPCVirt '09, pp.17-24, 2009. ,
DOI : 10.1145/1519138.1519141
SLURM Support for Remote GPU Virtualization: Implementation and Performance Study, 2014 IEEE 26th International Symposium on Computer Architecture and High Performance Computing, pp.318-325, 2014. ,
DOI : 10.1109/SBAC-PAD.2014.49
URL : http://hdl.handle.net/10251/66693
Exploiting GPUs in Virtual Machine for BioCloud, BioMed Research International, vol.30, issue.2, 2013. ,
DOI : 10.1016/j.patrec.2009.10.009
URL : http://doi.org/10.1155/2013/939460
dOpenCL: Towards a Uniform Programming Approach for Distributed Heterogeneous Multi-/Many-Core Systems, 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops & PhD Forum, pp.174-186, 2012. ,
DOI : 10.1109/IPDPSW.2012.16
SnuCL, Proceedings of the 26th ACM international conference on Supercomputing, ICS '12, pp.341-352, 2012. ,
DOI : 10.1145/2304576.2304623
Towards an integrated IO and clustering solution using PCI express, 2007 IEEE International Conference on Cluster Computing, pp.259-266, 2007. ,
DOI : 10.1109/CLUSTR.2007.4629239
LAMMPS Molecular Dynamics Simulator, 2013. ,
GridCuda: A Grid-Enabled CUDA Programming Toolkit, 2011 IEEE Workshops of International Conference on Advanced Information Networking and Applications, pp.141-146, 2011. ,
DOI : 10.1109/WAINA.2011.82
CUDASW++ 3.0: accelerating Smith-Waterman protein database search by coupling CPU and GPU SIMD instructions, BMC Bioinformatics, vol.14, issue.1, pp.1-10, 2013. ,
DOI : 10.1186/1471-2105-14-117
Shadowfax, Proceedings of the 5th international workshop on Virtualization technologies in distributed computing, VTDC '11, pp.3-10, 2011. ,
DOI : 10.1145/1996121.1996124
DS-CUDA: A Middleware to Use Many GPUs in the Cloud Environment, 2012 SC Companion: High Performance Computing, Networking Storage and Analysis, pp.1207-1214, 2012. ,
DOI : 10.1109/SC.Companion.2012.146
A complete and efficient CUDA-sharing solution for HPC clusters, Parallel Computing, vol.40, issue.10, pp.574-588, 2014. ,
DOI : 10.1016/j.parco.2014.09.011
Data Parallel Three-Dimensional Cahn-Hilliard Field Equation Simulation on GPUs with CUDA, pp.104-110, 2009. ,
vCUDA: GPU-Accelerated High-Performance Computing in Virtual Machines, Parallel and Distributed Processing IPDPS 2009. IEEE International Symposium on, pp.1-11, 2009. ,
DOI : 10.1109/TC.2011.112
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.302.3933
GPU-BLAST: using graphics processors to accelerate protein sequence alignment, Bioinformatics, vol.27, issue.2, 2010. ,
DOI : 10.1093/bioinformatics/btq644
URL : http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3018811
GPU Passthrough Performance: A Comparison of KVM, Xen, VMWare ESXi, and LXC for CUDA and OpenCL Applications, 2014 IEEE 7th International Conference on Cloud Computing, p.7, 2014. ,
DOI : 10.1109/CLOUD.2014.90
Red Fox, Proceedings of Annual IEEE/ACM International Symposium on Code Generation and Optimization, CGO '14, pp.44-44, 2014. ,
DOI : 10.1145/2581122.2544166
VOCL: An optimized environment for transparent virtualization of graphics processing units, 2012 Innovative Parallel Computing (InPar), 2012. ,
DOI : 10.1109/InPar.2012.6339609
On implementation of GPU virtualization using PCI pass-through, 4th IEEE International Conference on Cloud Computing Technology and Science Proceedings, pp.711-716, 2012. ,
DOI : 10.1109/CloudCom.2012.6427531