Q. Li, C. Zhong, K. Zhao, X. Mei, and X. Chu, Implementation and Analysis of AES Encryption on GPU, 2012 IEEE 14th International Conference on High Performance Computing and Communication & 2012 IEEE 9th International Conference on Embedded Software and Systems, pp.843-848, 2012.
DOI : 10.1109/HPCC.2012.119

X. Chu and K. Zhao, Practical random linear network coding on GPUs, GPU Solutions to Multi-scale Problems in Science and Engineering, pp.115-130, 2013.
DOI : 10.1007/978-3-642-01399-7_45

Y. Li, K. Zhao, X. Chu, and J. Liu, Speeding up k-Means algorithm by GPUs, Journal of Computer and System Sciences, vol.79, issue.2, pp.216-229, 2013.
DOI : 10.1016/j.jcss.2012.05.004

P. Micikevicius, 3D finite difference computation on GPUs using CUDA, Proceedings of 2nd Workshop on General Purpose Processing on Graphics Processing Units, GPGPU-2, pp.79-84, 2009.
DOI : 10.1145/1513895.1513905

K. Zhao and X. Chu, G-BLASTN: accelerating nucleotide alignment by graphics processors, Bioinformatics, vol.30, issue.10, 2014.
DOI : 10.1093/bioinformatics/btu047

X. Mei, L. S. Yung, K. Zhao, and X. Chu, A measurement study of GPU DVFS on energy conservation, Proceedings of the Workshop on Power-Aware Computing and Systems. Number, 2013.

V. Volkov and J. W. Demmel, Benchmarking GPUs to tune dense linear algebra, 2008 SC, International Conference for High Performance Computing, Networking, Storage and Analysis, 2008.
DOI : 10.1109/SC.2008.5214359

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.218.3436

M. Papadopoulou, M. Sadooghi-alvandi, and H. Wong, Micro-benchmarking the GT200 GPU, 2009.

H. Wong, M. M. Papadopoulou, M. Sadooghi-alvandi, and A. Moshovos, Demystifying GPU microarchitecture through microbenchmarking, 2010 IEEE International Symposium on Performance Analysis of Systems & Software (ISPASS), pp.235-246, 2010.
DOI : 10.1109/ISPASS.2010.5452013

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.189.5309

P. Micikevicius, Local Memory and Register Spilling, NVIDIA Corporation, 2011.

P. Micikevicius, GPU performance analysis and optimization, In: GPU Technology Conference, 2012.

R. H. Saavedra, CPU Performance Evaluation and Execution Time Prediction Using Narrow Spectrum Benchmarking, 1992.

R. H. Saavedra and A. J. Smith, Measuring cache and TLB performance and their effect on benchmark runtimes. Computers, IEEE Transactions on, vol.44, pp.1223-1235, 1995.
DOI : 10.1109/12.467697