A Performance Analysis Framework for Identifying Potential Benefits in GPGPU Applications, 17th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP '12), pp.11-22, 2012. ,
A New Composite CPU/Memory Model for Predicting Efficiency of Multi-core Processing, The 20th IEEE International Symposium on High Performance Computer Architecture (HPCA-2014) workshop, 2014. ,
Accelerating Single Iteration Performance of CUDA-Based 3D ReactionDiffusion Simulations, International Journal of Parallel Programming, vol.42, issue.2, pp.343-363, 2014. ,
Data Structures and Algorithms for Counting Problems on Graphs using GPU, International Journal of Networking and Computing, vol.3, issue.2, pp.264-288, 2013. ,
DOI : 10.15803/ijnc.3.2_264
Composite Prediction Model and Task Distribution on a Cloud of Multi-core Processors, IEEE International Conference on High Performance Computing (HiPC-14) workshop, 2013. ,
A quantitative performance analysis model for GPU architectures, 2011 IEEE 17th International Symposium on High Performance Computer Architecture, pp.382-393, 2011. ,
DOI : 10.1109/HPCA.2011.5749745
GPU Computing Webinar Available from: http://on-demand.gputechconf.com/gtc-express, 2011. ,
CUDA Performance: Maximizing Instruction-Level Parallelism Available from: http://continuum.io/blog/cudapy ilp opt, 2013. ,