L. Buatois and G. Caumon, Concurrent number cruncher: a GPU implementation of a general sparse linear solver, International Journal of Parallel, Emergent and Distributed Systems, vol.24, issue.3, pp.205-223, 2009.
DOI : 10.1080/17445760802337010
URL : https://hal.archives-ouvertes.fr/inria-00331906

R. Helfenstein and J. Koko, Parallel preconditioned conjugate gradient algorithm on GPU, Journal of Computational and Applied Mathematics, vol.236, issue.15, pp.3584-3590, 2012.
DOI : 10.1016/j.cam.2011.04.025

G. A. Gravvanis, C. K. Filelis-Papadopoulos, and K. M. Giannoutakis, Solving finite difference linear systems on GPUs: CUDA based Parallel Explicit Preconditioned Biconjugate Conjugate Gradient type Methods, The Journal of Supercomputing, vol.61, issue.3, pp.590-604, 2012.
DOI : 10.1007/s11227-011-0619-z

V. Galiano, H. Migallón, and V. Migallón, GPU-based parallel algorithms for sparse nonlinear systems, Journal of Parallel and Distributed Computing, vol.72, issue.9, pp.1098-1105, 2012.
DOI : 10.1016/j.jpdc.2011.10.016

N. Bell and M. Garland, Efficient sparse matrix-vector multiplication on CUDA, 2008.