R. Bisseling, Parallel scientific computation: a structured approach using BSP and MPI, 2004.
DOI : 10.1093/acprof:oso/9780198529392.001.0001

O. Bonorden, B. Juurlink, I. von Otte, and I. Rieping, The Paderborn University BSP (PUB) library, Parallel Computing, vol.29, issue.2, pp.187-207, 2003.
DOI : 10.1016/S0167-8191(02)00218-1

F. Broquedis, J. Clet-Ortega, S. Moreaud, N. Furmento, B. Goglin et al., hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing, pp.180-186, 2010.
DOI : 10.1109/PDP.2010.67
URL : https://hal.archives-ouvertes.fr/inria-00429889

J. Hill, B. McColl, D. Stefanescu, M. Goudreau, K. Lang et al., BSPlib: The BSP programming library, Parallel Computing, vol.24, issue.14, pp.1947-1980, 1998.
DOI : 10.1016/S0167-8191(98)00093-3

O. Lobachev, M. Guthe, and R. Loogen, Estimating parallel performance, Journal of Parallel and Distributed Computing, vol.73, issue.6, pp.876-887, 2013.
DOI : 10.1016/j.jpdc.2013.01.011

A. Savadi and H. Deldari, Measurement of the latency parameters of the Multi-BSP model: a multicore benchmarking approach, The Journal of Supercomputing, vol.67, issue.2, pp.565-584, 2014.
DOI : 10.1007/s11227-013-1018-4

L. Valiant, A bridging model for parallel computation, Communications of the ACM, vol.33, issue.8, pp.103-111, 1990.
DOI : 10.1145/79173.79181

L. Valiant, A bridging model for multi-core computing, Journal of Computer and System Sciences, vol.77, issue.1, pp.154-166, 2011.
DOI : 10.1016/j.jcss.2010.06.012

A. N. Yzelman, Fast sparse matrix-vector multiplication by partitioning and reordering, 2011.