S. Balay, W. D. Gropp, L. C. Mcinnes, and B. F. Smith, Efficient Management of Parallelism in Object-Oriented Numerical Software Libraries, Modern Software Tools in Scientific Computing, pp.163-202, 1997.
DOI : 10.1007/978-1-4612-1986-6_8

S. Balay, J. Brown, K. Buschelman, V. Eijkhout, W. D. Gropp et al., PETSc users manual, 2011.

R. H. Bisseling, Parallel iterative solution of sparse linear systems on a transputer network, Parallel Computation, pp.253-271, 1993.

R. H. Bisseling, Parallel Scientific Computation: A Structured Approach using BSP and MPI, 2004.
DOI : 10.1093/acprof:oso/9780198529392.001.0001

R. H. Bisseling and W. Meesen, Communication balancing in parallel sparse matrix-vector multiplication, Electron. Trans. Numer. Anal, vol.21, pp.47-65, 2005.

R. H. Bisseling, B. O. Auer, A. Yzelman, T. Van-leeuwen, and Ü. V. Çatalyürek, Two-Dimensional Approaches to Sparse Matrix Partitioning, Combinatorial Scientific Computing, chapter 12, 2012.
DOI : 10.1201/b11644-13

E. Boman, K. Devine, R. Heaphy, B. Hendrickson, V. Leung et al., Zoltan 3.0: Parallel Partitioning, Load Balancing, and Data-Management Services ; User's Guide, Sandia National Laboratories, 2007.

J. Byun, R. Lin, J. W. Demmel, and K. A. Yelick, pOSKI: Parallel Optimized Sparse Kernel Interface Library User's Guide for Version 1.0.0, Berkeley Benchmarking and Optimization, 2012.

Ü. V. Çatalyürek and C. Aykanat, A hypergraph model for mapping repeated sparse matrix-vector product computations onto multicomputers, HiPC'95, 1995.

Ü. V. Çatalyürek and C. Aykanat, Hypergraph-partitioning-based decomposition for parallel sparse-matrix vector multiplication, IEEE Transactions on Parallel and Distributed Systems, vol.10, issue.7, pp.673-693, 1999.
DOI : 10.1109/71.780863

Ü. V. Çatalyürek and C. Aykanat, PaToH: A Multilevel Hypergraph Partitioning Tool, Version 3.0

Ü. V. Çatalyürek and C. Aykanat, A fine-grain hypergraph model for 2D decomposition of sparse matrices, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001, 2001.
DOI : 10.1109/IPDPS.2001.925093

Ü. V. Çatalyürek and C. Aykanat, A hypergraph-partitioning approach for coarse-grain decomposition, Supercomputing'01, 2001.

Ü. V. Çatalyürek, K. Kaya, and B. Uçar, On shared-memory parallelization of a sparse matrix scaling algoritm, ICPP'12, 2012.

Ü. V. Çatalyürek, C. Aykanat, and B. Uçar, On Two-Dimensional Sparse Matrix Partitioning: Models, Methods, and a Recipe, SIAM Journal on Scientific Computing, vol.32, issue.2, pp.656-683, 2010.
DOI : 10.1137/080737770

B. Hendrickson and T. G. Kolda, Graph partitioning models for parallel computing, Parallel Computing, vol.26, issue.12, pp.1519-1534, 2000.
DOI : 10.1016/S0167-8191(00)00048-X

B. Hendrickson, R. W. Leland, and S. Plimpton, AN EFFICIENT PARALLEL ALGORITHM FOR MATRIX-VECTOR MULTIPLICATION, International Journal of High Speed Computing, vol.07, issue.01, pp.73-88, 1995.
DOI : 10.1142/S0129053395000051

M. A. Heroux, R. A. Bartlett, V. E. Howle, R. J. Hoekstra, J. J. Hu et al., An overview of the Trilinos project, ACM Transactions on Mathematical Software, vol.31, issue.3, pp.31397-423, 2005.
DOI : 10.1145/1089014.1089021

S. Lee and R. Eigenmann, Adaptive runtime tuning of parallel sparse matrix-vector multiplication on distributed memory systems, Proceedings of the 22nd annual international conference on Supercomputing , ICS '08, pp.195-204, 2008.
DOI : 10.1145/1375527.1375558

T. Lengauer, Combinatorial Algorithms for Integrated Circuit Layout, 1990.
DOI : 10.1007/978-3-322-92106-2

J. G. Lewis and R. A. Van-de-geijn, Matrix-vector multiplication and conjugate gradient algorithms on distributed memory computers, Proceedings of IEEE Scalable High Performance Computing Conference, pp.484-492, 1993.
DOI : 10.1109/SHPCC.1994.296689

A. T. Ogielski and W. Aiello, Sparse Matrix Computations on Parallel Processor Arrays, SIAM Journal on Scientific Computing, vol.14, issue.3, pp.519-530, 1993.
DOI : 10.1137/0914033

A. P?nar and C. Aykanat, Fast optimal load balancing algorithms for 1D partitioning, Journal of Parallel and Distributed Computing, vol.64, issue.8, pp.974-996, 2004.
DOI : 10.1016/j.jpdc.2004.05.003

Y. Saad and A. V. Malevsky, P-SPARSLIB: A portable library of distributed memory sparse iterative solvers, 1995.

E. Saule, E. Ö. Ba?, and Ü. V. Çatalyürek, Load-balancing spatially located computations using rectangular partitions, Journal of Parallel and Distributed Computing, vol.72, issue.10, pp.1201-1214, 2012.
DOI : 10.1016/j.jpdc.2012.05.013

URL : http://arxiv.org/abs/1104.2566

B. Uçar and C. Aykanat, Encapsulating Multiple Communication-Cost Metrics in Partitioning Sparse Rectangular Matrices for Parallel Matrix-Vector Multiplies, SIAM Journal on Scientific Computing, vol.25, issue.6, pp.1837-1859, 2004.
DOI : 10.1137/S1064827502410463

B. Uçar and C. Aykanat, A library for parallel sparse matrix-vector multiplies, 2005.

B. Uçar and C. Aykanat, Revisiting Hypergraph Models for Sparse Matrix Partitioning, SIAM Review, vol.49, issue.4, pp.595-603, 2007.
DOI : 10.1137/060662459

B. Uçar and Ü. V. Çatalyürek, On the Scalability of Hypergraph Models for Sparse Matrix Partitioning, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing, pp.593-600, 2010.
DOI : 10.1109/PDP.2010.92

B. Uçar, Ü. V. Çatalyürek, and C. Aykanat, A Matrix Partitioning Interface to PaToH in MATLAB, Parallel Computing, vol.36, issue.5-6, pp.254-272, 2010.
DOI : 10.1016/j.parco.2009.12.008

B. Vastenhouw and R. H. Bisseling, A Two-Dimensional Data Distribution Method for Parallel Sparse Matrix-Vector Multiplication, SIAM Review, vol.47, issue.1, pp.67-95, 2005.
DOI : 10.1137/S0036144502409019

A. N. Yzelman, D. Roose, and . Ku-leuven, High-level strategies for parallel sharedmemory sparse matrix?vector multiplication, 2012.

R. N°-8301 and R. Centre-grenoble-?-rhône-alpes, Inovallée 655 avenue de l'Europe Montbonnot 38334 Saint Ismier Cedex Publisher Inria Domaine de Voluceau -Rocquencourt BP 105 -78153 Le Chesnay Cedex inria, pp.249-6399