A contention-aware performance model for HPC-based networks: A case study of the InfiniBand network

Maxime Martinasso 1 Jean-François Méhaut 1
1 MESCAL - Middleware efficiently scalable
Inria Grenoble - Rhône-Alpes, LIG - Laboratoire d'Informatique de Grenoble
Abstract : Multi-core clusters are cost-effective clusters largely used in high-performance computing. Parallel applications using message passing as a communication mechanism may introduce complex communication behaviours on such clusters. By sending and receiving data simultaneously from and to several nodes, parallel applications create concurrent accesses to the resources of the network. In this paper, we present a general model that expresses network resource sharing characterised by a dynamic contention graph. The model is based on a linear system weighted by bandwidth distribution factors called penalty coefficients that are specific to a network technology. We propose a method to solve the linear system and present an analysis to determine penalty coefficients on InfiniBand technology. We use complex network conflicts to assess the ability of the model to predict with low errors.
Document type :
Conference papers
Liste complète des métadonnées

https://hal.inria.fr/hal-00690876
Contributor : Ist Rennes <>
Submitted on : Tuesday, April 24, 2012 - 3:56:10 PM
Last modification on : Thursday, October 11, 2018 - 8:48:02 AM

Links full text

Identifiers

Collections

Citation

Maxime Martinasso, Jean-François Méhaut. A contention-aware performance model for HPC-based networks: A case study of the InfiniBand network. Euro-Par 2011 : Proceedings of the 17th International Euro-Par Conference, Aug 2011, Bordeaux, France. pp.91-102, ⟨10.1007/978-3-642-23400-2_10⟩. ⟨hal-00690876⟩

Share

Metrics

Record views

215