A contention-aware performance model for HPC-based networks: A case study of the InfiniBand network - Archive ouverte HAL Access content directly
Conference Papers Year : 2011

A contention-aware performance model for HPC-based networks: A case study of the InfiniBand network

Maxime Martinasso
  • Function : Author
  • PersonId : 838777

Abstract

Multi-core clusters are cost-effective clusters largely used in high-performance computing. Parallel applications using message passing as a communication mechanism may introduce complex communication behaviours on such clusters. By sending and receiving data simultaneously from and to several nodes, parallel applications create concurrent accesses to the resources of the network. In this paper, we present a general model that expresses network resource sharing characterised by a dynamic contention graph. The model is based on a linear system weighted by bandwidth distribution factors called penalty coefficients that are specific to a network technology. We propose a method to solve the linear system and present an analysis to determine penalty coefficients on InfiniBand technology. We use complex network conflicts to assess the ability of the model to predict with low errors.

Dates and versions

hal-00690876 , version 1 (24-04-2012)

Identifiers

Cite

Maxime Martinasso, Jean-François Méhaut. A contention-aware performance model for HPC-based networks: A case study of the InfiniBand network. Euro-Par 2011 : Proceedings of the 17th International Euro-Par Conference, Aug 2011, Bordeaux, France. pp.91-102, ⟨10.1007/978-3-642-23400-2_10⟩. ⟨hal-00690876⟩
150 View
0 Download

Altmetric

Share

Gmail Facebook Twitter LinkedIn More