inria-00116897, version 3
A Framework for Adaptive Collective Communications on Heterogeneous Hierarchical Networks
(2006)
Abstract: Today, due to the wide variety of existing parallel systems consisting of collections of heterogeneous machines, it is very difficult for a user to solve a target problem by using a single algorithm or to write portable programs that perform well on multiple computing platforms. The inherent heterogeneity and the diversity of networks in such environments make it a great challenge to model the communications of high performance computing applications. Our objective in this work is to propose a generic framework based on communication models and adaptive techniques for predicting communication performance on cluster-based hierarchical platforms. Toward this goal, we introduce the concept of a polyalgorithmic model of communications, which corresponds to the selection of the best-suited communication algorithms and scheduling strategies, given the characteristics of the hardware resources of the target parallel system. We apply this methodology to collective communication operations and show that the framework yields significant performance while determining the best algorithm depending on the problem and architecture parameters.
- a – Université Nancy II
- 1: INRIA – CNRS : UMR7503 – Université Henri Poincaré - Nancy I – Université Nancy II – Institut National Polytechnique de Lorraine (INPL)
- Collaboration : Grid'5000
- Domain : Computer Science/Distributed, Parallel, and Cluster Computing; Computer Science/Modeling and Simulation
- Keywords : Cluster computing – Performance modeling – Adaptive techniques – Polymodels of communications – Collective communication operations
- Comment : Extended version of the IPDPS 2006 paper
- Available versions : v1 (2006-11-28) v2 (2006-11-29) v3 (2006-11-30)
- http://hal.inria.fr/inria-00116897
- oai:hal.inria.fr:inria-00116897
- Submitted on: Thursday, 30 November 2006 10:04:45
- Updated on: Monday, 23 April 2012 16:32:29




