An Efficient Method for Determining Full Point-to-Point Latency of Arbitrary Indirect HPC Networks

Chengchun Liu; Zhang Yang; Limin Xiao; Baicheng Yan; Zhihao Wang; Hongyun Tian

doi:10.1007/978-3-030-05677-3_5

Communication Dans Un Congrès Année : 2018

An Efficient Method for Determining Full Point-to-Point Latency of Arbitrary Indirect HPC Networks

(1) , (2) , (1) , (1) , (1) , (2)

1
2

Chengchun Liu

Fonction : Auteur

School of Computer Science and Engineering [Beijing]

Zhang Yang

Fonction : Auteur
PersonId : 1053401

Institute of Applied Physics and Computational Mathematics - IACM (Beijing, China))

Limin Xiao

Fonction : Auteur
PersonId : 1006934

School of Computer Science and Engineering [Beijing]

Baicheng Yan

Fonction : Auteur

School of Computer Science and Engineering [Beijing]

Zhihao Wang

Fonction : Auteur

School of Computer Science and Engineering [Beijing]

Hongyun Tian

Fonction : Auteur

Institute of Applied Physics and Computational Mathematics - IACM (Beijing, China))

Résumé

Point-to-point latency is one of the most important metrics for high performance computer networks and is used widely in communication performance modeling, link-failure detection, and application optimization. However, it is often hard to determine the full-scale point-to-point latency of large scale HPC networks since it often requires measurements to the square of the number of terminal nodes. In this paper, we propose an efficient method to generate measurement plans for arbitrary indirect HPC networks and reduces the measurement requirements from $$O(n^2)$$ to m, which is often O(n) in modern indirect networks containing n nodes and m links, thus significantly reduces the latency measure overhead. Both analysis and experiments show that the proposed method can reduce the overhead of large-scale fat-tree networks by orders of magnitudes.

Domaines

Informatique [cs]

Fichier principal

477597_1_En_5_Chapter.pdf (542.21 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Hal Ifip : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-02279556

Soumis le : jeudi 5 septembre 2019-13:31:24

Dernière modification le : mercredi 3 novembre 2021-06:39:10

Archivage à long terme le : jeudi 6 février 2020-05:00:48

Dates et versions

hal-02279556 , version 1 (05-09-2019)

Licence

Paternité

Identifiants

HAL Id : hal-02279556 , version 1
DOI : 10.1007/978-3-030-05677-3_5

Citer

Chengchun Liu, Zhang Yang, Limin Xiao, Baicheng Yan, Zhihao Wang, et al.. An Efficient Method for Determining Full Point-to-Point Latency of Arbitrary Indirect HPC Networks. 15th IFIP International Conference on Network and Parallel Computing (NPC), Nov 2018, Muroran, Japan. pp.52-63, ⟨10.1007/978-3-030-05677-3_5⟩. ⟨hal-02279556⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

IFIP-LNCS IFIP IFIP-TC IFIP-TC10 IFIP-NPC IFIP-WG10-3 IFIP-LNCS-11276

51 Consultations

62 Téléchargements

An Efficient Method for Determining Full Point-to-Point Latency of Arbitrary Indirect HPC Networks

Résumé

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager