TAPIOCA: An I/O Library for Optimized Topology-Aware Data Aggregation on Large-Scale Supercomputers

François Tessier; Venkatram Vishwanath; Emmanuel Jeannot

doi:10.1109/CLUSTER.2017.80

Communication Dans Un Congrès Année : 2017

TAPIOCA: An I/O Library for Optimized Topology-Aware Data Aggregation on Large-Scale Supercomputers

(1) , (1) , (2)

1
2

François Tessier

Fonction : Auteur
PersonId : 2396
IdHAL : francois-tessier
ORCID : 0000-0003-4441-7898
IdRef : 189616369

Argonne National Laboratory [Lemont]

Venkatram Vishwanath

Fonction : Auteur

Argonne National Laboratory [Lemont]

Emmanuel Jeannot

Fonction : Auteur
PersonId : 15678
IdHAL : emmanuel-jeannot
ORCID : 0000-0002-3956-2997
IdRef : 084595108

Topology-Aware System-Scale Data Management for High-Performance Computing

Résumé

Reading and writing data efficiently from storage system is necessary for most scientific simulations to achieve good performance at scale. Many software solutions have been developed to decrease the I/O bottleneck. One well-known strategy, in the context of collective I/O operations, is the two-phase I/O scheme. This strategy consists of selecting a subset of processes to aggregate contiguous pieces of data before performing reads/writes. In this paper, we present TAPIOCA, an MPI-based library implementing an efficient topology-aware two-phase I/O algorithm. We show how TAPIOCA can take advantage of double-buffering and one-sided communication to reduce as much as possible the idle time during data aggregation. We also introduce our cost model leading to a topology-aware aggregator placement optimizing the movements of data. We validate our approach at large scale on two leadership-class supercomputers: Mira (IBM BG/Q) and Theta (Cray XC40). We present the results obtained with TAPIOCA on a micro-benchmark and the I/O kernel of a large-scale simulation. On both architectures, we show a substantial improvement of I/O performance compared with the default MPI I/O implementation. On BG/Q+GPFS, for instance, our algorithm leads to a performance improvement by a factor of twelve while on the Cray XC40 system associated with a Lustre filesystem, we achieve an improvement of four.

Domaines

Calcul parallèle, distribué et partagé [cs.DC]

Fichier principal

paper_version_publiée.pdf (497.37 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Emmanuel Jeannot : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01621344

Soumis le : mardi 24 octobre 2017-10:13:59

Dernière modification le : vendredi 24 mars 2023-14:53:05

Archivage à long terme le : jeudi 25 janvier 2018-12:23:27

Dates et versions

hal-01621344 , version 1 (24-10-2017)

Identifiants

HAL Id : hal-01621344 , version 1
DOI : 10.1109/CLUSTER.2017.80

Citer

François Tessier, Venkatram Vishwanath, Emmanuel Jeannot. TAPIOCA: An I/O Library for Optimized Topology-Aware Data Aggregation on Large-Scale Supercomputers. CLUSTER 2017 - IEEE International Conference on Cluster Computing, Sep 2017, Honolulu, United States. pp.1-11, ⟨10.1109/CLUSTER.2017.80⟩. ⟨hal-01621344⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA INRIA2

283 Consultations

280 Téléchargements

TAPIOCA: An I/O Library for Optimized Topology-Aware Data Aggregation on Large-Scale Supercomputers

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager