A Visual Performance Analysis Framework for Task-based Parallel Applications running on Hybrid Clusters - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2017

A Visual Performance Analysis Framework for Task-based Parallel Applications running on Hybrid Clusters

Résumé

Programming paradigms in High-Performance Computing have been shifting towards task-based models which are capable of adapting readily to heterogeneous and scalable supercomputers. The performance of task-based application heavily depends on the runtime scheduling heuristics and on its ability to exploit computing and communication resources. Unfortunately, the traditional performance analysis strategies are unfit to fully understand task-based runtime systems and applications: they expect a regular behavior with communication and computation phases, while task-based applications demonstrate no clear phases. Moreover, the finer granularity of task-based applications typically induces a stochastic behavior that leads to irregular structures that are difficult to analyze. This paper details a flexible framework combining visualization panels to understand and pinpoint performance problems incurred by bad scheduling decisions in task-based applications. Three case-studies using StarPU-MPI, a task-based multi-node runtime system, are detailed to show how our framework is used to study the performance of the well-known Cholesky fac-torization. Performance improvements include a better task partitioning among the multi-(GPU,core) to get closer to theoretical lower bounds, improved MPI pipelin-ing in multi-(node,core,GPU) to reduce the slow start, and changes in the runtime system to increase MPI bandwidth, with gains of up to 13% in the total makespan.
Fichier principal
Vignette du fichier
CCPE_article_submitted_2017_09_29-gz.pdf (4.81 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01616632 , version 1 (19-10-2017)
hal-01616632 , version 2 (17-07-2018)

Identifiants

  • HAL Id : hal-01616632 , version 1

Citer

Vinicius Garcia Pinto, Lucas Mello Schnorr, Luka Stanisic, Arnaud Legrand, Samuel Thibault, et al.. A Visual Performance Analysis Framework for Task-based Parallel Applications running on Hybrid Clusters. 2017. ⟨hal-01616632v1⟩
1012 Consultations
847 Téléchargements

Partager

Gmail Facebook X LinkedIn More