Detecção de Anomalias de Desempenho em Aplicações de Alto Desempenho baseadas em Tarefas em Clusters Híbridos

Abstract: Programming paradigms in High-Performance Computing have been shifting towards task-based models, which adapt more readily to heterogeneous and scalable supercomputers. Detecting performance anomalies in such environments is particularly difficult because it must account for architecture heterogeneity, variability, and the ability to obtain trustworthy measurements. This work presents a case study on the detection of anomalies in the execution of the well-known tiled dense Cholesky factorization developed with StarPU. Our experiments were conducted on a variety of hybrid multi-node platforms to demonstrate how we are able to detect and highlight performance anomalies.
Document type:
Conference paper
17º Workshop em Desempenho de Sistemas Computacionais e de Comunicação (WPerformance), Jul 2018, Natal, Brazil

https://hal.inria.fr/hal-01842038
Contributor: Samuel Thibault
Submitted on: Tuesday, 17 July 2018 - 18:33:35
Last modified on: Tuesday, 31 July 2018 - 13:53:11

File

181587_1.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-01842038, version 1

Citation

Vinicius Pinto, Lucas Mello Schnorr, Arnaud Legrand, Samuel Thibault, Luka Stanisic, et al. Detecção de Anomalias de Desempenho em Aplicações de Alto Desempenho baseadas em Tarefas em Clusters Híbridos. 17º Workshop em Desempenho de Sistemas Computacionais e de Comunicação (WPerformance), Jul 2018, Natal, Brazil. 〈hal-01842038〉
