Ocelotl: Large Trace Overviews Based on Multidimensional Data Aggregation - Archive ouverte HAL Access content directly
Conference Papers Year :

Ocelotl: Large Trace Overviews Based on Multidimensional Data Aggregation

(1) , (1) , (2) , (1) , (3)
1
2
3

Abstract

Performance analysis of parallel applications is commonly based on execution traces that might be investigated through visualization techniques. The weak scalability of such techniques appears when traces get larger both in time (many events registered) and space (many processing elements), a very common situation for current large-scale HPC applications. In this paper we present an approach to tackle such scenarios in order to give a correct overview of the behavior registered in very large traces. Two configurable and controlled aggregation-based techniques are presented: one based exclusively on the temporal aggregation, and another that consists in a spatiotemporal aggregation algorithm. The paper also details the implementation and evaluation of these techniques in Ocelotl, a performance analysis and visualization tool that overcomes the current graphical and interpretation limitations by providing a concise overview registered on traces. The experimental results show that Ocelotl helps in detecting quickly and accurately anomalies in 8 GB traces containing up to two hundred million of events.
Fichier principal
Vignette du fichier
ocelotl.pdf (518.64 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Loading...

Dates and versions

hal-01120179 , version 1 (25-02-2015)

Identifiers

  • HAL Id : hal-01120179 , version 1

Cite

Damien Dosimont, Youenn Corre, Lucas Mello Schnorr, Guillaume Huard, Jean-Marc Vincent. Ocelotl: Large Trace Overviews Based on Multidimensional Data Aggregation. 8th International Parallel Tools Workshop, Oct 2014, Stuttgart, Germany. ⟨hal-01120179⟩
249 View
184 Download

Share

Gmail Facebook Twitter LinkedIn More