Optimization Metrics for the Evaluation of Batch Schedulers in HPC - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2023

Optimization Metrics for the Evaluation of Batch Schedulers in HPC

Résumé

Machine Learning techniques are taking a prominent position in the design of system softwares. In HPC, many work are proposing to use such techniques (specifically Reinforcement Learning) to improve the performance of batch schedulers. Their main limitation is the lack of transparency of their decision. This underlines the importance of choosing correctly the optimization criteria when evaluating these solutions. In this work, we discuss bias and limitations of the most frequent optimization metrics in the literature. We provide elements on how to evaluate performance when studying HPC batch scheduling. We also propose a new metric: the standard deviation of the utilization, which we believe can be used when the utilization reaches its limits. We then experimentally evaluate these limitations by focusing on the use-case of runtime estimates. One of the information that HPC batch schedulers use to schedule jobs on the available resources is user runtime estimates: an estimation provide by the user of how long their job will run on the machine. These estimates are known to be inaccurate, hence many work have focused on improving runtime prediction.
Fichier principal
Vignette du fichier
main.pdf (1.66 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-04042591 , version 1 (23-03-2023)
hal-04042591 , version 2 (04-07-2023)

Identifiants

  • HAL Id : hal-04042591 , version 2

Citer

Robin Boëzennec, Fanny Dufossé, Guillaume Pallez. Optimization Metrics for the Evaluation of Batch Schedulers in HPC. JSSPP 2023 - 26th edition of the workshop on Job Scheduling Strategies for Parallel Processing, May 2023, St. Petersburg, Florida, United States. pp.1-19. ⟨hal-04042591v2⟩
113 Consultations
158 Téléchargements

Partager

Gmail Facebook X LinkedIn More