Skip to Main content Skip to Navigation
Conference papers

A simple dynamic bandit algorithm for hyper-parameter tuning

Xuedong Shang 1 Emilie Kaufmann 1 Michal Valko 2, 1
1 SEQUEL - Sequential Learning
Inria Lille - Nord Europe, CRIStAL - Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189
Abstract : Hyper-parameter tuning is a major part of modern machine learning systems. The tuning itself can be seen as a sequential resource allocation problem. As such, methods for multi-armed bandits have been already applied. In this paper, we view hyper-parameter optimization as an instance of best-arm identification in infinitely many-armed bandits. We propose D-TTTS, a new adaptive algorithm inspired by Thompson sampling, which dynamically balances between refining the estimate of the quality of hyper-parameter configurations previously explored and adding new hyper-parameter configurations to the pool of candidates. The algorithm is easy to implement and shows competitive performance compared to state-of-the-art algorithms for hyper-parameter tuning.
Document type :
Conference papers
Complete list of metadata

Cited literature [25 references]  Display  Hide  Download
Contributor : Michal Valko Connect in order to contact the contributor
Submitted on : Saturday, June 1, 2019 - 11:49:38 PM
Last modification on : Tuesday, January 4, 2022 - 6:14:25 AM


Files produced by the author(s)


  • HAL Id : hal-02145200, version 1


Xuedong Shang, Emilie Kaufmann, Michal Valko. A simple dynamic bandit algorithm for hyper-parameter tuning. Workshop on Automated Machine Learning at International Conference on Machine Learning, AutoML@ICML 2019 - 6th ICML Workshop on Automated Machine Learning, Jun 2019, Long Beach, United States. ⟨hal-02145200⟩



Les métriques sont temporairement indisponibles