Adaptive Operator Selection with Dynamic Multi-Armed Bandits

Luis da Costa 1, 2 Álvaro Fialho 3 Marc Schoenauer 1, 2, 3, * Michèle Sebag 1, 2, 3
* Corresponding author
2 TAO - Machine Learning and Optimisation
CNRS - Centre National de la Recherche Scientifique : UMR8623, Inria Saclay - Ile de France, UP11 - Université Paris-Sud - Paris 11, LRI - Laboratoire de Recherche en Informatique
Abstract : An important step toward self-tuning Evolutionary Algorithms is to design efficient Adaptive Operator Selection procedures. Such a procedure is made of two main components: a credit assignment mechanism, that computes a reward for each operator at hand based on some characteristics of the past offspring; and an adaptation rule, that modifies the selection mechanism based on the rewards of the different operators. This paper is concerned with the latter, and proposes a new approach for it based on the well-known Multi-Armed Bandit paradigm. However, because the basic Multi-Armed Bandit methods have been developed for static frameworks, a specific Dynamic Multi-Armed Bandit algorithm is proposed, that hybridizes an optimal Multi-Armed Bandit algorithm with the statistical Page-Hinkley test, which enforces the efficient detection of changes in time series. This original Operator Selection procedure is then compared to the state-of-the-art rules known as Probability Matching and Adaptive Pursuit on several artificial scenarios, after a careful sensitivity analysis of all methods. The Dynamic Multi-Armed Bandit method is found to outperform the other methods on a scenario from the literature, while on another scenario, the basic Multi-Armed Bandit performs best.
Document type :
Conference papers
Complete list of metadatas

https://hal.inria.fr/inria-00278542
Contributor : Álvaro Fialho <>
Submitted on : Wednesday, December 3, 2008 - 2:12:09 PM
Last modification on : Thursday, February 7, 2019 - 4:26:14 PM
Long-term archiving on : Wednesday, September 22, 2010 - 11:12:27 AM

Identifiers

Collections

Citation

Luis da Costa, Álvaro Fialho, Marc Schoenauer, Michèle Sebag. Adaptive Operator Selection with Dynamic Multi-Armed Bandits. Genetic and Evolutionary Computation Conference (GECCO), ACM, Jul 2008, Atlanta, United States. pp.913-920, ⟨10.1145/1389095.1389272⟩. ⟨inria-00278542v2⟩

Share

Metrics

Record views

563

Files downloads

1240