Adaptive Operator Selection with Dynamic Multi-Armed Bandits

Luis da Costa; Álvaro Fialho; Marc Schoenauer; Michèle Sebag

doi:10.1145/1389095.1389272

Communication Dans Un Congrès Année : 2008

Adaptive Operator Selection with Dynamic Multi-Armed Bandits

(1, 2) , (3) , (1, 2, 3) , (1, 2, 3)

1
2
3

Luis da Costa

Fonction : Auteur
PersonId : 849052

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Álvaro Fialho

Fonction : Auteur
PersonId : 849053

Microsoft Research - Inria Joint Centre

Marc Schoenauer

Fonction : Auteur correspondant
PersonId : 739309
IdHAL : evomarc
ORCID : 0000-0003-1450-6830
IdRef : 057775575

Connectez-vous pour contacter l'auteur

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Microsoft Research - Inria Joint Centre

Michèle Sebag

Fonction : Auteur
PersonId : 836537

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Microsoft Research - Inria Joint Centre

Résumé

An important step toward self-tuning Evolutionary Algorithms is to design efficient Adaptive Operator Selection procedures. Such a procedure is made of two main components: a credit assignment mechanism, that computes a reward for each operator at hand based on some characteristics of the past offspring; and an adaptation rule, that modifies the selection mechanism based on the rewards of the different operators. This paper is concerned with the latter, and proposes a new approach for it based on the well-known Multi-Armed Bandit paradigm. However, because the basic Multi-Armed Bandit methods have been developed for static frameworks, a specific Dynamic Multi-Armed Bandit algorithm is proposed, that hybridizes an optimal Multi-Armed Bandit algorithm with the statistical Page-Hinkley test, which enforces the efficient detection of changes in time series. This original Operator Selection procedure is then compared to the state-of-the-art rules known as Probability Matching and Adaptive Pursuit on several artificial scenarios, after a careful sensitivity analysis of all methods. The Dynamic Multi-Armed Bandit method is found to outperform the other methods on a scenario from the literature, while on another scenario, the basic Multi-Armed Bandit performs best.

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

pap333s1-dacosta.pdf (630.87 Ko)

slidesAOSforGECCO2008.pdf (367.11 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Format : Autre

Álvaro Fialho : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00278542

Soumis le : mercredi 3 décembre 2008-14:12:09

Dernière modification le : lundi 12 février 2024-09:48:04

Archivage à long terme le : mercredi 22 septembre 2010-11:12:27

Dates et versions

inria-00278542 , version 1 (13-05-2008)

inria-00278542 , version 2 (03-12-2008)

Identifiants

HAL Id : inria-00278542 , version 2
DOI : 10.1145/1389095.1389272

Citer

Luis da Costa, Álvaro Fialho, Marc Schoenauer, Michèle Sebag. Adaptive Operator Selection with Dynamic Multi-Armed Bandits. Genetic and Evolutionary Computation Conference (GECCO), ACM, Jul 2008, Atlanta, United States. pp.913-920, ⟨10.1145/1389095.1389272⟩. ⟨inria-00278542v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EC-PARIS CNRS INRIA UMR8623 INRIA2 LRI-AO UNIV-PARIS-SACLAY

347 Consultations

1874 Téléchargements

Adaptive Operator Selection with Dynamic Multi-Armed Bandits

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager