Bandits attack function optimization

Philippe Preux; Rémi Munos; Michal Valko

Communication Dans Un Congrès Année : 2014

Bandits attack function optimization

(1) , (1) , (1)

Philippe Preux

Fonction : Auteur
PersonId : 5488
IdHAL : preux-philippe
IdRef : 059896353

Sequential Learning

Rémi Munos

Fonction : Auteur
PersonId : 836863

Sequential Learning

Michal Valko

Fonction : Auteur
PersonId : 284
IdHAL : michal
IdRef : 22360934X

Sequential Learning

Résumé

We consider function optimization as a sequential decision making problem under the budget constraint. Such constraint limits the number of objective function evaluations allowed during the optimization. We consider an algorithm inspired by a continuous version of a multi-armed bandit problem which attacks this optimization problem by solving the tradeoff between exploration (initial quasi-uniform search of the domain) and exploitation (local optimization around the potentially global maxima). We introduce the so-called Simultaneous Optimistic Optimization (SOO), a deterministic algorithm that works by domain partitioning. The benefit of such an approach are the guarantees on the returned solution and the numerical eficiency of the algorithm. We present this machine learning rooted approach to optimization, and provide the empirical assessment of SOO on the CEC'2014 competition on single objective real-parameter numerical optimization testsuite.

Domaines

Machine Learning [stat.ML]

Fichier principal

preux2014bandits.pdf (169.3 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Michal Valko : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00978637

Soumis le : lundi 14 avril 2014-14:32:03

Dernière modification le : vendredi 24 mars 2023-14:52:58

Archivage à long terme le : lundi 14 juillet 2014-11:45:56

Dates et versions

hal-00978637 , version 1 (14-04-2014)

Identifiants

HAL Id : hal-00978637 , version 1

Citer

Philippe Preux, Rémi Munos, Michal Valko. Bandits attack function optimization. IEEE Congress on Evolutionary Computation, Jul 2014, Beijing, China. ⟨hal-00978637⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-LILLE3 CNRS INRIA LAGIS CRISTAL INRIA2 CRISTAL-SEQUEL

388 Consultations

554 Téléchargements

Bandits attack function optimization

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager