Differential Evolution Algorithm Applied to Non-Stationary Bandit Problem

David L. St-Pierre; Jialin Liu

Conference paper, Year: 2014

David L. St-Pierre

Role: Author

Machine Learning and Optimisation

Department of Electrical Engineering and Computer Science

Jialin Liu

Role: Author
PersonId : 579
IdHAL : jialin-liu
IdRef : 192885766

Machine Learning and Optimisation

Laboratoire de Recherche en Informatique

Abstract

In this paper, we compare Differential Evolution (DE), an evolutionary algorithm, with classical bandit algorithms on the non-stationary bandit problem. First, we define a test case in which the variation of the distributions depends on the number of times an option is evaluated rather than on elapsed time. This definition makes it possible to apply these algorithms to a wide range of problems, such as black-box portfolio selection. Second, we present our own variant of the discounted Upper Confidence Bound (UCB) algorithm, which outperforms the current state-of-the-art algorithms for the non-stationary bandit problem. Third, we introduce a variant of DE and show that, when selecting among a portfolio of solvers for the Cart-Pole problem, our version of DE outperforms the current best UCB algorithms.
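To make the test case in the abstract concrete, the following is a minimal Python sketch, not the authors' code. It assumes a hypothetical environment, `PullCountBandit`, in which each arm's mean reward is a function of how many times that arm has been pulled, and it pairs this with a textbook-style discounted UCB rule. The class name, the function `discounted_ucb`, the Gaussian noise model, and the parameters `gamma` and `c` are all illustrative assumptions; the UCB variant and the DE variant proposed in the paper are not reproduced here.

```python
import math
import random


class PullCountBandit:
    """Hypothetical bandit: an arm's mean depends on its own pull count."""

    def __init__(self, mean_fns, noise=0.1, seed=0):
        self.mean_fns = mean_fns              # one function (pull count -> mean) per arm
        self.pulls = [0] * len(mean_fns)      # how often each arm has been pulled
        self.noise = noise                    # std. dev. of Gaussian reward noise (assumption)
        self.rng = random.Random(seed)

    def pull(self, arm):
        # The mean is evaluated at the arm's pull count, not at the global time step.
        mean = self.mean_fns[arm](self.pulls[arm])
        self.pulls[arm] += 1
        return mean + self.rng.gauss(0.0, self.noise)


def discounted_ucb(bandit, horizon, gamma=0.95, c=0.5):
    """Generic discounted UCB (illustrative; not the paper's variant)."""
    k = len(bandit.mean_fns)
    disc_sum = [0.0] * k   # discounted sum of rewards per arm
    disc_cnt = [0.0] * k   # discounted number of pulls per arm
    choices = []
    for t in range(horizon):
        if t < k:
            arm = t  # initial round: play every arm once
        else:
            log_total = max(0.0, math.log(sum(disc_cnt)))
            # Index = discounted mean + exploration bonus.
            arm = max(
                range(k),
                key=lambda i: disc_sum[i] / disc_cnt[i]
                + c * math.sqrt(log_total / disc_cnt[i]),
            )
        reward = bandit.pull(arm)
        # Older observations lose weight at every step.
        for i in range(k):
            disc_sum[i] *= gamma
            disc_cnt[i] *= gamma
        disc_sum[arm] += reward
        disc_cnt[arm] += 1.0
        choices.append(arm)
    return choices
```

As a usage example under the same assumptions, `discounted_ucb(PullCountBandit([lambda n: max(0.0, 0.9 - 0.02 * n), lambda n: 0.5]), 200)` returns the sequence of arms chosen when the first arm degrades with use and the second stays constant.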

Domains

Artificial Intelligence [cs.AI]; Optimization and Control [math.OC]

Main file

main.pdf (312.91 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-00979456

Submitted on: Wednesday, 16 July 2014, 07:00:15

Last modified on: Thursday, 14 March 2024, 03:10:28

Long-term archiving on: Thursday, 20 November 2014, 15:37:44

Dates and versions

hal-00979456 , version 1 (16-07-2014)

Identifiers

HAL Id : hal-00979456 , version 1

Cite

David L. St-Pierre, Jialin Liu. Differential Evolution Algorithm Applied to Non-Stationary Bandit Problem. 2014 IEEE Congress on Evolutionary Computation (IEEE CEC 2014), Jul 2014, Beijing, China. ⟨hal-00979456⟩

Collections

EC-PARIS CNRS INRIA UMR8623 INRIA2 LRI-AO TDS-MACS UNIV-PARIS-SACLAY

360 views

299 downloads
