Conference paper, Year: 2014

Differential Evolution Algorithm Applied to Non-Stationary Bandit Problem

Abstract

In this paper we compare Differential Evolution (DE), an evolutionary algorithm, with classical bandit algorithms on the non-stationary bandit problem. First, we define a test case in which the distribution of an option changes with the number of times that option is evaluated rather than with time. This definition makes it possible to apply these algorithms to a wide range of problems, such as black-box portfolio selection. Second, we present our own variant of the discounted Upper Confidence Bound (UCB) algorithm, which outperforms the current state-of-the-art algorithms for the non-stationary bandit problem. Third, we introduce a variant of DE and show that, when selecting among a portfolio of solvers for the Cart-Pole problem, our version of DE outperforms the current best UCB algorithms.
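To make the setting in the abstract concrete, the following is a minimal Python sketch of a bandit whose arm distributions change with the number of evaluations of that arm, paired with a textbook discounted UCB rule. It is not the authors' UCB variant or their DE variant, neither of which is specified on this page. The class `EvaluationDrivenArm`, the function `discounted_ucb`, the linear drift model, and the parameters `gamma` and `exploration` are all assumptions made for illustration.

```python
import math
import random


class EvaluationDrivenArm:
    """Hypothetical arm whose mean reward drifts each time it is pulled."""

    def __init__(self, start_mean, drift_per_pull, noise=0.1, seed=0):
        self.mean = start_mean
        self.drift = drift_per_pull
        self.noise = noise
        self.pulls = 0
        self.rng = random.Random(seed)

    def pull(self):
        reward = self.mean + self.rng.gauss(0.0, self.noise)
        self.pulls += 1
        # The distribution changes with the number of evaluations of
        # this arm, not with global time (assumed linear drift).
        self.mean = min(1.0, max(0.0, self.mean + self.drift))
        return reward


def discounted_ucb(arms, horizon, gamma=0.95, exploration=0.5):
    """Textbook discounted UCB (illustrative); returns chosen arm indices."""
    k = len(arms)
    disc_counts = [0.0] * k  # discounted number of pulls per arm
    disc_sums = [0.0] * k    # discounted sum of rewards per arm
    choices = []
    for t in range(horizon):
        if t < k:
            chosen = t  # initial round: play every arm once
        else:
            total = sum(disc_counts)

            def index(i):
                mean = disc_sums[i] / disc_counts[i]
                bonus = math.sqrt(
                    exploration * math.log(max(total, 1.0)) / disc_counts[i]
                )
                return mean + bonus

            chosen = max(range(k), key=index)
        reward = arms[chosen].pull()
        # Discount all past observations, then record the new one.
        for i in range(k):
            disc_counts[i] *= gamma
            disc_sums[i] *= gamma
        disc_counts[chosen] += 1.0
        disc_sums[chosen] += reward
        choices.append(chosen)
    return choices


if __name__ == "__main__":
    # Arm 0 starts strong but degrades every time it is evaluated;
    # arm 1 is mediocre but stable.
    arms = [
        EvaluationDrivenArm(0.9, -0.01, seed=1),
        EvaluationDrivenArm(0.3, 0.0, seed=2),
    ]
    choices = discounted_ucb(arms, horizon=500)
    print("pulls per arm:", [arm.pulls for arm in arms])
    print("share of arm 1 in last 100 steps:", choices[-100:].count(1) / 100)
```

Because old rewards are discounted by `gamma` at every step, the estimate for the degrading arm tracks its recent behaviour, so the rule can shift towards the stable arm once the first one has been evaluated often enough. The values chosen for `gamma` and `exploration` are arbitrary and would need tuning for any real portfolio.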
Main file
main.pdf (312.91 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-00979456, version 1 (16-07-2014)

Identifiers

  • HAL Id: hal-00979456, version 1

Cite

David L. St-Pierre, Jialin Liu. Differential Evolution Algorithm Applied to Non-Stationary Bandit Problem. 2014 IEEE Congress on Evolutionary Computation (IEEE CEC 2014), Jul 2014, Beijing, China. ⟨hal-00979456⟩
