sign in
english version rss feed

inria-00164033, version 1

Change Point Detection and Meta-Bandits for Online Learning in Dynamic Environments

Cédric Hartland () 12, Nicolas Baskiotis 12, Sylvain Gelly 12, Michèle Sebag () 12, Olivier Teytaud 12

CAp (2007) 237-250

Abstract: Motivated by realtime website optimization, this paper is about online learning in abruptly changing environments. Two extensions of the UCBT algorithm are combined in order to handle dynamic multi-armed bandits, and specifically to cope with fast variations in the rewards. Firstly, a change point detection test based on Page-Hinkley statistics is used to overcome the limitations due to the UCBT inertia. Secondly, a controlled forgetting strategy dubbed Meta-Bandit is proposed to take care of the Exploration vs Exploitation trade-off when the PH test is triggered. Extensive empirical validation shows significant improvements compared to the baseline algorithms. The paper also investigates the sensitivity of the proposed algorithm with respect to the number of available options.

  • Domain : Computer Science/Artificial Intelligence
    Computer Science/Learning
    Mathematics/Statistics
    Statistics/Statistics Theory
  • Keywords : online learning – meta bandits – ucb – dynamic environments
 
  • inria-00164033, version 1
  • oai:hal.inria.fr:inria-00164033
  • From: 
  • Submitted on: Monday, 5 November 2007 10:18:05
  • Updated on: Monday, 5 November 2007 10:52:36
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...