Multi-objective Monte-Carlo Tree Search - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

Multi-objective Monte-Carlo Tree Search

Résumé

Concerned with multi-objective reinforcement learning (MORL), this paper presents MO-MCTS, an extension of Monte-Carlo Tree Search to multi-objective sequential decision making. The known multi-objective indicator referred to as hyper-volume indicator is used to define an action selection criterion, replacing the UCB criterion in order to deal with multi-dimensional rewards. MO-MCTS is firstly compared with an existing MORL algorithm on the artificial Deep Sea Treasure problem. Then a scalability study of MO-MCTS is made on the NP-hard problem of grid scheduling, showing that the performance of MO-MCTS matches the non RL-based state of the art albeit with a higher computational cost.
Fichier principal
Vignette du fichier
wang88.pdf (899.53 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00758379 , version 1 (28-11-2012)

Identifiants

  • HAL Id : hal-00758379 , version 1

Citer

Weijia Wang, Michèle Sebag. Multi-objective Monte-Carlo Tree Search. Asian Conference on Machine Learning, Nov 2012, Singapour, Singapore. pp.507-522. ⟨hal-00758379⟩
454 Consultations
784 Téléchargements

Partager

Gmail Facebook X LinkedIn More