Multi-objective Monte-Carlo Tree Search

Weijia Wang; Michèle Sebag

Conference paper, Year: 2012

Weijia Wang

Role: Author
PersonId : 930922

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Michèle Sebag

Role: Author
PersonId : 836537

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Abstract

This paper addresses multi-objective reinforcement learning (MORL) and presents MO-MCTS, an extension of Monte-Carlo Tree Search to multi-objective sequential decision making. The hyper-volume indicator, a well-known multi-objective indicator, is used to define an action selection criterion that replaces the UCB criterion in order to handle multi-dimensional rewards. MO-MCTS is first compared with an existing MORL algorithm on the artificial Deep Sea Treasure problem. A scalability study is then conducted on the NP-hard problem of grid scheduling, showing that the performance of MO-MCTS matches the non-RL-based state of the art, albeit at a higher computational cost.
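
To make the hyper-volume indicator mentioned in the abstract concrete, the following is a minimal Python sketch for the two-objective maximisation case. It is not the authors' implementation: the function names `hypervolume_2d` and `hypervolume_gain`, the reference point `ref`, and the idea of scoring a candidate reward vector by the hyper-volume it adds to an archive are all assumptions made for illustration, and the selection criterion actually used in MO-MCTS may differ.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def hypervolume_2d(points: List[Point], ref: Point) -> float:
    """Area dominated by `points` and bounded below by `ref`.

    Both objectives are maximised. Points that are not strictly
    better than `ref` in both objectives contribute nothing.
    """
    # Keep only points that lie strictly above the reference point.
    pts = [p for p in points if p[0] > ref[0] and p[1] > ref[1]]
    # Sweep from the largest first objective to the smallest.
    pts.sort(key=lambda p: p[0], reverse=True)

    area = 0.0
    best_y = ref[1]  # highest second objective seen so far
    for x, y in pts:
        if y > best_y:
            # Add the horizontal slab not covered by earlier points.
            area += (x - ref[0]) * (y - best_y)
            best_y = y
    return area


def hypervolume_gain(archive: List[Point], candidate: Point, ref: Point) -> float:
    """Hypothetical score: hyper-volume added by `candidate` to `archive`."""
    return hypervolume_2d(archive + [candidate], ref) - hypervolume_2d(archive, ref)


if __name__ == "__main__":
    archive = [(3.0, 1.0), (1.0, 3.0)]
    ref = (0.0, 0.0)
    print(hypervolume_2d(archive, ref))
    print(hypervolume_gain(archive, (2.0, 2.0), ref))
    print(hypervolume_gain(archive, (1.0, 1.0), ref))
```

In this sketch a dominated candidate such as `(1.0, 1.0)` adds no hyper-volume, whereas a candidate that extends the front adds a positive amount, which is what allows a scalar ranking of vector-valued rewards.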

Domains

Machine Learning [cs.LG]

Main file

wang88.pdf (899.53 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-00758379

Submitted on: Wednesday, 28 November 2012, 16:06:33

Last modified on: Monday, 12 February 2024, 09:48:04

Long-term archiving on: Saturday, 17 December 2016, 16:23:00

Dates and versions

hal-00758379 , version 1 (28-11-2012)

Identifiers

HAL Id : hal-00758379 , version 1

Cite

Weijia Wang, Michèle Sebag. Multi-objective Monte-Carlo Tree Search. Asian Conference on Machine Learning, Nov 2012, Singapore, Singapore. pp.507-522. ⟨hal-00758379⟩

Collections

EC-PARIS CNRS INRIA UMR8623 INRIA2 LRI-AO UNIV-PARIS-SACLAY

454 Views

784 Downloads
