Hypervolume indicator and dominance reward based multi-objective Monte-Carlo Tree Search

Weijia Wang; Michèle Sebag

doi:10.1007/s10994-013-5369-0

Article Dans Une Revue Machine Learning Année : 2013

Hypervolume indicator and dominance reward based multi-objective Monte-Carlo Tree Search

(1, 2) , (1, 2)

1
2

Weijia Wang

Fonction : Auteur
PersonId : 930922

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Michèle Sebag

Fonction : Auteur
PersonId : 836537

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Résumé

Concerned with multi-objective reinforcement learning (MORL), this paper presents MOMCTS, an extension of Monte-Carlo Tree Search to multi-objective sequential decision making, embedding two decision rules respectively based on the hypervolume indicator and the Pareto dominance reward. The MOMCTS approaches are firstly compared with the MORL state of the art on two artificial problems, the two-objective Deep Sea Treasure problem and the three-objective Resource Gathering problem. The scalability of MOMCTS is also examined in the context of the NP-hard grid scheduling problem, showing that the MOMCTS performance matches the (non-RL based) state of the art albeit with a higher computational cost.

Mots clés

reinforcement learning Monte-Carlo tree search multi-objective optimization sequential decision making

Domaines

Intelligence artificielle [cs.AI] Informatique Optimisation et contrôle [math.OC]

Fichier principal

acmlSIrevised.pdf (1.78 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Weijia Wang : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00852048

Soumis le : lundi 19 août 2013-16:15:35

Dernière modification le : jeudi 18 avril 2024-16:29:00

Archivage à long terme le : mercredi 5 avril 2017-21:54:19

Dates et versions

hal-00852048 , version 1 (19-08-2013)

Identifiants

HAL Id : hal-00852048 , version 1
DOI : 10.1007/s10994-013-5369-0

Citer

Weijia Wang, Michèle Sebag. Hypervolume indicator and dominance reward based multi-objective Monte-Carlo Tree Search. Machine Learning, 2013, 92 (2-3), pp.403-429. ⟨10.1007/s10994-013-5369-0⟩. ⟨hal-00852048⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EC-PARIS CNRS INRIA UMR8623 INRIA2 LRI-AO TDS-MACS UNIV-PARIS-SACLAY

297 Consultations

836 Téléchargements

Hypervolume indicator and dominance reward based multi-objective Monte-Carlo Tree Search

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager