Bandit-based Search for Constraint Programming

Manuel Loth; Michèle Sebag; Youssef Hamadi; Marc Schoenauer

Communication Dans Un Congrès Année : 2013

Bandit-based Search for Constraint Programming

(1) , (2) , (1, 3, 4) , (2, 5)

1
2
3
4
5

Manuel Loth

Fonction : Auteur
PersonId : 836853

Microsoft Research - Inria Joint Centre

Michèle Sebag

Fonction : Auteur
PersonId : 836537

Laboratoire de Recherche en Informatique

Youssef Hamadi

Fonction : Auteur
PersonId : 840368

Microsoft Research - Inria Joint Centre

Laboratoire d'informatique de l'École polytechnique [Palaiseau]

Microsoft Research [Cambridge]

Marc Schoenauer

Fonction : Auteur
PersonId : 739309
IdHAL : evomarc
ORCID : 0000-0003-1450-6830
IdRef : 057775575

Laboratoire de Recherche en Informatique

Machine Learning and Optimisation

Résumé

Constraint Programming (CP) solvers classically explore the solution space using tree-search based heuristics. Monte-Carlo Tree-Search (MCTS) is a method aimed at optimal sequential decision making under uncertainty. It simultaneously estimates node values (with respect to some reward function) by Monte-Carlo trials and uses them to bias the exploration towards the most promising regions of the tree, borrowing the multi-armed-bandit decision rule. At the crossroads of CP and MCTS, this paper presents the Bandit Search for Constraint Programming (BASCOP) algorithm, adapting MCTS to the specifics of CP search trees. These adaptations concern i) the design of a generic reward function suited for CP problems; ii) the replacement of Monte-Carlo trials by iterations of a complete depth-first-search procedure; iii) the ability to take into account an existing value-ordering heuristics; iv) the aggregation of statistics in order to handle multiple restarts. BASCOP, using Gecode as the underlying constraint solver, shows significant improvements over the depth-first-search baseline on some CP benchmark suites, demonstrating its potential as a generic yet robust search method for CP.

Domaines

Apprentissage [cs.LG]

Fichier principal

paper123.pdf (184.44 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Manuel Loth : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00863451

Soumis le : mercredi 18 septembre 2013-21:43:32

Dernière modification le : lundi 12 février 2024-09:48:04

Archivage à long terme le : vendredi 20 décembre 2013-15:04:35

Dates et versions

hal-00863451 , version 1 (18-09-2013)

Identifiants

HAL Id : hal-00863451 , version 1

Citer

Manuel Loth, Michèle Sebag, Youssef Hamadi, Marc Schoenauer. Bandit-based Search for Constraint Programming. International Conference on Principles and Practice of Constraint Programming, Sep 2013, Uppsala, Sweden. pp.464-480. ⟨hal-00863451⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

X EC-PARIS UNIV-RENNES1 CNRS INRIA IRISA LIX X-LIX X-DEP-INFO UMR8623 INRIA2 LRI-AO UR1-MATH-STIC UNIV-PARIS-SACLAY UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

470 Consultations

1173 Téléchargements

Bandit-based Search for Constraint Programming

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager