Alternating Optimisation and Quadrature for Robust Control

Supratik Paul; Konstantinos Chatzilygeroudis; Kamil Ciosek; Jean-Baptiste Mouret; Michael A Osborne; Shimon Whiteson

Communication Dans Un Congrès Année : 2018

Alternating Optimisation and Quadrature for Robust Control

(1) , (2) , (1) , (2) , (1) , (1)

1
2

Supratik Paul

Fonction : Auteur

University of Oxford

Konstantinos Chatzilygeroudis

Fonction : Auteur
PersonId : 10921
IdHAL : konstantinos-chatzilygeroudis
ORCID : 0000-0003-3585-1027
IdRef : 234845414

Lifelong Autonomy and interaction skills for Robots in a Sensing ENvironment

Kamil Ciosek

Fonction : Auteur

University of Oxford

Jean-Baptiste Mouret

Fonction : Auteur
PersonId : 1495
IdHAL : jb-mouret
ORCID : 0000-0002-2513-027X
IdRef : 137470002

Lifelong Autonomy and interaction skills for Robots in a Sensing ENvironment

Michael A Osborne

Fonction : Auteur

University of Oxford

Shimon Whiteson

Fonction : Auteur

University of Oxford

Résumé

Bayesian optimisation has been successfully applied to a variety of reinforcement learning problems. However, the traditional approach for learning optimal policies in simulators does not utilise the opportunity to improve learning by adjusting certain environment variables - state features that are randomly determined by the environment in a physical setting but are controllable in a simulator. This paper considers the problem of finding an optimal policy while taking into account the impact of environment variables. We present alternating optimisation and quadrature (ALOQ), which uses Bayesian optimisation and Bayesian quadrature to address such settings. ALOQ is robust to the presence of significant rare events, which may not be observable under random sampling, but have a considerable impact on determining the optimal policy. We provide experimental results demonstrating our approach learning more efficiently than existing methods.

Domaines

Automatique / Robotique Robotique [cs.RO] Intelligence artificielle [cs.AI]

Fichier principal

ALOQ_AAAI18_final.pdf (574.53 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Jean-Baptiste Mouret : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01644063

Soumis le : mercredi 22 novembre 2017-11:16:41

Dernière modification le : jeudi 1 février 2024-10:03:48

Dates et versions

hal-01644063 , version 1 (22-11-2017)

Identifiants

HAL Id : hal-01644063 , version 1
ARXIV : 1605.07496

Citer

Supratik Paul, Konstantinos Chatzilygeroudis, Kamil Ciosek, Jean-Baptiste Mouret, Michael A Osborne, et al.. Alternating Optimisation and Quadrature for Robust Control. AAAI 2018 - The Thirty-Second AAAI Conference on Artificial Intelligence, Feb 2018, New Orleans, United States. ⟨hal-01644063⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 CNRS INRIA IRISA UNIV-LORRAINE INRIA2 TDS-MACS LORIA LORIA-AIS UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM CREATIV-LAB

295 Consultations

218 Téléchargements

Alternating Optimisation and Quadrature for Robust Control

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager