Derivative-free & order-robust optimisation

Victor Gabillon; Rasul Tutunov; Michal Valko; Haitham Bou Ammar

Conference paper, Year: 2020

Victor Gabillon

Role: Author

Huawei R&D [United Kingdom]

Rasul Tutunov

Role: Author

Huawei R&D [United Kingdom]

Michal Valko

Role: Author
PersonId: 284
IdHAL: michal
IdRef: 22360934X

DeepMind [Paris]

Haitham Bou Ammar

Role: Author

Huawei R&D [United Kingdom]

Abstract

In this paper, we formalise order-robust optimisation as an instance of online learning minimising simple regret, and propose VROOM, a zeroth-order optimisation algorithm capable of achieving vanishing regret in non-stationary environments, while recovering favourable rates under stochastic reward-generating processes. Our results are the first to target simple regret definitions in adversarial scenarios, unveiling a challenge that has rarely been considered in prior work.

Domains

Machine Learning [stat.ML]

Main file

gabillon2020derivative-free.pdf (433.36 KB)

Origin: Files produced by the author(s)

Contributor: Michal Valko

https://inria.hal.science/hal-03288939

Submitted on: Friday, 16 July 2021, 15:42:02

Last modified on: Friday, 5 November 2021, 16:12:29

Long-term archiving on: Sunday, 17 October 2021, 18:51:26

Dates and versions

hal-03288939, version 1 (16-07-2021)

Identifiers

HAL Id: hal-03288939, version 1

Cite

Victor Gabillon, Rasul Tutunov, Michal Valko, Haitham Bou Ammar. Derivative-free & order-robust optimisation. International Conference on Artificial Intelligence and Statistics, Aug 2020, Palermo / Virtual, Italy. ⟨hal-03288939⟩
