Online non-convex optimization with imperfect feedback - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2020

Online non-convex optimization with imperfect feedback

Amélie Héliou
  • Fonction : Auteur
  • PersonId : 1084692
Matthieu Martin
Thibaud Rahier
  • Fonction : Auteur
  • PersonId : 1084694

Résumé

We consider the problem of online learning with non-convex losses. In terms of feedback, we assume that the learner observes – or otherwise constructs – an inexact model for the loss function encountered at each stage, and we propose a mixed-strategy learning policy based on dual averaging. In this general context, we derive a series of tight regret minimization guarantees, both for the learner’s static (external) regret, as well as the regret incurred against the best dynamic policy in hindsight. Subsequently, we apply this general template to the case where the learner only has access to the actual loss incurred at each stage of the process. This is achieved by means of a kernel-based estimator which generates an inexact model for each round’s loss function using only the learner’s realized losses as input.
Fichier principal
Vignette du fichier
OnlineNonConvex-NIPS.pdf (952.43 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03043746 , version 1 (07-12-2020)

Identifiants

  • HAL Id : hal-03043746 , version 1

Citer

Amélie Héliou, Matthieu Martin, Panayotis Mertikopoulos, Thibaud Rahier. Online non-convex optimization with imperfect feedback. NeurIPS 2020 - 34th International Conference on Neural Information Processing Systems, 2020, Vancouver, Canada. ⟨hal-03043746⟩
83 Consultations
290 Téléchargements

Partager

Gmail Facebook X LinkedIn More