Learning with stochastic inputs and adversarial outputs

Alessandro Lazaric; Rémi Munos

doi:10.1016/j.jcss.2011.12.027

Article Dans Une Revue Journal of Computer and System Sciences Année : 2012

Learning with stochastic inputs and adversarial outputs

(1) , (1)

Alessandro Lazaric

Fonction : Auteur
PersonId : 851
IdHAL : alessandro-lazaric
ORCID : 0000-0002-8970-413X
IdRef : 188701486

Sequential Learning

Rémi Munos

Fonction : Auteur
PersonId : 836863

Sequential Learning

Résumé

Most of the research in online learning is focused either on the problem of adversarial classification (i.e., both inputs and labels are arbitrarily chosen by an adversary) or on the traditional supervised learning problem in which samples are independent and identically distributed according to a stationary probability distribution. Nonetheless, in a number of domains the relationship between inputs and outputs may be adversarial, whereas input instances are i.i.d. from a stationary distribution (e.g., user preferences). This scenario can be formalized as a learning problem with stochastic inputs and adversarial outputs. In this paper, we introduce this novel stochastic-adversarial learning setting and we analyze its learnability. In particular, we show that in a binary classification problem over an horizon of $n$ rounds, given a hypothesis space $\mathscr{H}$ with finite VC-dimension, it is possible to design an algorithm that incrementally builds a suitable finite set of hypotheses from $\mathscr{H}$ used as input for an exponentially weighted forecaster and achieves a cumulative regret of order $O(\sqrt{n VC$\mathscr{H} log n})$ with overwhelming probability. This result shows that whenever inputs are i.i.d. it is possible to solve any binary classification problem using a finite VC-dimension hypothesis space with a sub-linear regret independently from the way labels are generated (either stochastic or adversarial). We also discuss extensions to multi-class classification, regression, learning from experts and bandit settings with stochastic side information, and application to games.

Mots clés

Online learning Hybrid stochastic-adversarial learning

Domaines

Machine Learning [stat.ML]

Fichier principal

00-estochad-alex.pdf (471.06 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Alessandro Lazaric : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00772046

Soumis le : jeudi 10 janvier 2013-11:25:11

Dernière modification le : jeudi 15 février 2024-03:32:00

Archivage à long terme le : samedi 1 avril 2017-02:53:54

Dates et versions

hal-00772046 , version 1 (10-01-2013)

Identifiants

HAL Id : hal-00772046 , version 1
DOI : 10.1016/j.jcss.2011.12.027

Citer

Alessandro Lazaric, Rémi Munos. Learning with stochastic inputs and adversarial outputs. Journal of Computer and System Sciences, 2012, 78 (5), pp.1516-1537. ⟨10.1016/j.jcss.2011.12.027⟩. ⟨hal-00772046⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 UNIV-LILLE3 CNRS INRIA IRISA LAGIS INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

333 Consultations

200 Téléchargements

Learning with stochastic inputs and adversarial outputs

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager