Scaling-up Empirical Risk Minimization: Optimization of Incomplete U-statistics

Stéphan Clémençon; Igor Colin; Aurélien Bellet

Article Dans Une Revue Journal of Machine Learning Research Année : 2016

Scaling-up Empirical Risk Minimization: Optimization of Incomplete U-statistics

(1) , (1) , (2)

1
2

Stéphan Clémençon

Fonction : Auteur
PersonId : 174491
IdHAL : stephan-clemencon
ORCID : 0000-0002-5879-9500
IdRef : 08905203X

Laboratoire Traitement et Communication de l'Information

Igor Colin

Fonction : Auteur
PersonId : 983143

Laboratoire Traitement et Communication de l'Information

Aurélien Bellet

Fonction : Auteur
PersonId : 9877
IdHAL : aurelien-bellet
ORCID : 0000-0003-3440-1251
IdRef : 17653136X

Machine Learning in Information Networks

Résumé

In a wide range of statistical learning problems such as ranking, clustering or metric learning among others, the risk is accurately estimated by U-statistics of degree d ≥ 1, i.e. functionals of the training data with low variance that take the form of averages over k-tuples. From a computational perspective, the calculation of such statistics is highly expensive even for a moderate sample size n, as it requires averaging O(n^d) terms. This makes learning procedures relying on the optimization of such data functionals hardly feasible in practice. It is the major goal of this paper to show that, strikingly, such empirical risks can be replaced by drastically computationally simpler Monte-Carlo estimates based on O(n) terms only, usually referred to as incomplete U-statistics, without damaging the O(1/√n) learning rate of Empirical Risk Minimization (ERM) procedures. For this purpose, we establish uniform deviation results describing the error made when approximating a U-process by its incomplete version under appropriate complexity assumptions. Extensions to model selection, fast rate situations and various sampling techniques are also considered , as well as an application to stochastic gradient descent for ERM. Finally, numerical examples are displayed in order to provide strong empirical evidence that the approach we promote largely surpasses more naive subsampling techniques.

Mots clés

rate bound analysis U-processes stochastic gradient descent sampling design big data empirical risk minimization

Domaines

Apprentissage [cs.LG] Machine Learning [stat.ML]

Fichier principal

jmlr16.pdf (682.24 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Aurélien Bellet : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01327662

Soumis le : lundi 6 juin 2016-23:25:37

Dernière modification le : mercredi 24 janvier 2024-09:54:24

Dates et versions

hal-01327662 , version 1 (06-06-2016)

Identifiants

HAL Id : hal-01327662 , version 1
ARXIV : 1501.02629

Citer

Stéphan Clémençon, Igor Colin, Aurélien Bellet. Scaling-up Empirical Risk Minimization: Optimization of Incomplete U-statistics. Journal of Machine Learning Research, 2016, 17 (76), pp.1-36. ⟨hal-01327662⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INSTITUT-TELECOM UNIV-LILLE3 CNRS INRIA PARISTECH CRISTAL INRIA2 CRISTAL-MAGNET UNIV-LILLE LTCI IDS S2A

244 Consultations

124 Téléchargements

Scaling-up Empirical Risk Minimization: Optimization of Incomplete U-statistics

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Relations

Exporter

Collections

Altmetric

Partager