Journal article in Entropy, 2021

Differentiable PAC-Bayes Objectives with Partially Aggregated Neural Networks

Abstract

We make three related contributions motivated by the challenge of training stochastic neural networks, particularly in a PAC-Bayesian setting: (1) we show how averaging over an ensemble of stochastic neural networks enables a new class of partially-aggregated estimators; (2) we show that these lead to provably lower-variance gradient estimates for non-differentiable signed-output networks; (3) we reformulate a PAC-Bayesian bound for these networks to derive a directly optimisable, differentiable objective and a generalisation guarantee, without using a surrogate loss or loosening the bound. This bound is twice as tight as that of Letarte et al. (2019) on a similar network type. We show empirically that these innovations make training easier and lead to competitive guarantees.
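The key primitive behind such aggregation is that a single sign-activation unit can be averaged in closed form over a Gaussian weight distribution (as in Letarte et al., 2019): for w ~ N(μ, I), E[sign(w·x)] = erf(μ·x / (√2‖x‖)), a smooth, differentiable function of μ. Below is a minimal NumPy/SciPy sketch contrasting this closed form with a naive Monte Carlo sample of the hard sign; the function names are illustrative, not the authors' code.

```python
import numpy as np
from scipy.special import erf

def aggregated_sign_output(mu, x):
    # For weights w ~ N(mu, I), the aggregated (expected) output of a
    # sign-activation unit has the closed form
    #   E[sign(w @ x)] = erf(mu @ x / (sqrt(2) * ||x||)),
    # which is smooth and differentiable in mu.
    return erf(x @ mu / (np.sqrt(2.0) * np.linalg.norm(x)))

def sampled_sign_output(mu, x, rng, n_samples=1):
    # Naive Monte Carlo estimate of the same expectation: each sample
    # is a hard sign, so it is non-differentiable in mu and has higher
    # variance than the closed form above.
    w = mu + rng.standard_normal((n_samples, mu.size))
    return np.sign(w @ x).mean()

rng = np.random.default_rng(0)
mu, x = rng.standard_normal(10), rng.standard_normal(10)
print(aggregated_sign_output(mu, x))            # exact, differentiable in mu
print(sampled_sign_output(mu, x, rng, 10_000))  # noisy MC estimate of the same quantity
```

Because the aggregated form is differentiable, it can be plugged directly into a gradient-based PAC-Bayes objective without a surrogate loss, which is the role the partially-aggregated estimators play in the paper.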
Main file: 2006.12228.pdf (253.54 KB). Origin: files produced by the author(s).

Dates and versions

hal-02879216, version 1 (23-06-2020)

Identifiers

Cite

Felix Biggs, Benjamin Guedj. Differentiable PAC-Bayes Objectives with Partially Aggregated Neural Networks. Entropy, 2021, ⟨10.3390/e23101280⟩. ⟨hal-02879216⟩