Deep Equals Shallow for ReLU Networks in Kernel Regimes

Alberto Bietti; Francis Bach

Conference paper, Year: 2021


Alberto Bietti

Role: Author
PersonId : 1078962

Apprentissage de modèles à partir de données massives

New York University [New York]

Francis Bach

Role: Author
PersonId : 863086

Statistical Machine Learning and Parsimony

Abstract

Deep networks are often considered to be more expressive than shallow ones in terms of approximation. Indeed, certain functions can be approximated by deep networks provably more efficiently than by shallow ones; however, no tractable algorithms are known for learning such deep models. Separately, a recent line of work has shown that deep networks trained with gradient descent may behave like (tractable) kernel methods in a certain over-parameterized regime, where the kernel is determined by the architecture and initialization. This paper focuses on approximation for such kernels. We show that for ReLU activations, the kernels derived from deep fully-connected networks have essentially the same approximation properties as their "shallow" two-layer counterparts, namely the same eigenvalue decay for the corresponding integral operator. This highlights the limitations of the kernel framework for understanding the benefits of such deep architectures. Our main theoretical result relies on characterizing such eigenvalue decays through differentiability properties of the kernel function, a characterization that also applies easily to the study of other kernels defined on the sphere.
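The eigenvalue-decay comparison described in the abstract can be explored numerically. The following is a minimal sketch, not the authors' code. It assumes the commonly used closed forms of the degree-0 and degree-1 arc-cosine kernels and the usual recursion for the ReLU neural tangent kernel on the sphere, none of which are stated on this page. The names `kappa0`, `kappa1`, `relu_ntk`, and `circle_spectrum` are hypothetical, and the input dimension is fixed to 2 (the unit circle) so that the spectrum reduces to a discrete Fourier transform.

```python
import numpy as np


def kappa0(u):
    # Degree-0 arc-cosine kernel (assumed form, not given on this page).
    u = np.clip(u, -1.0, 1.0)
    return (np.pi - np.arccos(u)) / np.pi


def kappa1(u):
    # Degree-1 arc-cosine kernel (assumed form, not given on this page).
    u = np.clip(u, -1.0, 1.0)
    return (u * (np.pi - np.arccos(u)) + np.sqrt(1.0 - u * u)) / np.pi


def relu_ntk(u, depth):
    # Assumed NTK recursion for a fully-connected ReLU network with `depth`
    # layers, as a function of the cosine u between two unit-norm inputs.
    # depth=2 gives the two-layer kernel u * kappa0(u) + kappa1(u).
    u = np.asarray(u, dtype=float)
    k = u       # kernel of the current layer's features
    k_ntk = u   # tangent kernel accumulated so far
    for _ in range(depth - 1):
        k_ntk = k_ntk * kappa0(k) + kappa1(k)
        k = kappa1(k)
    return k_ntk


def circle_spectrum(depth, n=4096):
    # On n equispaced points of the unit circle the Gram matrix is circulant,
    # so its eigenvalues are the DFT of its first row. Dividing by n
    # approximates the Fourier coefficients of theta -> kernel(cos(theta)).
    theta = 2.0 * np.pi * np.arange(n) / n
    row = relu_ntk(np.cos(theta), depth)
    return np.fft.rfft(row).real / n


if __name__ == "__main__":
    shallow = circle_spectrum(depth=2)
    deep = circle_spectrum(depth=5)
    # Compare only even frequencies: the two-layer kernel is expected to have
    # vanishing coefficients at most odd frequencies.
    for k in (4, 8, 16, 32, 64):
        print(k, shallow[k], deep[k], deep[k] / shallow[k])
```

If the decay rates match as the abstract claims, the printed ratio should stay within a bounded range as the frequency grows rather than tending to zero or infinity. This is a numerical illustration under the stated assumptions, not a substitute for the paper's proof.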

Domains

Machine Learning [stat.ML]; Machine Learning [cs.LG]

Main file

main.pdf (465.95 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/hal-02963250

Submitted on: Wednesday, March 17, 2021, 23:29:12

Last modified on: Saturday, April 27, 2024, 03:15:33

Dates and versions

hal-02963250 , version 1 (09-10-2020)

hal-02963250 , version 2 (17-03-2021)

Identifiers

HAL Id : hal-02963250 , version 2
ARXIV : 2009.14397

Cite

Alberto Bietti, Francis Bach. Deep Equals Shallow for ReLU Networks in Kernel Regimes. ICLR 2021 - International Conference on Learning Representations, May 2021, Virtual, Austria. pp.1-22. ⟨hal-02963250v2⟩

Collections

ENS-PARIS UGA CNRS INRIA INSMI LJK LJK_GI INRIA2 LJK-GI-THOTH PSL ANR PRAIRIE-IA

3958 views

631 downloads
