Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

Deep Equals Shallow for ReLU Networks in Kernel Regimes

Alberto Bietti 1 Francis Bach 1
1 SIERRA - Statistical Machine Learning and Parsimony
DI-ENS - Département d'informatique de l'École normale supérieure, CNRS - Centre National de la Recherche Scientifique, Inria de Paris
Abstract : Deep networks are often considered to be more expressive than shallow ones in terms of approximation. Indeed, certain functions can be approximated by deep networks provably more efficiently than by shallow ones, however, no tractable algorithms are known for learning such deep models. Separately, a recent line of work has shown that deep networks trained with gradient descent may behave like (tractable) kernel methods in a certain over-parameterized regime, where the kernel is determined by the architecture and initialization, and this paper focuses on approximation for such kernels. We show that for ReLU activations, the kernels derived from deep fully-connected networks have essentially the same approximation properties as their "shallow" two-layer counterpart, namely the same eigenvalue decay for the corresponding integral operator. This highlights the limitations of the kernel framework for understanding the benefits of such deep architectures. Our main theoretical result relies on characterizing such eigenvalue decays through differentiability properties of the kernel function, which also easily applies to the study of other kernels defined on the sphere.
Document type :
Preprints, Working Papers, ...
Complete list of metadatas

Cited literature [48 references]  Display  Hide  Download

https://hal.inria.fr/hal-02963250
Contributor : Alberto Bietti <>
Submitted on : Friday, October 9, 2020 - 10:10:04 PM
Last modification on : Tuesday, October 13, 2020 - 3:37:50 AM

File

main.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02963250, version 1
  • ARXIV : 2009.14397

Collections

Citation

Alberto Bietti, Francis Bach. Deep Equals Shallow for ReLU Networks in Kernel Regimes. 2020. ⟨hal-02963250⟩

Share

Metrics

Record views

384

Files downloads

27