Gaussian Pre-Activations in Neural Networks: Myth or Reality?

Pierre Wolinski; Julyan Arbel

doi:10.48550/arXiv.2205.12379

Preprints, Working Papers, ... (Preprint) Year : 2023

Gaussian Pre-Activations in Neural Networks: Myth or Reality?

(1) , (1)

Pierre Wolinski

Function : Author
PersonId : 177477
IdHAL : pierre-wolinski
ORCID : 0000-0003-1007-0144
IdRef : 245386297

Modèles statistiques bayésiens et des valeurs extrêmes pour données structurées et de grande dimension

Julyan Arbel

Function : Author
PersonId : 5183
IdHAL : julyanarbel
ORCID : 0000-0002-2525-4416
IdRef : 178641936

Modèles statistiques bayésiens et des valeurs extrêmes pour données structurées et de grande dimension

Abstract

The study of feature propagation at initialization in neural networks lies at the root of numerous initialization designs. An assumption very commonly made in the field states that the pre-activations are Gaussian. Although this convenient Gaussian hypothesis can be justified when the number of neurons per layer tends to infinity, it is challenged by both theoretical and experimental works for finite-width neural networks. Our major contribution is to construct a family of pairs of activation functions and initialization distributions that ensure that the pre-activations remain Gaussian throughout the network’s depth, even in narrow neural networks. In the process, we discover a set of constraints that a neural network should fulfill to ensure Gaussian pre-activations. Additionally, we provide a critical review of the claims of the Edge of Chaos line of works and build an exact Edge of Chaos analysis. We also propose a unified view on pre-activations propagation, encompassing the framework of several well-known initialization procedures. Finally, our work provides a principled framework for answering the much-debated question: is it desirable to initialize the training of a neural network whose pre-activations are ensured to be Gaussian?

Keywords

Neural netwoks Statistics

Domains

Machine Learning [cs.LG] Machine Learning [stat.ML]

Fichier principal

2205.12379v3.pdf (3.12 Mo)

Origin : Files produced by the author(s)

Pierre Wolinski : Connect in order to contact the contributor

https://hal.science/hal-03933169

Submitted on : Friday, March 3, 2023-3:27:10 PM

Last modification on : Saturday, April 27, 2024-3:15:55 AM

Dates and versions

hal-03933169 , version 1 (10-01-2023)

hal-03933169 , version 2 (03-03-2023)

Licence

Attribution

Identifiers

HAL Id : hal-03933169 , version 2
ARXIV : 2205.12379v3
DOI : 10.48550/arXiv.2205.12379

Cite

Pierre Wolinski, Julyan Arbel. Gaussian Pre-Activations in Neural Networks: Myth or Reality?. 2023. ⟨hal-03933169v2⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UGA CNRS INRIA INSMI LJK LJK_PS INRIA2 LJK-PS-STATIFY ANR

47 View

53 Download

Gaussian Pre-Activations in Neural Networks: Myth or Reality?

Abstract

Keywords

Domains

Dates and versions

Licence

Identifiers

Cite

Export

Collections

Altmetric

Share