Conference paper, 2024

Training and Generalization Errors for Underparameterized Neural Networks

Abstract

The theory of the Neural Tangent Kernel explains why the training error of overparameterized networks converges linearly to 0. In this work, we focus instead on small (underparameterized) networks. An advantage of small networks is that they are faster to train while retaining sufficient precision to perform useful tasks in many applications. Our main theoretical contribution is to prove that the training error of small networks converges linearly to a non-null constant, for which we give a precise estimate. We verify this result on a 10-neuron neural network that simulates a Model Predictive Controller. We also observe that an upper bound on the generalization error follows a double-peak curve as the number of training samples increases.
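
To illustrate the plateau phenomenon described in the abstract, here is a minimal, hypothetical sketch (not taken from the paper): it trains a one-hidden-layer network with 10 neurons by full-batch gradient descent on a synthetic regression task and prints the training error, which settles at a non-zero constant rather than decaying to 0. The data, activation, initialization, and learning rate are all illustrative assumptions, not the authors' setup.

```python
import numpy as np

# Hypothetical sketch (not the authors' code): with only 10 hidden
# neurons and many more training samples than parameters, the training
# MSE plateaus at a non-null constant instead of converging to 0.

rng = np.random.default_rng(0)

# Assumed synthetic regression task with more samples than the network
# can interpolate (underparameterized regime).
n_samples, n_hidden = 200, 10
X = rng.uniform(-1.0, 1.0, size=(n_samples, 1))
y = np.sin(3.0 * X) + 0.1 * rng.standard_normal((n_samples, 1))

# One hidden layer, tanh activation, scalar output.
W1 = rng.standard_normal((1, n_hidden)) / np.sqrt(n_hidden)
b1 = np.zeros(n_hidden)
W2 = rng.standard_normal((n_hidden, 1)) / np.sqrt(n_hidden)
b2 = np.zeros(1)

lr = 0.1  # assumed step size for full-batch gradient descent
for step in range(20001):
    # Forward pass.
    H = np.tanh(X @ W1 + b1)        # hidden activations, shape (n, 10)
    pred = H @ W2 + b2              # network output, shape (n, 1)
    err = pred - y
    loss = float(np.mean(err ** 2))

    # Backward pass: gradients of the mean-squared error.
    g_pred = 2.0 * err / n_samples
    gW2 = H.T @ g_pred
    gb2 = g_pred.sum(axis=0)
    gH = g_pred @ W2.T
    gZ = gH * (1.0 - H ** 2)        # derivative of tanh
    gW1 = X.T @ gZ
    gb1 = gZ.sum(axis=0)

    # Gradient-descent update.
    W1 -= lr * gW1; b1 -= lr * gb1
    W2 -= lr * gW2; b2 -= lr * gb2

    if step % 5000 == 0:
        print(f"step {step:6d}  training MSE = {loss:.6f}")
```

In this sketch the printed MSE stops improving well above 0, which is the qualitative behavior the paper quantifies; the paper additionally proves that the convergence toward this constant is linear and gives a precise estimate of its value.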

Dates and versions

hal-04423901, version 1 (29-01-2024)
hal-04423901, version 2 (13-03-2024)

Identifiers

  • HAL Id: hal-04423901, version 1

Cite

Daniel Martin Xavier, Ludovic Chamoin, Laurent Fribourg. Training and Generalization Errors for Underparameterized Neural Networks. 2024 American Control Conference, Jul 2024, Toronto, Canada. ⟨hal-04423901v1⟩