Regression as Classification: Influence of Task Formulation on Neural Network Features - Inria - Institut national de recherche en sciences et technologies du numérique
Conference paper, year: 2023

Regression as Classification: Influence of Task Formulation on Neural Network Features

Lawrence Stewart
  • Role: Author
  • PersonId : 1184478
Francis Bach
  • Role: Author
  • PersonId : 863086
Jean-Philippe Vert
  • Role: Author
  • PersonId : 1060917

Abstract

Neural networks can be trained to solve regression problems by using gradient-based methods to minimize the square loss. However, practitioners often prefer to reformulate regression as a classification problem, observing that training on the cross-entropy loss results in better performance. By focusing on two-layer ReLU networks, which can be fully characterized by measures over their feature space, we explore how the implicit bias induced by gradient-based optimization could partly explain the above phenomenon. We provide theoretical evidence that the regression formulation yields a measure whose support can differ greatly from that for classification, in the case of one-dimensional data. Our proposed optimal supports correspond directly to the features learned by the input layer of the network. The different nature of these supports sheds light on possible optimization difficulties the square loss could encounter during training, and we present empirical results illustrating this phenomenon.
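
To make the two task formulations in the abstract concrete, here is a minimal sketch in plain Python. It is not the authors' code: the equal-width binning, the decoding by expected bin center, and all function names (`make_bins`, `target_to_class`, `decode`) are assumptions chosen for illustration, since this page does not describe how the paper discretizes targets.

```python
import math

def make_bins(y_min, y_max, num_bins):
    """Centers of num_bins equal-width bins covering [y_min, y_max] (assumed scheme)."""
    width = (y_max - y_min) / num_bins
    return [y_min + (k + 0.5) * width for k in range(num_bins)]

def target_to_class(y, y_min, y_max, num_bins):
    """Map a real-valued target to the index of the bin that contains it."""
    width = (y_max - y_min) / num_bins
    k = int((y - y_min) / width)
    return min(max(k, 0), num_bins - 1)  # clamp so y == y_max lands in the last bin

def softmax(logits):
    """Numerically stable softmax over a list of class scores."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def square_loss(prediction, y):
    """Regression formulation: the network outputs one real number."""
    return (prediction - y) ** 2

def cross_entropy(logits, class_index):
    """Classification formulation: the network outputs one score per bin."""
    return -math.log(softmax(logits)[class_index])

def decode(logits, centers):
    """Turn class scores back into a real-valued prediction (expected bin center)."""
    return sum(p * c for p, c in zip(softmax(logits), centers))
```

With four bins on [0, 1], a target of 0.3 becomes class 1, and a network that outputs uniform scores decodes to the midpoint 0.5. In the regression formulation the same network would instead output a single value and be trained with `square_loss`.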
Main file
sample_paper.pdf (1.26 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-03846706 , version 1 (10-11-2022)
hal-03846706 , version 2 (23-02-2023)

License

Attribution

Cite

Lawrence Stewart, Francis Bach, Quentin Berthet, Jean-Philippe Vert. Regression as Classification: Influence of Task Formulation on Neural Network Features. AISTATS 2023 - 26th International Conference on Artificial Intelligence and Statistics, Apr 2023, Valencia, Spain. ⟨hal-03846706v2⟩
158 views
94 downloads
