28973 articles – 22398 references  [version française]

hal-00437538, version 1

Optimisation de la Topologie pour les Réseaux de Neurones Profonds

Ludovic Arnold () 12, Hélène Paugam-Moisy () 23, Michèle Sebag () 12

17e congrès francophone AFRIF-AFIA Reconnaissance des Formes et Intelligence Artificielle - RFIA 2010 (2010) .

Abstract: Recently, deep neural networks have proven their ability to achieve excellent results on tasks such as classification and dimensionality reduction. The issue of hyper-parameter selection is decisive for these networks since the size of the search space increases exponentially with the number of layers. As a result, the grid-search approach is inappropriate and it is often left to the experimenter to ``guess'' sensible values for the hyper-parameters. In this study, we propose to select hyper-parameters layer after layer, on the basis of an unsupervised criterion, thus reducing to linear the complexity of the hyper-parameter selection procedure. Two unsupervised criteria are considered and the study focuses on determining an optimal number of neurons per layer. Experimentally, we show that the reconstruction error constitutes an adequate criterion for the layer-wise optimization of the number of neurons. In addition, we observe that the optimal size of layers tends to decrease when the number of training samples increases and we discuss this counter-intuitive result.

  • 1:  Laboratoire de Recherche en Informatique (LRI)
  • CNRS : UMR8623 – Université Paris XI - Paris Sud
  • 2:  TAO (INRIA Saclay - Ile de France)
  • INRIA – CNRS : UMR8623 – Université Paris XI - Paris Sud
  • 3:  Laboratoire d'InfoRmatique en Images et Systèmes d'Information (LIRIS)
  • CNRS : UMR5205 – Université Claude Bernard - Lyon I – Université Lumière - Lyon II – Institut National des Sciences Appliquées (INSA) - Lyon – Ecole Centrale de Lyon
  • Domain : Computer Science/Neural and Evolutionary Computing
    Computer Science/Learning
  • Keywords : Learning – Deep Neural Networks – Model Selection
 
  • hal-00437538, version 1
  • oai:hal.archives-ouvertes.fr:hal-00437538
  • From: 
  • Submitted on: Tuesday, 1 December 2009 13:34:03
  • Updated on: Tuesday, 1 December 2009 14:03:39