Greedy Layerwise Learning Can Scale to ImageNet

Eugene Belilovsky; Michael Eickenberg; Edouard Oyallon

Communication Dans Un Congrès Année : 2019

Greedy Layerwise Learning Can Scale to ImageNet

(1, 2) , (3) , (4, 5)

1
2
3
4
5

Eugene Belilovsky

Fonction : Auteur
PersonId : 963154

Département d'Informatique et de Recherche Opérationnelle [Montreal]

Montreal Institute for Learning Algorithms [Montréal]

Michael Eickenberg

Fonction : Auteur

Lawrence Berkeley National Laboratory [Berkeley]

Edouard Oyallon

Fonction : Auteur
PersonId : 179157
IdHAL : edouard-oyallon
ORCID : 0000-0002-4826-7527
IdRef : 228745500

Organ Modeling through Extraction, Representation and Understanding of Medical Image Content

Centre de vision numérique

Résumé

Shallow supervised 1-hidden layer neural networks have a number of favorable properties that make them easier to interpret, analyze, and optimize than their deep counterparts, but lack their representational power. Here we use 1-hidden layer learning problems to sequentially build deep networks layer by layer, which can inherit properties from shallow networks. Contrary to previous approaches using shallow networks, we focus on problems where deep learning is reported as critical for success. We thus study CNNs on image classification tasks using the large-scale ImageNet dataset and the CIFAR-10 dataset. Using a simple set of ideas for architecture and training we find that solving sequential 1-hidden-layer auxiliary problems lead to a CNN that exceeds AlexNet performance on ImageNet. Extending this training methodology to construct individual layers by solving 2-and-3-hidden layer auxiliary problems , we obtain an 11-layer network that exceeds several members of the VGG model family on ImageNet, and can train a VGG-11 model to the same accuracy as end-to-end learning. To our knowledge, this is the first competitive alternative to end-to-end training of CNNs that can scale to ImageNet. We illustrate several interesting properties of these models theoretically and conduct a range of experiments to study the properties this training induces on the intermediate representations .

Domaines

Apprentissage [cs.LG] Vision par ordinateur et reconnaissance de formes [cs.CV]

Fichier principal

iclr_2019.pdf (593.25 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Eugene Belilovsky : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-02119398

Soumis le : vendredi 3 mai 2019-17:15:55

Dernière modification le : mercredi 15 mars 2023-08:56:17

Dates et versions

hal-02119398 , version 1 (03-05-2019)

Identifiants

HAL Id : hal-02119398 , version 1
ARXIV : 1812.11446

Citer

Eugene Belilovsky, Michael Eickenberg, Edouard Oyallon. Greedy Layerwise Learning Can Scale to ImageNet. ICML 2019 - 36th International Conference on Machine Learning, Jun 2019, Long Beach, CA, United States. ⟨hal-02119398⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INRIA CVN CENTRALESUPELEC INRIA2 UNIV-PARIS-SACLAY GS-ENGINEERING GS-COMPUTER-SCIENCE

172 Consultations

208 Téléchargements

Greedy Layerwise Learning Can Scale to ImageNet

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager