Deep invariant networks with differentiable augmentation layers

Cédric Rommel; Thomas Moreau; Alexandre Gramfort

Communication Dans Un Congrès Année : 2022

Deep invariant networks with differentiable augmentation layers

(1, 2) , (1, 2) , (2, 1)

1
2

Cédric Rommel

Fonction : Auteur
PersonId : 1023227

CEA- Saclay

Modèles et inférence pour les données de Neuroimagerie

Thomas Moreau

Fonction : Auteur
PersonId : 171108
IdHAL : tommoral
ORCID : 0000-0002-1523-3419

CEA- Saclay

Modèles et inférence pour les données de Neuroimagerie

Alexandre Gramfort

Fonction : Auteur
PersonId : 687
IdHAL : agramfort
ORCID : 0000-0001-9791-4404
IdRef : 169233758

Modèles et inférence pour les données de Neuroimagerie

CEA- Saclay

Résumé

Designing learning systems which are invariant to certain data transformations is critical in machine learning. Practitioners can typically enforce a desired invariance on the trained model through the choice of a network architecture, e.g. using convolutions for translations, or using data augmentation. Yet, enforcing true invariance in the network can be difficult, and data invariances are not always known a piori. State-of-the-art methods for learning data augmentation policies require held-out data and are based on bilevel optimization problems, which are complex to solve and often computationally demanding. In this work we investigate new ways of learning invariances only from the training data. Using learnable augmentation layers built directly in the network, we demonstrate that our method is very versatile. It can incorporate any type of differentiable augmentation and be applied to a broad class of learning problems beyond computer vision. We provide empirical evidence showing that our approach is easier and faster to train than modern automatic data augmentation techniques based on bilevel optimization, while achieving comparable results. Experiments show that while the invariances transferred to a model through automatic data augmentation are limited by the model expressivity, the invariance yielded by our approach is insensitive to it by design.

Mots clés

invariance learning data augmentation automatic data augmentation

Domaines

Intelligence artificielle [cs.AI] Apprentissage [cs.LG]

Fichier principal

main.pdf (1.39 Mo)

ablation-weights-mags-heatmap.pdf (24.11 Ko)

architecture.pdf (78.12 Ko)

augnet-augerino-and-ablation.pdf (123.2 Ko)

benchmark_mass_with_adda.pdf (16.6 Ko)

cifar10-neurips-submission.pdf (26.31 Ko)

exp-illustration.pdf (39.35 Ko)

four-aug-layers.pdf (35.57 Ko)

learned_shift.pdf (10.33 Ko)

reg_illustration_color.pdf (8.16 Ko)

sin_capacity_study.pdf (13.98 Ko)

sin_dataset_illustration.pdf (37.32 Ko)

sin_learned_shift.pdf (11.89 Ko)

sin_params_heatmap.pdf (20.59 Ko)

sin_training_plot.pdf (14.21 Ko)

two-aug-layers.pdf (25.49 Ko)

weight_mags_seed1.pdf (20.58 Ko)

weight_mags_seed29.pdf (18.21 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Cédric Rommel : Connectez-vous pour contacter le contributeur

https://hal.science/hal-03691742

Soumis le : jeudi 9 juin 2022-12:08:03

Dernière modification le : mercredi 3 avril 2024-10:20:12

Dates et versions

hal-03691742 , version 1 (09-06-2022)

Identifiants

HAL Id : hal-03691742 , version 1
ARXIV : 2202.02142

Citer

Cédric Rommel, Thomas Moreau, Alexandre Gramfort. Deep invariant networks with differentiable augmentation layers. Thirty-sixth Conference on Neural Information Processing Systems, Nov 2022, New Orleans, United States. ⟨hal-03691742⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CEA INRIA INRIA2 GENCI ANR GS-ENGINEERING GS-COMPUTER-SCIENCE

40 Consultations

53 Téléchargements

Deep invariant networks with differentiable augmentation layers

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager