On the Shift Invariance of Max Pooling Feature Maps in Convolutional Neural Networks

Hubert Leterme; Kévin Polisano; Valérie Perrier; Karteek Alahari

Preprints, Working Papers, ... Year : 2023

Hubert Leterme

Function : Author
PersonId : 751941
IdHAL : hubert-leterme
ORCID : 0000-0002-4840-5299
IdRef : 272860786

Equations aux Dérivées Partielles

Apprentissage de modèles à partir de données massives

Kévin Polisano

Function : Author

Statistique pour le Vivant et l’Homme

Valérie Perrier

Function : Author

Equations aux Dérivées Partielles

Karteek Alahari

Function : Author
PersonId : 19670
IdHAL : karteek
ORCID : 0000-0002-1838-5936
IdRef : 196283892

Apprentissage de modèles à partir de données massives

Abstract

This paper focuses on improving the mathematical interpretability of convolutional neural networks (CNNs) in the context of image classification. Specifically, we tackle the instability issue arising in their first layer, which tends to learn parameters that closely resemble oriented band-pass filters when trained on datasets like ImageNet. Subsampled convolutions with such Gabor-like filters are prone to aliasing, causing sensitivity to small input shifts. In this context, we establish conditions under which the max pooling operator approximates a complex modulus, which is nearly shift invariant. We then derive a measure of shift invariance for subsampled convolutions followed by max pooling. In particular, we highlight the crucial role played by the filter's frequency and orientation in achieving stability. We experimentally validate our theory by considering a deterministic feature extractor based on the dual-tree complex wavelet packet transform, a particular case of discrete Gabor-like decomposition.
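The claim that max pooling can approximate a complex modulus can be illustrated with a small numerical sketch. The code below is not the authors' method or experimental setup. It is a simplified 1D analogue under stated assumptions: a hypothetical Gaussian-windowed complex filter (`gabor_filter`), no subsampling before pooling, and illustrative parameter values (`xi`, `sigma`, `half_width`, `pool`) that do not come from the paper. It compares the max-pooled real part of the filter response with the max-pooled modulus.

```python
import numpy as np


def gabor_filter(xi, sigma, half_width):
    """Complex 1D Gabor-like filter: Gaussian envelope times exp(i * xi * n).

    Hypothetical helper for illustration; not taken from the paper.
    """
    n = np.arange(-half_width, half_width + 1)
    envelope = np.exp(-0.5 * (n / sigma) ** 2)
    envelope /= envelope.sum()  # normalize the envelope to unit sum
    return envelope * np.exp(1j * xi * n)


def max_pool_1d(values, size):
    """Non-overlapping max pooling; trailing samples that do not fill a window are dropped."""
    usable = (len(values) // size) * size
    return values[:usable].reshape(-1, size).max(axis=1)


def pooled_real_vs_modulus(x, xi=2 * np.pi / 8, sigma=6.0, half_width=24, pool=8):
    """Return (max-pooled real part, max-pooled modulus) of the complex response.

    All default parameter values are assumptions chosen for this sketch.
    """
    w = gabor_filter(xi, sigma, half_width)
    z = np.convolve(x, w, mode="valid")  # complex-valued filter response
    return max_pool_1d(z.real, pool), max_pool_1d(np.abs(z), pool)


# Toy input: a sinusoid at the filter's center frequency.
n = np.arange(256)
x = np.cos(2 * np.pi / 8 * n + 0.3)
real_pooled, modulus_pooled = pooled_real_vs_modulus(x)
ratio = real_pooled / modulus_pooled
print("min ratio:", ratio.min(), "max ratio:", ratio.max())
```

In this toy setting the pooling window spans one full period of the filter's oscillation, so the real part is sampled at phases spaced π/4 apart and its maximum stays within a factor cos(π/8) of the modulus. Shorter windows or other frequencies loosen that bound, which is in line with the abstract's emphasis on the role of the filter's frequency.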

Keywords

deep learning; image classification; dual-tree wavelet packet transform; max pooling; shift invariance; feature extractor; subsampling; aliasing

Domains

Computer Vision and Pattern Recognition [cs.CV]; Artificial Intelligence [cs.AI]; Signal and Image Processing; Machine Learning [stat.ML]

Main file

preprint.pdf (1.71 MB)

Origin : Files produced by the author(s)

Contributor : Hubert Leterme

https://hal.science/hal-03779434

Submitted on : Tuesday, October 24, 2023, 1:20:27 PM

Last modification on : Saturday, April 27, 2024, 3:14:50 AM

Dates and versions

hal-03779434 , version 1 (16-09-2022)

hal-03779434 , version 2 (24-10-2023)

Licence

Attribution

Identifiers

HAL Id : hal-03779434 , version 2

Cite

Hubert Leterme, Kévin Polisano, Valérie Perrier, Karteek Alahari. On the Shift Invariance of Max Pooling Feature Maps in Convolutional Neural Networks. 2023. ⟨hal-03779434v2⟩

Collections

UGA CNRS INRIA INSMI LJK LJK_GI LJK_MAD LJK_PS LJK_MAD_EDP PERSYVAL-LAB LJK-PS-SVH INRIA2 LJK-GI-THOTH ANR
