Discriminative part model for visual recognition

Ronan Sicre 1 Frédéric Jurie 1
1 Equipe Image - Laboratoire GREYC - UMR6072
GREYC - Groupe de Recherche en Informatique, Image, Automatique et Instrumentation de Caen
Abstract : The recent literature on visual recognition and image classification has been mainly focused on Deep Convolutional Neural Networks (Deep CNN) and their variants, which has resulted in a significant progression of the performance of these algorithms. Building on these recent advances, this paper proposes to explicitly add translation and scale invariance to Deep CNN-based local representations, by introducing a new algorithm for image recognition which is modeling image categories as a collection of automatically discovered distinctive parts. These parts are matched across images while learning their visual model and are finally pooled to provide images signatures. The appearance model of the parts is learnt from the training images to allow the distinction between the categories to be recognized. A key ingredient of the approach is a softassign-like matching algorithm that simultaneously learns the model of each part and automatically assigns image regions to the model's parts. Once the model of the category is trained, it can be used to classify new images by finding image's regions similar to the learned parts and encoding them in a single compact signature. The experimental validation shows that the performance of the proposed approach is better than those of the latest Deep Convolutional Neural Networks approaches, hence providing state-of-the art results on several publicly available datasets.
Type de document :
Article dans une revue
Computer Vision and Image Understanding, Elsevier, 2015, http://www.sciencedirect.com/science/article/pii/S1077314215001642#. 〈10.1016/j.cviu.2015.08.002〉
Liste complète des métadonnées

https://hal.inria.fr/hal-01132389
Contributeur : Ronan Sicre <>
Soumis le : mardi 25 août 2015 - 15:31:35
Dernière modification le : mardi 5 juin 2018 - 18:00:02
Document(s) archivé(s) le : mercredi 26 avril 2017 - 10:26:37

Fichier

cviu-parts.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Citation

Ronan Sicre, Frédéric Jurie. Discriminative part model for visual recognition. Computer Vision and Image Understanding, Elsevier, 2015, http://www.sciencedirect.com/science/article/pii/S1077314215001642#. 〈10.1016/j.cviu.2015.08.002〉. 〈hal-01132389v2〉

Partager

Métriques

Consultations de la notice

342

Téléchargements de fichiers

447