Unifying discriminative visual codebook generation with classifier training for object category recognition

Liu Yang 1 Rong Jin 1 Rahul Sukthankar 2, 3 Frédéric Jurie 4
4 LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : The idea of representing images using a bag of visual words is currently popular in object category recognition. Since this representation is typically constructed using unsupervised clustering, the resulting visual words may not capture the desired information. Recent work has explored the construction of discriminative visual codebooks that explicitly consider object category information. However, since the codebook generation process is still disconnected from that of classifier training, the set of resulting visual words, while individually discriminative, may not be those best suited for the classifier. This paper proposes a novel optimization framework that unifies codebook generation with classifier training. In our approach, each image feature is encoded by a sequence of ldquovisual bitsrdquo optimized for each category. An image, which can contain objects from multiple categories, is represented using aggregates of visual bits for each category. Classifiers associated with different categories determine how well a given image corresponds to each category. Based on the performance of these classifiers on the training data, we augment the visual words by generating additional bits. The classifiers are then updated to incorporate the new representation. These two phases are repeated until the desired performance is achieved. Experiments compare our approach to standard clustering-based methods and with state-of-the-art discriminative visual codebook generation. The significant improvements over previous techniques clearly demonstrate the value of unifying representation and classification into a single optimization framework.
Type de document :
Communication dans un congrès
CVPR '08 - Conference on Computer Vision & Pattern Recognition, Jun 2008, Anchorage, United States. IEEE Computer Society, pp.1-8, 2008, 〈http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4587504〉. 〈10.1109/CVPR.2008.4587504〉
Liste complète des métadonnées

Littérature citée [18 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00548653
Contributeur : Thoth Team <>
Soumis le : jeudi 6 janvier 2011 - 11:42:38
Dernière modification le : mercredi 29 juillet 2015 - 01:20:35
Document(s) archivé(s) le : lundi 5 novembre 2012 - 15:45:33

Fichier

cvpr2008-unified-rahuls.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Liu Yang, Rong Jin, Rahul Sukthankar, Frédéric Jurie. Unifying discriminative visual codebook generation with classifier training for object category recognition. CVPR '08 - Conference on Computer Vision & Pattern Recognition, Jun 2008, Anchorage, United States. IEEE Computer Society, pp.1-8, 2008, 〈http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=4587504〉. 〈10.1109/CVPR.2008.4587504〉. 〈inria-00548653〉

Partager

Métriques

Consultations de
la notice

509

Téléchargements du document

691