Constructing Category Hierarchies for Visual Recognition

Marcin Marszałek 1 Cordelia Schmid 1
1 LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : Class hierarchies are commonly used to reduce the complexity of the classification problem. This is crucial in situations when one has to deal with multiple categories. In this work, we evaluate the suitability of class hierarchies currently constructed for visual recognition. We show that top-down as well as bottom-up approaches that are commonly used to automatically construct hierarchies, incorporate assumptions about separability of classes that cannot be fulfilled in the case of visual recognition of a large number of object categories. We propose a modification which is appropriate for most top-down approaches. It allows to construct better class hierarchies that postpone decisions in the presence of uncertainty and thus provide higher recognition accuracy. We also compare our method to flat one-against-all approach and show how to control the speed-for-accuracy trade-off by using our method. For the experimental evaluation, we use the Caltech-256 visual object classes dataset and compare to the state-of-the-art.
Type de document :
Communication dans un congrès
David A. Forsyth and Philip H. S. Torr and Andrew Zisserman. ECCV 2008 - 10th European Conference on Computer Vision, Oct 2008, Marseille, France. Springer-Verlag, 5305, pp.479-491, 2008, Lecture Notes in Computer Science. <http://www.springerlink.com/content/l176732j52071g72/>. <10.1007/978-3-540-88693-8_35>
Liste complète des métadonnées


https://hal.inria.fr/inria-00548656
Contributeur : Thoth Team <>
Soumis le : lundi 20 décembre 2010 - 10:24:36
Dernière modification le : mercredi 9 juillet 2014 - 16:07:17
Document(s) archivé(s) le : lundi 21 mars 2011 - 03:28:37

Identifiants

Collections

Citation

Marcin Marszałek, Cordelia Schmid. Constructing Category Hierarchies for Visual Recognition. David A. Forsyth and Philip H. S. Torr and Andrew Zisserman. ECCV 2008 - 10th European Conference on Computer Vision, Oct 2008, Marseille, France. Springer-Verlag, 5305, pp.479-491, 2008, Lecture Notes in Computer Science. <http://www.springerlink.com/content/l176732j52071g72/>. <10.1007/978-3-540-88693-8_35>. <inria-00548656>

Partager

Métriques

Consultations de
la notice

353

Téléchargements du document

633