Taxonomic Multi-class Prediction and Person Layout using Efficient Structured Ranking

Abstract : In computer vision efficient multi-class classification is becoming a key problem as the field develops and the number of object classes to be identified increases. Often objects might have some sort of structure such as a taxonomy in which the mis-classification score for object classes close by, using tree distance within the taxonomy, should be less than for those far apart. This is an example of multi-class classification in which the loss function has a special structure. Another example in vision is for the ubiquitous pictorial structure or parts based model. In this case we would like the mis-classification score to be proportional to the number of parts misclassified. It transpires both of these are examples of structured output ranking problems. However, so far no efficient large scale algorithm for this problem has been demonstrated. In this work we propose an algorithm for structured output ranking that can be trained in a time linear in the number of samples under a mild assumption common to many computer vision problems: that the loss function can be discretized into a small number of values. We show the feasibility of structured ranking on these two core computer vision problems and demonstrate a consistent and substantial improvement over competing techniques. Aside from this, we also achieve state-of-the art results for the PASCAL VOC human layout problem.
Type de document :
Communication dans un congrès
European Conference on Computer Vision, Oct 2012, Firenze, Italy. Springer, 7573, pp.245-258, 2012, Lecture Notes in Computer Science. 〈10.1007/978-3-642-33709-3_18〉
Liste complète des métadonnées

Littérature citée [25 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00775992
Contributeur : Matthew Blaschko <>
Soumis le : vendredi 31 janvier 2014 - 16:45:29
Dernière modification le : vendredi 12 janvier 2018 - 11:23:42
Document(s) archivé(s) le : samedi 1 avril 2017 - 04:34:20

Fichier

mittal12.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Arpit Mittal, Matthew Blaschko, Andrew Zisserman, Philip Torr. Taxonomic Multi-class Prediction and Person Layout using Efficient Structured Ranking. European Conference on Computer Vision, Oct 2012, Firenze, Italy. Springer, 7573, pp.245-258, 2012, Lecture Notes in Computer Science. 〈10.1007/978-3-642-33709-3_18〉. 〈hal-00775992〉

Partager

Métriques

Consultations de la notice

223

Téléchargements de fichiers

161