Multi-View Object Class Detection with a 3D Geometric Model

Jörg Liebelt; Cordelia Schmid

doi:10.1109/CVPR.2010.5539836

Communication Dans Un Congrès Année : 2010

Multi-View Object Class Detection with a 3D Geometric Model

(1) , (2)

1
2

Jörg Liebelt

Fonction : Auteur

EADS Innovation Works [Munich]

Cordelia Schmid

Fonction : Auteur
PersonId : 831154

Learning and recognition in vision

Résumé

This paper presents a new approach for multi-view object class detection. Appearance and geometry are treated as separate learning tasks with different training data. Our approach uses a part model which discriminatively learns the object appearance with spatial pyramids from a database of real images, and encodes the 3D geometry of the object class with a generative representation built from a database of synthetic models. The geometric information is linked to the 2D training data and allows to perform an approximate 3D pose estimation for generic object classes. The pose estimation provides an efficient method to evaluate the likelihood of groups of 2D part detections with respect to a full 3D geometry model in order to disambiguate and prune 2D detections and to handle occlusions. In contrast to other methods, neither tedious manual part annotation of training images nor explicit appearance matching between synthetic and real training data is required, which results in high geometric fidelity and in increased flexibility. On the 3D Object Category datasets CAR and BICYCLE, the current state-of-the-art benchmark for 3D object detection, our approach outperforms previously published results for viewpoint estimation.

Mots clés

geometry image representation object detection pose estimation solid modelling

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV]

Fichier principal

cvpr_1423.pdf (2.35 Mo)

1423.jpg (64.16 Ko)

cvpr_1423.jpg (137.22 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Format : Figure, Image

THOTH Team : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00548634

Soumis le : lundi 20 décembre 2010-10:22:35

Dernière modification le : jeudi 4 avril 2024-18:17:53

Archivage à long terme le : lundi 21 mars 2011-02:33:24

Dates et versions

inria-00548634 , version 1 (20-12-2010)

Identifiants

HAL Id : inria-00548634 , version 1
DOI : 10.1109/CVPR.2010.5539836

Citer

Jörg Liebelt, Cordelia Schmid. Multi-View Object Class Detection with a 3D Geometric Model. CVPR 2010 - 23rd IEEE Conference on Computer Vision & Pattern Recognition, Jun 2010, San Francisco, United States. pp.1688-1695, ⟨10.1109/CVPR.2010.5539836⟩. ⟨inria-00548634⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 UGA CNRS INRIA IRISA LJK LJK_GI LJK_GI_LEAR INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

880 Consultations

1543 Téléchargements

Multi-View Object Class Detection with a 3D Geometric Model

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager