Towards true 3D object recognition

Jean Ponce; Svetlana Lazebnik; Fred Rothganger; Cordelia Schmid

Conference paper, Year: 2004


Jean Ponce

Role: Author
PersonId : 853809

The Beckman Institute for Advanced Science and Technology

Svetlana Lazebnik

Role: Author

The Beckman Institute for Advanced Science and Technology

Fred Rothganger

Role: Author

The Beckman Institute for Advanced Science and Technology

Department of Computer Science [UIUC]

Cordelia Schmid

Role: Corresponding author
PersonId : 831154

Learning and recognition in vision

Abstract

This talk addresses the problem of recognizing three-dimensional (3D) objects in photographs and image sequences, revisiting viewpoint invariants as a local representation of shape and appearance. The key insight is that, although smooth surfaces are almost never planar in the large, and thus do not (in general) admit global invariants, they are always planar in the small: sufficiently small surface patches can always be treated as sets of coplanar points, and can therefore be represented locally by planar invariants. This is the basis for a new, unified approach to object recognition in which an object model consists of a collection of small (planar) patches, their invariants, and a description of their 3D spatial relationship. Specifically, the local invariants used in this proposal are the affine-invariant descriptions of the image brightness pattern in the neighborhood of salient image features ("interest points") recently developed by Lindeberg and Garding and by Mikolajczyk and Schmid. These affine-invariant patches provide a normalized representation of the local object appearance that is invariant under viewpoint and illumination changes and can be used as a local measure of image, part, or object similarity. The spatial relationship between local invariants is used to represent the global object structure and to drive the recognition process. I will illustrate our approach with two fundamental instances of the 3D object recognition problem: (1) modeling rigid 3D objects from a small set of unregistered pictures and recognizing them in cluttered photographs taken from unconstrained viewpoints; and (2) representing, learning, and recognizing non-uniform texture patterns under non-rigid transformations. If time permits, I will conclude with a brief discussion of our current work in 3D photography using shape, texture, and motion cues.
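
The idea of a "normalized representation" of a locally planar patch can be illustrated with a toy example. The sketch below is not the authors' method: it assumes a patch can be stood in for by a 2D point cloud and uses the cloud's covariance in place of the image second-moment matrix used in affine adaptation. The helper names (whiten, residual_transform) are hypothetical. It shows that normalizing a patch and an affinely warped copy of it leaves the two related only by a rotation or reflection, which is the sense in which the affine ambiguity is removed.

```python
import numpy as np

def whiten(points):
    """Map a 2D point cloud to a frame where its covariance is the identity.

    Assumption: the covariance of the points plays the role of the
    second-moment matrix used for affine normalization of image patches.
    """
    centered = points - points.mean(axis=0)
    cov = centered.T @ centered / len(points)
    # Inverse square root of the (symmetric, positive-definite) covariance.
    evals, evecs = np.linalg.eigh(cov)
    inv_sqrt = evecs @ np.diag(evals ** -0.5) @ evecs.T
    return centered @ inv_sqrt

def residual_transform(a, b):
    """Least-squares 2x2 matrix R such that a @ R approximates b."""
    R, *_ = np.linalg.lstsq(a, b, rcond=None)
    return R

# Synthetic "patch": an anisotropic cloud of 200 points (made-up data).
rng = np.random.default_rng(0)
patch = rng.normal(size=(200, 2)) * np.array([3.0, 1.0])

# The same patch seen under an arbitrary affine map (hypothetical values).
A = np.array([[1.2, 0.7], [-0.3, 0.9]])
t = np.array([5.0, -2.0])
warped = patch @ A.T + t

n1 = whiten(patch)
n2 = whiten(warped)

# After normalization, what remains between the two views is orthogonal.
R = residual_transform(n1, n2)
print("residual is orthogonal:", np.allclose(R.T @ R, np.eye(2)))
```

In this toy setting the remaining orthogonal ambiguity would still have to be resolved separately, for example by picking a dominant orientation; that step is outside the scope of the sketch.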

Domains

Computer Vision and Pattern Recognition [cs.CV]

Contributor: THOTH Team

https://inria.hal.science/inria-00548535

Submitted on: Monday, December 20, 2010, 09:09:28

Last modified on: Thursday, April 4, 2024, 21:30:59

Dates and versions

inria-00548535, version 1 (20-12-2010)

Identifiers

HAL Id: inria-00548535, version 1

Cite

Jean Ponce, Svetlana Lazebnik, Fred Rothganger, Cordelia Schmid. Towards true 3D object recognition. 14ème Congrès de Reconnaissance des Formes et Intelligence Artificielle (RFIA '04), Jan 2004, Toulouse, France. ⟨inria-00548535⟩

Collections

UGA IMAG CNRS INRIA INRIA2

