Spatial pyramid matching

Svetlana Lazebnik; Cordelia Schmid; Jean Ponce

Book chapter. Year: 2009

Svetlana Lazebnik

Role: Author

Department of Computer Science

Cordelia Schmid

Role: Author
PersonId : 831154

Learning and recognition in vision

Jean Ponce

Role: Author
PersonId : 853809

Laboratoire d'informatique de l'école normale supérieure

Abstract

This chapter deals with the problem of whole-image categorization. We may want to classify a photograph based on a high-level semantic attribute (e.g., indoor or outdoor), scene type (forest, street, office, etc.), or object category (car, face, etc.). Our philosophy is that such global image tasks can be approached in a holistic fashion: It should be possible to develop image representations that use low-level features to directly infer high-level semantic information about the scene without going through the intermediate step of segmenting the image into more "basic" semantic entities. For example, we should be able to recognize that an image contains a beach scene without first segmenting and identifying its separate components, such as sand, water, sky, or bathers. This philosophy is inspired by psychophysical and psychological evidence that people can recognize scenes by considering them in a "holistic" manner, while overlooking most of the details of the constituent objects (Oliva and Torralba, 2001). It has been shown that human subjects can perform high-level categorization tasks extremely rapidly and in the near absence of attention (Thorpe et al., 1996; Fei-Fei et al., 2002), which would most likely preclude any feedback or detailed analysis of individual parts of the scene.

Domains

Computer Vision and Pattern Recognition [cs.CV]

Main file

pyramid_chapter.pdf (1.12 MB)

Origin: Files produced by the author(s)

Contributor: THOTH Team

https://inria.hal.science/inria-00548647

Submitted on: Thursday, January 6, 2011, 11:20:59

Last modified on: Friday, April 19, 2024, 16:18:55

Long-term archiving on: Thursday, April 7, 2011, 02:36:44

Dates and versions

inria-00548647 , version 1 (06-01-2011)

Identifiers

HAL Id: inria-00548647, version 1

Cite

Svetlana Lazebnik, Cordelia Schmid, Jean Ponce. Spatial pyramid matching. In Sven J. Dickinson, Aleš Leonardis, Bernt Schiele, and Michael J. Tarr (eds.), Object Categorization: Computer and Human Vision Perspectives, Cambridge University Press, pp. 401-415, 2009, ISBN 9780521887380. ⟨inria-00548647⟩

Collections

ENS-PARIS UNIV-RENNES1 UGA CNRS INRIA IRISA LJK LJK_GI LJK_GI_LEAR INRIA2 PSL UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM
