Report (Research Report) Year: 2008

A Statistical Framework for Image Category Search from a Mental Picture

Abstract

Starting from a member of an image database designated the “query image,” traditional image retrieval techniques, for example search by visual similarity, allow one to locate additional instances of a target category residing in the database. However, in many cases, the query image or, more generally, the target category, resides only in the mind of the user as a set of subjective visual patterns, psychological impressions or “mental pictures.” Consequently, since image databases available today are often unstructured and lack reliable semantic annotations, it is often not obvious how to initiate a search session; this is the “page zero problem.” We propose a new statistical framework based on relevance feedback to locate an instance of a semantic category in an unstructured image database with no semantic annotations. A search session is initiated from a random sample of images. At each retrieval round the user is asked to select one image from among a set of displayed images – the one that is closest in his opinion to the target class. The matching is then “mental.” Performance is measured by the number of iterations necessary to display an image which satisfies the user, at which point standard techniques can be employed to display other instances. Our core contribution is a Bayesian formulation which scales to large databases. The two key components are a response model which accounts for the user's subjective perception of similarity and a display algorithm which seeks to maximize the flow of information. Experiments with real users and two databases of 20,000 and 60,000 images demonstrate the efficiency of the search process.
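The feedback loop described in the abstract can be illustrated with a minimal sketch. It is not the report's actual model. It assumes a single target image rather than a category, a softmax response model over feature distances (with a hypothetical `sigma` parameter), and a display rule that simply samples from the current posterior instead of maximizing information flow. All function names are hypothetical.

```python
import numpy as np

def response_likelihood(features, display, chosen, sigma=1.0):
    """P(user picks display[chosen] | target = t), for every candidate t.

    Assumed response model: a softmax over negative feature distances,
    so displayed images closer to the target are more likely to be picked.
    """
    shown = features[display]                      # (K, D)
    # Distance from each displayed image to each candidate target: (K, N).
    d = np.linalg.norm(shown[:, None, :] - features[None, :, :], axis=2)
    logits = -d / sigma
    logits -= logits.max(axis=0, keepdims=True)    # numerical stability
    probs = np.exp(logits)
    probs /= probs.sum(axis=0, keepdims=True)
    return probs[chosen]                           # (N,)

def update_posterior(posterior, likelihood):
    """Bayes rule: multiply the prior by the likelihood and renormalize."""
    post = posterior * likelihood
    return post / post.sum()

def choose_display(posterior, k, rng):
    """Assumed display rule: draw k distinct images from the posterior."""
    return rng.choice(len(posterior), size=k, replace=False, p=posterior)

def simulate_session(features, target, k=8, max_rounds=50, seed=0):
    """Run rounds until the target is displayed; return (rounds, posterior)."""
    rng = np.random.default_rng(seed)
    n = len(features)
    posterior = np.full(n, 1.0 / n)                # uniform prior
    for round_no in range(1, max_rounds + 1):
        display = choose_display(posterior, k, rng)
        if target in display:
            return round_no, posterior
        # Simulated user: picks the displayed image closest to the target.
        dists = np.linalg.norm(features[display] - features[target], axis=1)
        chosen = int(np.argmin(dists))
        lik = response_likelihood(features, display, chosen)
        posterior = update_posterior(posterior, lik)
    return None, posterior
```

Calling `simulate_session(features, target)` on an array of feature vectors returns the number of rounds needed to display the target (or `None` if `max_rounds` is exhausted), which mirrors the performance measure named in the abstract. The user here is simulated and noise-free, unlike the real users in the reported experiments.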
Main file
RR-6584.pdf (962.79 KB)
Origin: Files produced by the author(s)

Dates and versions

inria-00303572 , version 1 (22-07-2008)
inria-00303572 , version 2 (22-07-2008)

Identifiers

  • HAL Id : inria-00303572 , version 2

Cite

Marin Ferecatu, Donald Geman. A Statistical Framework for Image Category Search from a Mental Picture. [Research Report] RR-6584, INRIA. 2008, pp.31. ⟨inria-00303572v2⟩
158 views
356 downloads
