A Statistical Framework for Image Category Search from a Mental Picture

Abstract : Starting from a member of an image database designated the “query image,” traditional image retrieval techniques, for example search by visual similarity, allow one to locate additional instances of a target category residing in the database. However, in many cases, the query image or, more generally, the target category, resides only in the mind of the user as a set of subjective visual patterns, psychological impressions or “mental pictures.” Consequently, since image databases available today are often unstructured and lack reliable semantic annotations, it is often not obvious how to initiate a search session; this is the “page zero problem.” We propose a new statistical framework based on relevance feedback to locate an instance of a semantic category in an unstructured image database with no semantic annotations. A search session is initiated from a random sample of images. At each retrieval round the user is asked to select one image from among a set of displayed images – the one that is closest in his opinion to the target class. The matching is then “mental.” Performance is measured by the number of iterations necessary to display an image which satisfies the user, at which point standard techniques can be employed to display other instances. Our core contribution is a Bayesian formulation which scales to large databases. The two key components are a response model which accounts for the user's subjective perception of similarity and a display algorithm which seeks to maximize the flow of information. Experiments with real users and two databases of 20,000 and 60,000 images demonstrate the efficiency of the search process.
Type de document :
[Research Report] RR-6584, INRIA. 2008, pp.31
Liste complète des métadonnées

Littérature citée [28 références]  Voir  Masquer  Télécharger

Contributeur : Marin Ferecatu <>
Soumis le : mardi 22 juillet 2008 - 15:56:48
Dernière modification le : vendredi 25 mai 2018 - 12:02:03
Document(s) archivé(s) le : samedi 26 novembre 2016 - 00:29:22


Fichiers produits par l'(les) auteur(s)


  • HAL Id : inria-00303572, version 2



Marin Ferecatu, Donald Geman. A Statistical Framework for Image Category Search from a Mental Picture. [Research Report] RR-6584, INRIA. 2008, pp.31. 〈inria-00303572v2〉



Consultations de la notice


Téléchargements de fichiers