sign in
english version rss feed

inria-00548640, version 1

Multimodal semi-supervised learning for image classification

Matthieu Guillaumin () a12, Jakob Verbeek () 1, Cordelia Schmid () 12

23rd IEEE Conference on Computer Vision & Pattern Recognition (CVPR '10) (2010) 902--909

Abstract: In image categorization the goal is to decide if an image belongs to a certain category or not. A binary classifier can be learned from manually labeled images; while using more labeled examples improves performance, obtaining the image labels is a time consuming process. We are interested in how other sources of information can aid the learning process given a fixed amount of labeled images. In particular, we consider a scenario where keywords are associated with the training images, eg as found on photo sharing websites. The goal is to learn a classifier for images alone, but we will use the keywords associated with labeled and unlabeled images to improve the classifier using semi-supervised learning. We first learn a strong Multiple Kernel Learning (MKL) classifier using both the image content and keywords, and use it to score unlabeled images. We then learn classifiers on visual features only, either support vector machines (SVM) or least-squares regression (LSR), from the MKL output values on both the labeled and unlabeled images. In our experiments on 20 classes from the PASCAL VOC'07 set and 38 from the MIR Flickr set, we demonstrate the benefit of our semi-supervised approach over only using the labeled images. We also present results for a scenario where we do not use any manual labeling but directly learn classifiers from the image tags. The semi-supervised approach also improves classification accuracy in this case.

  • Icone de GVS10.png
  • a –  INRIA
  • 1:  LEAR (INRIA Grenoble Rhône-Alpes / LJK Laboratoire Jean Kuntzmann)
  • CNRS : FR71 – CNRS : UMR5527 – INRIA – Laboratoire Jean Kuntzmann – Université Joseph Fourier - Grenoble I – Institut National Polytechnique de Grenoble (INPG)
  • 2:  Laboratoire Jean Kuntzmann (LJK)
  • CNRS : UMR5224 – Université Joseph Fourier - Grenoble I – Université Pierre Mendès-France - Grenoble II – Institut Polytechnique de Grenoble - Grenoble Institute of Technology
  • Domain : Computer Science/Computer Vision and Pattern Recognition
  • Keywords : image classification – learning (artificial intelligence) – regression analysis – support vector machines
 
  • inria-00548640, version 1
  • oai:hal.inria.fr:inria-00548640
  • From: 
  • Submitted for: 
  • Submitted on: Monday, 20 December 2010 10:23:35
  • Updated on: Friday, 1 July 2011 09:28:48
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...
all articles on CCSd database...