8481 articles  [version française]

inria-00516477, version 1

Weakly supervised learning of interactions between humans and objects

Alessandro Prest () a12, Cordelia Schmid () a1, Vittorio Ferrari () b2

N° RT-391 (2010)

Abstract: We introduce a weakly supervised approach for learning human actions modeled as interactions between humans and objects. Our approach is human-centric: we first localize a human in the image and then determine the object relevant for the action and its spatial relation with the human. The model is learned automatically from a set of still images annotated (only) with the action label. Our approach relies on a human detector to initialize the model learning. For robustness to various degrees of visibility, we build a detector that learns to combine a set of existing part detectors. Starting from humans detected in a set of images depicting the action, our approach determines the action object and its spatial relation to the human. Its final output is a probabilistic model of the human-object interaction, i.e. the spatial relation between the human and the object.

  • Icone de Screenshot.png
  • a –  INRIA
  • b –  Swiss Federal Institute of Technology Zurich
  • 1:  LEAR (INRIA Grenoble Rhône-Alpes / LJK Laboratoire Jean Kuntzmann)
  • CNRS : UMR5527 – INRIA – Laboratoire Jean Kuntzmann – Université Joseph Fourier - Grenoble I – Institut National Polytechnique de Grenoble (INPG)
  • 2:  Eldgenössische Technische Hochschule Zürich (ETH Zürich)
  • ETH Zurich
  • Domain : Computer Science/Computer Vision and Pattern Recognition
  • Internal note : RT-391
 
  • inria-00516477, version 1
  • oai:hal.inria.fr:inria-00516477
  • From: 
  • Submitted on: Thursday, 9 September 2010 17:28:21
  • Updated on: Friday, 25 May 2012 13:57:19