Conference paper. Year: 2012

Learning Object Class Detectors from Weakly Annotated Video

Abstract

Object detectors are typically trained on a large set of still images annotated with bounding boxes. This paper introduces an approach for learning object detectors from real-world web videos known only to contain objects of a target class. We propose a fully automatic pipeline that localizes objects in a set of videos of the class and learns a detector for it. The approach extracts candidate spatio-temporal tubes based on motion segmentation and then selects one tube per video jointly over all videos. To compare to the state of the art, we test our detector on still images, i.e., Pascal VOC 2007. We observe that frames extracted from web videos can differ significantly in quality from still images taken by a good camera. Thus, we formulate learning from videos as a domain adaptation task. We show that training from a combination of weakly annotated videos and fully annotated still images using domain adaptation improves the performance of a detector trained from still images alone.
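The joint selection step (one tube per video, chosen over all videos together) can be illustrated with a small sketch. The abstract does not state the objective or the optimizer, so everything here is an assumption: each candidate tube is summarized by a fixed-length descriptor (the hypothetical `features` argument), the cost is the summed squared distance between selected tubes, and a greedy coordinate descent stands in for whatever inference the paper actually uses.

```python
import numpy as np


def select_tubes(features, n_sweeps=10):
    """Pick one candidate tube per video so the picks look alike across videos.

    features: list with one array per video, each of shape (n_tubes, d).
              The descriptor itself is an assumption of this sketch.
    Returns a list with the index of the selected tube for each video.
    """
    n_videos = len(features)
    # Start from the first candidate in every video.
    selected = [0] * n_videos
    if n_videos < 2:
        return selected  # nothing to compare against

    for _ in range(n_sweeps):
        changed = False
        for v in range(n_videos):
            # Descriptors of the tubes currently selected in the other videos.
            others = np.stack(
                [features[u][selected[u]] for u in range(n_videos) if u != v]
            )
            # Cost of each candidate in video v: summed squared distance
            # to the current selections of all other videos.
            diffs = features[v][:, None, :] - others[None, :, :]
            costs = (diffs ** 2).sum(axis=(1, 2))
            best = int(np.argmin(costs))
            if best != selected[v]:
                selected[v] = best
                changed = True
        if not changed:
            break  # no video changed its pick: a local optimum
    return selected
```

Coordinate descent of this kind only reaches a local optimum and depends on the starting point. It is meant to show why the choice is joint (each video's pick depends on the picks in the other videos), not to reproduce the paper's results.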
Main file
VO.pdf (2.31 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-00695940, version 1 (10-05-2012)
hal-00695940, version 2 (02-07-2012)

Cite

Alessandro Prest, Christian Leistner, Javier Civera, Cordelia Schmid, Vittorio Ferrari. Learning Object Class Detectors from Weakly Annotated Video. CVPR 2012 - Conference on Computer Vision and Pattern Recognition, Jun 2012, Providence, RI, United States. pp.3282-3289, ⟨10.1109/CVPR.2012.6248065⟩. ⟨hal-00695940v2⟩