Learning Object Class Detectors from Weakly Annotated Video

Alessandro Prest; Christian Leistner; Javier Civera; Cordelia Schmid; Vittorio Ferrari

doi:10.1109/CVPR.2012.6248065

Conference Papers Year : 2012

Learning Object Class Detectors from Weakly Annotated Video

(1, 2) , (2) , (3) , (1) , (2)

1
2
3

Alessandro Prest

Function : Author
PersonId : 879018

Learning and recognition in vision

Eidgenössische Technische Hochschule - Swiss Federal Institute of Technology [Zürich]

Christian Leistner

Function : Author

Eidgenössische Technische Hochschule - Swiss Federal Institute of Technology [Zürich]

Javier Civera

Function : Author
PersonId : 885704

Departamento de Informática e Ingeniería de Sistemas

Cordelia Schmid

Function : Author
PersonId : 831154

Learning and recognition in vision

Vittorio Ferrari

Function : Author
PersonId : 852592

Eidgenössische Technische Hochschule - Swiss Federal Institute of Technology [Zürich]

Abstract

Object detectors are typically trained on a large set of still images annotated by bounding-boxes. This paper introduces an approach for learning object detectors from real-world web videos known only to contain objects of a target class. We propose a fully automatic pipeline that localizes objects in a set of videos of the class and learns a detector for it. The approach extracts candidate spatio-temporal tubes based on motion segmentation and then selects one tube per video jointly over all videos. To compare to the state of the art, we test our detector on still images, i.e., Pascal VOC 2007. We observe that frames extracted from web videos can differ significantly in terms of quality to still images taken by a good camera. Thus, we formulate the learning from videos as a domain adaptation task. We show that training from a combination of weakly annotated videos and fully annotated still images using domain adaptation improves the performance of a detector trained from still images alone.

Domains

Computer Vision and Pattern Recognition [cs.CV]

Fichier principal

VO.pdf (2.31 Mo)

Origin : Files produced by the author(s)

Alessandro Prest : Connect in order to contact the contributor

https://inria.hal.science/hal-00695940

Submitted on : Monday, July 2, 2012-9:59:33 AM

Last modification on : Thursday, April 4, 2024-9:20:04 PM

Long-term archiving on: Thursday, December 15, 2016-7:44:50 PM

Dates and versions

hal-00695940 , version 1 (10-05-2012)

hal-00695940 , version 2 (02-07-2012)

Identifiers

HAL Id : hal-00695940 , version 2
DOI : 10.1109/CVPR.2012.6248065

Cite

Alessandro Prest, Christian Leistner, Javier Civera, Cordelia Schmid, Vittorio Ferrari. Learning Object Class Detectors from Weakly Annotated Video. CVPR 2012 - Conference on Computer Vision and Pattern Recognition, Jun 2012, Providence, RI, United States. pp.3282-3289, ⟨10.1109/CVPR.2012.6248065⟩. ⟨hal-00695940v2⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 UGA CNRS INRIA IRISA LJK LJK_GI LJK_GI_LEAR INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

597 View

1898 Download

Learning Object Class Detectors from Weakly Annotated Video

Abstract

Domains

Dates and versions

Identifiers

Cite

Export

Collections

Altmetric

Share