Learning Object Class Detectors from Weakly Annotated Video

Abstract: Object detectors are typically trained on a large set of still images annotated with bounding-boxes. This paper introduces an approach for learning object detectors from real-world web videos known only to contain objects of a target class. We propose a fully automatic pipeline that localizes objects in a set of videos of the class and learns a detector for it. The approach extracts candidate spatio-temporal tubes based on motion segmentation and then selects one tube per video jointly over all videos. To compare to the state of the art, we test our detector on still images, i.e., Pascal VOC 2007. We observe that frames extracted from web videos can differ significantly in quality from still images taken with a good camera. Thus, we formulate learning from videos as a domain adaptation task. We show that training on a combination of weakly annotated videos and fully annotated still images using domain adaptation improves the performance of a detector trained on still images alone.
Document type: Conference papers

Cited literature: 37 references

Contributor: Alessandro Prest
Submitted on: Monday, July 2, 2012 - 9:59:33 AM
Last modification on: Tuesday, March 5, 2019 - 9:30:11 AM
Long-term archiving on: Thursday, December 15, 2016 - 7:44:50 PM
Alessandro Prest, Christian Leistner, Javier Civera, Cordelia Schmid, Vittorio Ferrari. Learning Object Class Detectors from Weakly Annotated Video. CVPR 2012 - Conference on Computer Vision and Pattern Recognition, Jun 2012, Providence, RI, United States. pp.3282-3289, ⟨10.1109/CVPR.2012.6248065⟩. ⟨hal-00695940v2⟩