Conference paper, Year: 2019

Learning to Track Any Object

Abstract

Object tracking can be formulated as "finding the right object in a video". We observe that recent approaches for class-agnostic tracking tend to focus on the "finding" part, but largely overlook the "object" part of the task, essentially performing template matching over a frame in a sliding-window fashion. In contrast, class-specific trackers rely heavily on object priors in the form of category-specific object detectors. In this work, we re-purpose category-specific appearance models into a generic objectness prior. Our approach converts a category-specific object detector into a category-agnostic, object-specific detector (i.e. a tracker) efficiently, on the fly. Moreover, at test time the same network can be applied to both detection and tracking, resulting in a unified approach for the two tasks. We achieve state-of-the-art results on two recent large-scale tracking benchmarks (OxUvA and GOT, using external data). By simply adding a mask prediction branch, our approach is able to produce instance segmentation masks for the tracked object. Despite using only box-level information on the first frame, our method outputs high-quality masks, as evaluated on the DAVIS '17 video object segmentation benchmark.
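
The abstract describes the core idea only at a high level. The snippet below is a minimal, hypothetical sketch (in PyTorch, not the authors' code) of the underlying intuition: the per-category outputs of a pretrained detector are collapsed into a generic objectness score, which is then combined with appearance similarity to a first-frame template embedding to score candidate boxes for one specific object. All function names, tensor shapes, and the particular scoring combination are illustrative assumptions.

import torch
import torch.nn.functional as F

def objectness_from_class_logits(class_logits: torch.Tensor) -> torch.Tensor:
    # Collapse per-category logits (N, C) into a generic objectness score (N,).
    # Assumption: column 0 is the background class, so objectness is the
    # probability of being any foreground category.
    probs = F.softmax(class_logits, dim=-1)
    return 1.0 - probs[:, 0]

def object_specific_scores(candidate_feats, template_feat, class_logits):
    # Combine generic objectness with appearance similarity to the first-frame
    # template, yielding an object-specific score for each candidate box.
    objectness = objectness_from_class_logits(class_logits)               # (N,)
    similarity = F.cosine_similarity(candidate_feats,
                                     template_feat.unsqueeze(0), dim=-1)  # (N,)
    return objectness * (similarity + 1.0) / 2.0  # both factors lie in [0, 1]

if __name__ == "__main__":
    N, C, D = 100, 81, 256               # candidates, categories (incl. background), feature dim
    class_logits = torch.randn(N, C)     # box-head logits from a pretrained detector
    candidate_feats = torch.randn(N, D)  # pooled features of candidate boxes
    template_feat = torch.randn(D)       # pooled feature of the first-frame target box
    scores = object_specific_scores(candidate_feats, template_feat, class_logits)
    print("best candidate index:", int(scores.argmax()))

Note that in the paper the detector itself is adapted on the fly rather than its outputs being re-ranked post hoc; this sketch only illustrates the objectness-plus-appearance intuition stated in the abstract.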

Dates and versions

hal-03990657, version 1 (15-02-2023)


Cite

Achal Dave, Pavel Tokmakov, Cordelia Schmid, Deva Ramanan. Learning to Track Any Object. ICCV Workshop on Holistic Video Understanding, Oct 2019, Seoul, South Korea. ⟨hal-03990657⟩