Track to the Future: Spatio-temporal Video Segmentation with Long-range Motion Cues

Jose Lezama; Karteek Alahari; Josef Sivic; Ivan Laptev

doi:10.1109/CVPR.2011.6044588

Communication Dans Un Congrès Année : 2011

Track to the Future: Spatio-temporal Video Segmentation with Long-range Motion Cues

(1) , (2, 3) , (2, 3) , (2, 3)

1
2
3

Jose Lezama

Fonction : Auteur

Centre de Mathématiques et de Leurs Applications

Karteek Alahari

Fonction : Auteur
PersonId : 19670
IdHAL : karteek
ORCID : 0000-0002-1838-5936
IdRef : 196283892

Models of visual object recognition and scene understanding

Laboratoire d'informatique de l'école normale supérieure

Josef Sivic

Fonction : Auteur

Models of visual object recognition and scene understanding

Laboratoire d'informatique de l'école normale supérieure

Ivan Laptev

Fonction : Auteur

Models of visual object recognition and scene understanding

Laboratoire d'informatique de l'école normale supérieure

Résumé

Video provides not only rich visual cues such as motion and appearance, but also much less explored long-range temporal interactions among objects. We aim to capture such interactions and to construct a powerful intermediate-level video representation for subsequent recognition. Motivated by this goal, we seek to obtain spatio-temporal oversegmentation of a video into regions that respect object boundaries and, at the same time, associate object pixels over many video frames. The contributions of this paper are two-fold. First, we develop an efficient spatiotemporal video segmentation algorithm, which naturally incorporates long-range motion cues from the past and future frames in the form of clusters of point tracks with coherent motion. Second, we devise a new track clustering cost function that includes occlusion reasoning, in the form of depth ordering constraints, as well as motion similarity along the tracks. We evaluate the proposed approach on a challenging set of video sequences of office scenes from feature length movies.

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV]

Fichier principal

lezama11.pdf (1.07 Mo)

Origine : Fichiers produits par l'(les) auteur(s)

Karteek Alahari : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00817961

Soumis le : jeudi 17 octobre 2013-19:02:25

Dernière modification le : lundi 8 avril 2024-12:24:02

Archivage à long terme le : samedi 18 janvier 2014-02:40:18

Dates et versions

hal-00817961 , version 1 (17-10-2013)

Identifiants

HAL Id : hal-00817961 , version 1
DOI : 10.1109/CVPR.2011.6044588

Citer

Jose Lezama, Karteek Alahari, Josef Sivic, Ivan Laptev. Track to the Future: Spatio-temporal Video Segmentation with Long-range Motion Cues. CVPR - IEEE Conference on Computer Vision and Pattern Recognition, Jun 2011, Colorado Springs, United States. pp.3369 - 3376, ⟨10.1109/CVPR.2011.6044588⟩. ⟨hal-00817961⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENS-PARIS CNRS INRIA ENS-CACHAN INRIA2 PSL ENS-PARIS-SACLAY

484 Consultations

593 Téléchargements

Track to the Future: Spatio-temporal Video Segmentation with Long-range Motion Cues

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager