Spatio-Temporal Object Detection Proposals

Dan Oneata 1, Jérôme Revaud 1, Jakob Verbeek 1, Cordelia Schmid 1
1 LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, INPG - Institut National Polytechnique de Grenoble
Abstract : Spatio-temporal detection of actions and events in video is a challenging problem. Besides the difficulties related to recognition, a major challenge for detection in video is the size of the search space defined by spatio-temporal tubes formed by sequences of bounding boxes along the frames. Recently, methods that generate unsupervised detection proposals have proven to be very effective for object detection in still images. These methods make it possible to use strong but computationally expensive features, since only a relatively small number of detection hypotheses need to be assessed. In this paper, we make two contributions towards exploiting detection proposals for spatio-temporal detection problems. First, we extend a recent 2D object proposal method to produce spatio-temporal proposals by a randomized supervoxel merging process. We introduce spatial, temporal, and spatio-temporal pairwise supervoxel features that are used to guide the merging process. Second, we propose a new efficient supervoxel method. We experimentally evaluate our detection proposals in combination with our new supervoxel method as well as existing ones. This evaluation shows that our supervoxels lead to more accurate proposals than existing state-of-the-art supervoxel methods.
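To make the idea of a randomized supervoxel merging process concrete, the following is a minimal sketch and not the authors' implementation. It assumes each supervoxel is summarized by one bounding box per frame, and that neighboring supervoxels are linked by a positive similarity weight. In the paper such weights would come from the pairwise supervoxel features; here they are hypothetical hand-set numbers. The names `merge_boxes` and `random_tube_proposal` are illustrative only.

```python
import random


def merge_boxes(a, b):
    """Smallest box (x0, y0, x1, y1) that encloses both a and b."""
    return (min(a[0], b[0]), min(a[1], b[1]), max(a[2], b[2]), max(a[3], b[3]))


def random_tube_proposal(boxes, edges, n_merges, rng):
    """Grow one spatio-temporal tube by randomized merging (illustrative).

    boxes: {supervoxel_id: {frame: (x0, y0, x1, y1)}}
    edges: {(i, j): weight}, weight > 0 (hypothetical similarity score)
    Returns the merged supervoxel ids and the tube {frame: box}.
    """
    # Build a symmetric adjacency map from the edge list.
    neighbors = {}
    for (i, j), w in edges.items():
        neighbors.setdefault(i, {})[j] = w
        neighbors.setdefault(j, {})[i] = w

    # Start from a random seed supervoxel.
    seed = rng.choice(sorted(boxes))
    members = {seed}
    tube = dict(boxes[seed])

    for _ in range(n_merges):
        # Collect supervoxels adjacent to the current group.
        frontier = {}
        for m in members:
            for n, w in neighbors.get(m, {}).items():
                if n not in members:
                    frontier[n] = max(w, frontier.get(n, 0.0))
        if not frontier:
            break
        # Sample the next supervoxel with probability proportional to weight.
        candidates = sorted(frontier)
        weights = [frontier[c] for c in candidates]
        chosen = rng.choices(candidates, weights=weights, k=1)[0]
        members.add(chosen)
        # Extend the tube frame by frame with the new supervoxel's boxes.
        for frame, box in boxes[chosen].items():
            tube[frame] = merge_boxes(tube[frame], box) if frame in tube else box
    return members, tube


# Toy example: three supervoxels spanning frames 0 to 2.
toy_boxes = {
    0: {0: (0, 0, 2, 2), 1: (0, 0, 2, 2)},
    1: {1: (1, 1, 4, 4), 2: (1, 1, 4, 4)},
    2: {2: (3, 3, 6, 6)},
}
toy_edges = {(0, 1): 1.0, (1, 2): 0.5}
members, tube = random_tube_proposal(toy_boxes, toy_edges, 2, random.Random(0))
```

Running the function many times with different seeds and merge counts would yield a pool of candidate tubes, which is the role detection proposals play: a small set of hypotheses that a more expensive classifier can then score.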
Document type :
Conference papers

https://hal.inria.fr/hal-01021902
Contributor : Thoth Team
Submitted on : Friday, September 26, 2014 - 10:39:06 AM
Last modification on : Monday, December 17, 2018 - 11:22:02 AM
Document(s) archived on : Friday, April 14, 2017 - 12:20:04 PM

Files

proof.pdf
Files produced by the author(s)

Citation

Dan Oneata, Jérôme Revaud, Jakob Verbeek, Cordelia Schmid. Spatio-Temporal Object Detection Proposals. ECCV - European Conference on Computer Vision, Sep 2014, Zurich, Switzerland. pp.737-752, ⟨10.1007/978-3-319-10578-9_48⟩. ⟨hal-01021902v2⟩