Conference papers

Spatio-Temporal Object Detection Proposals

Dan Oneata 1, Jérôme Revaud 1, Jakob Verbeek 1, Cordelia Schmid 1
1 LEAR - Learning and recognition in vision
Inria Grenoble - Rhône-Alpes, LJK - Laboratoire Jean Kuntzmann, Grenoble INP - Institut polytechnique de Grenoble - Grenoble Institute of Technology
Abstract : Spatio-temporal detection of actions and events in video is a challenging problem. Besides the difficulties related to recognition, a major challenge for detection in video is the size of the search space defined by spatio-temporal tubes, which are formed by sequences of bounding boxes along the frames. Recently, methods that generate unsupervised detection proposals have proven to be very effective for object detection in still images. These methods make it possible to use strong but computationally expensive features, since only a relatively small number of detection hypotheses need to be assessed. In this paper we make two contributions towards exploiting detection proposals for spatio-temporal detection problems. First, we extend a recent 2D object proposal method to produce spatio-temporal proposals by a randomized supervoxel merging process. We introduce spatial, temporal, and spatio-temporal pairwise supervoxel features that are used to guide the merging process. Second, we propose a new efficient supervoxel method. We experimentally evaluate our detection proposals in combination with our new supervoxel method as well as existing ones. This evaluation shows that our supervoxels lead to more accurate proposals than existing state-of-the-art supervoxel methods.
Document type : Conference papers

Contributor : THOTH Team
Submitted on : Friday, September 26, 2014 - 10:39:06 AM
Last modification on : Thursday, January 20, 2022 - 5:30:20 PM
Long-term archiving on : Friday, April 14, 2017 - 12:20:04 PM


Dan Oneata, Jérôme Revaud, Jakob Verbeek, Cordelia Schmid. Spatio-Temporal Object Detection Proposals. ECCV - European Conference on Computer Vision, Sep 2014, Zurich, Switzerland. pp.737-752, ⟨10.1007/978-3-319-10578-9_48⟩. ⟨hal-01021902v2⟩


