Skip to Main content Skip to Navigation
Conference papers

Segmenting, modeling and matching video clips containing multiple moving objects

Fred Rothganger 1 Svetlana Lazebnik 1 Cordelia Schmid 2, * Jean Ponce 1 
* Corresponding author
2 LEAR - Learning and recognition in vision
GRAVIR - IMAG - Laboratoire d'informatique GRAphique, VIsion et Robotique de Grenoble, Inria Grenoble - Rhône-Alpes, CNRS - Centre National de la Recherche Scientifique : FR71
Abstract : This paper presents a novel representation for dynamic scenes composed of multiple rigid objects that may undergo different motions and be observed by a moving camera. Multi-view constraints associated with groups of affine-invariant scene patches and a normalized description of their appearance are used to segment a scene into its rigid parts, construct three-dimensional protective, affine, and Euclidean models of these parts, and match instances of models recovered from different image sequences. The proposed approach has been implemented, and it is applied to the detection and recognition of moving objects in video sequences and the identification of shots that depict the same scene in a video clip (shot matching).
Document type :
Conference papers
Complete list of metadata

Cited literature [33 references]  Display  Hide  Download
Contributor : THOTH Team Connect in order to contact the contributor
Submitted on : Monday, December 20, 2010 - 9:09:27 AM
Last modification on : Thursday, January 20, 2022 - 4:12:48 PM
Long-term archiving on: : Monday, March 21, 2011 - 3:10:51 AM


Files produced by the author(s)




Fred Rothganger, Svetlana Lazebnik, Cordelia Schmid, Jean Ponce. Segmenting, modeling and matching video clips containing multiple moving objects. IEEE Conference on Computer Vision and Pattern Recognition (CVPR '04), Jun 2004, Washington, United States. pp.914--921, ⟨10.1109/CVPR.2004.1315263⟩. ⟨inria-00548534⟩



Record views


Files downloads