Skip to Main content Skip to Navigation
Master thesis

Human Detection and Action Recognition in Video Sequences - Human Character Recognition in TV-Style Movies

Alexander Klaser 1 
1 LEAR - Learning and recognition in vision
GRAVIR - IMAG - Laboratoire d'informatique GRAphique, VIsion et Robotique de Grenoble, Inria Grenoble - Rhône-Alpes, CNRS - Centre National de la Recherche Scientifique : FR71
Abstract : This master thesis describes a supervised approach to the detection and the identification of humans in TV-style video sequences. In still images and video sequences, humans appear in different poses and views, fully visible and partly occluded, with varying distances to the camera, at different places, under different illumination conditions, etc. This diversity in appearance makes the task of human detection and identification to a particularly challenging problem. A possible solution of this problem is interesting for a wide range of applications such as video surveillance and content-based image and video processing. In order to detect humans in views ranging from full to close-up view and in the presence of clutter and occlusion, they are modeled by an assembly of several upper body parts. For each body part, a detector is trained based on a Support Vector Machine and on densely sampled, SIFT-like feature points in a detection window. For a more robust human detection, localized body parts are assembled using a learned model for geometric relations based on Gaussians. For a flexible human identification, the outward appearance of humans is captured and learned using the Bag-of-Features approach and non-linear Support Vector Machines. Probabilistic votes for each body part are combined to improve classification results. The combined votes yield an identification accuracy of about 80% in our experiments on episodes of the TV series "Buffy the Vampire Slayer". The Bag-of-Features approach has been used in previous work mainly for object classification tasks. Our results show that this approach can also be applied to the identification of humans in video sequences. Despite the difficulty of the given problem, the overall results are good and encourage future work in this direction.
Document type :
Master thesis
Complete list of metadata

Cited literature [5 references]  Display  Hide  Download
Contributor : THOTH Team Connect in order to contact the contributor
Submitted on : Monday, June 6, 2011 - 3:12:52 PM
Last modification on : Wednesday, February 2, 2022 - 3:58:37 PM
Long-term archiving on: : Friday, November 9, 2012 - 2:41:12 PM


  • HAL Id : inria-00598474, version 1



Alexander Klaser. Human Detection and Action Recognition in Video Sequences - Human Character Recognition in TV-Style Movies. Graphics [cs.GR]. 2006. ⟨inria-00598474⟩



Record views


Files downloads