Human Detection and Action Recognition in Video Sequences - Human Character Recognition in TV-Style Movies

Alexander Klaser 1
1 LEAR - Learning and recognition in vision
GRAVIR - IMAG - Graphisme, Vision et Robotique, Inria Grenoble - Rhône-Alpes, CNRS - Centre National de la Recherche Scientifique : FR71
Abstract : This master thesis describes a supervised approach to the detection and the identification of humans in TV-style video sequences. In still images and video sequences, humans appear in different poses and views, fully visible and partly occluded, with varying distances to the camera, at different places, under different illumination conditions, etc. This diversity in appearance makes the task of human detection and identification to a particularly challenging problem. A possible solution of this problem is interesting for a wide range of applications such as video surveillance and content-based image and video processing. In order to detect humans in views ranging from full to close-up view and in the presence of clutter and occlusion, they are modeled by an assembly of several upper body parts. For each body part, a detector is trained based on a Support Vector Machine and on densely sampled, SIFT-like feature points in a detection window. For a more robust human detection, localized body parts are assembled using a learned model for geometric relations based on Gaussians. For a flexible human identification, the outward appearance of humans is captured and learned using the Bag-of-Features approach and non-linear Support Vector Machines. Probabilistic votes for each body part are combined to improve classification results. The combined votes yield an identification accuracy of about 80% in our experiments on episodes of the TV series "Buffy the Vampire Slayer". The Bag-of-Features approach has been used in previous work mainly for object classification tasks. Our results show that this approach can also be applied to the identification of humans in video sequences. Despite the difficulty of the given problem, the overall results are good and encourage future work in this direction.
Type de document :
Mémoires d'étudiants -- Hal-inria+
Graphics [cs.GR]. 2006
Liste complète des métadonnées

Littérature citée [5 références]  Voir  Masquer  Télécharger
Contributeur : Thoth Team <>
Soumis le : lundi 6 juin 2011 - 15:12:52
Dernière modification le : mercredi 11 avril 2018 - 01:53:55
Document(s) archivé(s) le : vendredi 9 novembre 2012 - 14:41:12


  • HAL Id : inria-00598474, version 1




Alexander Klaser. Human Detection and Action Recognition in Video Sequences - Human Character Recognition in TV-Style Movies. Graphics [cs.GR]. 2006. 〈inria-00598474〉



Consultations de la notice


Téléchargements de fichiers