Stochastic Models for Multimodal Video Analysis

Emmanouil Delakis; Guillaume Gravier; Patrick Gros

doi:10.1007/978-0-387-76316-3_3

Chapitre D'ouvrage Année : 2008

Stochastic Models for Multimodal Video Analysis

(1) , (1) , (1)

Emmanouil Delakis

Fonction : Auteur
PersonId : 880274

Multimedia content-based indexing

Guillaume Gravier

Fonction : Auteur
PersonId : 1046
IdHAL : guig
ORCID : 0000-0002-2266-5682
IdRef : 110355415

Multimedia content-based indexing

Patrick Gros

Fonction : Auteur
PersonId : 894
IdHAL : patrick-gros
IdRef : 075986604

Multimedia content-based indexing

Résumé

This chapter presents video indexing with segment models (SM), aiming at a more efficient and versatile multimodal fusion. In segment models, synchrony constraints between modalities can be relaxed to the scene boundaries, thus enabling to process each modality with their native sampling rates and models within each scene. We illustrate the many possibilities of audiovisual integration that SM can offer in the context of tennis video structuring. We first briefly review stochastic models that have been used for multimodal video analysis. We then present the task of tennis video structuring and the cues and related features that we want to incorporate in a stochastic model. We show how HMM can be used for multimodal integration before generalizing the HMM approach based on the segment model framework. We finally show that the hierarchical structure of a tennis video can be taken into consideration in both frameworks and present a new decoding algorithm to take into account textual score information displayed on screen.

Domaines

Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP]

Patrick Gros : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00770993

Soumis le : lundi 7 janvier 2013-20:17:11

Dernière modification le : vendredi 24 mars 2023-14:52:56

Dates et versions

hal-00770993 , version 1 (07-01-2013)

Identifiants

HAL Id : hal-00770993 , version 1
DOI : 10.1007/978-0-387-76316-3_3

Citer

Emmanouil Delakis, Guillaume Gravier, Patrick Gros. Stochastic Models for Multimodal Video Analysis. Maragos, Petros and Potamianos, Alexandros and Gros, Patrick. Multimodal Processing and Interaction, 33, Springer, pp.89-107, 2008, 978-0-387-76315-6. ⟨10.1007/978-0-387-76316-3_3⟩. ⟨hal-00770993⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EC-PARIS UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA IRISA-D6 INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES INSA-GROUPE UR1-MATH-NUM

119 Consultations

0 Téléchargements

Stochastic Models for Multimodal Video Analysis

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager