Audiovisual integration for sport broadcast structuring

Ewa Kijak 1, Guillaume Gravier 2,*, Lionel Oisel 3, Patrick Gros 1
* Corresponding author
1 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
2 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract: This paper addresses the integration of multimodal features for sport video structure analysis. The method relies on a statistical model that takes into account both the content of each shot and the interleaving of shots. This stochastic modelling is carried out within the framework of Hidden Markov Models (HMMs), which can be applied efficiently to merge audio and visual cues. The approach is validated on tennis videos. The model integrates prior information about tennis content and editing rules. The basic temporal unit is the video shot. Visual features characterize the type of shot view, and audio features describe the audio events within a shot. Two sets of audio features are used in this study: the first is extracted from a manual segmentation of the soundtrack and is the more reliable of the two; the second is produced by an automatic segmentation and classification process. As a result of the overall HMM process, typical tennis scenes are simultaneously segmented and identified. The experiments show that HMM-based fusion improves on indexing with only the best single medium when both media are of similar quality.
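The idea of decoding a shot sequence with an HMM that merges audio and visual cues can be illustrated with a minimal sketch. Everything specific below is an assumption made for illustration and is not taken from the paper: the three scene labels, the shot-view and audio-event vocabularies, all probability values, and the choice to fuse the two modalities by treating them as conditionally independent given the scene. The decoder itself is the standard Viterbi algorithm in the log domain.

```python
import math

# Hypothetical scene labels (hidden states); the paper's actual state set may differ.
STATES = ["rally", "replay", "break"]

# Hypothetical prior and transition probabilities standing in for editing rules.
START = {"rally": 0.6, "replay": 0.1, "break": 0.3}
TRANS = {
    "rally":  {"rally": 0.5, "replay": 0.3, "break": 0.2},
    "replay": {"rally": 0.6, "replay": 0.2, "break": 0.2},
    "break":  {"rally": 0.5, "replay": 0.1, "break": 0.4},
}

# Hypothetical per-modality emission tables: shot view type and dominant audio event.
VISUAL = {
    "rally":  {"global": 0.8, "close": 0.2},
    "replay": {"global": 0.3, "close": 0.7},
    "break":  {"global": 0.1, "close": 0.9},
}
AUDIO = {
    "rally":  {"ball_hits": 0.7, "applause": 0.2, "other": 0.1},
    "replay": {"ball_hits": 0.1, "applause": 0.5, "other": 0.4},
    "break":  {"ball_hits": 0.05, "applause": 0.15, "other": 0.8},
}


def emission_logp(state, shot):
    """Fuse both cues, ASSUMING they are independent given the scene."""
    visual, audio = shot
    return math.log(VISUAL[state][visual]) + math.log(AUDIO[state][audio])


def viterbi(shots):
    """Return the most likely scene label for each (visual, audio) shot."""
    if not shots:
        return []
    # Initialise with the prior and the first shot's fused observation.
    score = {s: math.log(START[s]) + emission_logp(s, shots[0]) for s in STATES}
    backpointers = []
    for shot in shots[1:]:
        new_score, pointers = {}, {}
        for s in STATES:
            # Best predecessor of state s at this shot.
            best_prev = max(STATES, key=lambda p: score[p] + math.log(TRANS[p][s]))
            new_score[s] = (score[best_prev] + math.log(TRANS[best_prev][s])
                            + emission_logp(s, shot))
            pointers[s] = best_prev
        score = new_score
        backpointers.append(pointers)
    # Trace the best path back from the best final state.
    path = [max(STATES, key=lambda s: score[s])]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return path


shots = [("global", "ball_hits"), ("close", "applause"), ("close", "other")]
print(viterbi(shots))
```

Because the decoder labels every shot in a single pass, runs of identical labels give the scene boundaries and the labels give the scene types, which is the sense in which segmentation and identification happen simultaneously. Dropping either term in `emission_logp` turns the same code into a single-modality baseline for comparison.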
Document type:
Journal article
Multimedia Tools and Applications, Springer Verlag, 2006, 30 (3), pp.289-312. 〈http://www.springerlink.com/content/24h61433843r474l/fulltext.pdf〉. 〈10.1007/s11042-006-0031-5〉

https://hal.inria.fr/inria-00568183
Contributor: Patrick Gros
Submitted on: Tuesday, 22 February 2011 - 17:46:35
Last modified on: Friday, 20 July 2018 - 13:42:08

Citation

Ewa Kijak, Guillaume Gravier, Lionel Oisel, Patrick Gros. Audiovisual integration for sport broadcast structuring. Multimedia Tools and Applications, Springer Verlag, 2006, 30 (3), pp.289-312. 〈http://www.springerlink.com/content/24h61433843r474l/fulltext.pdf〉. 〈10.1007/s11042-006-0031-5〉. 〈inria-00568183〉
