Audiovisual integration for sport broadcast structuring

Ewa Kijak 1, Guillaume Gravier 2,*, Lionel Oisel 3, Patrick Gros 1
* Corresponding author
1 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
2 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract: This paper focuses on the integration of multimodal features for sport video structure analysis. The method relies on a statistical model that takes into account both the shot content and the interleaving of shots. This stochastic modelling is performed in the general framework of Hidden Markov Models (HMMs), which can be applied efficiently to merge audio and visual cues. Our approach is validated in the particular domain of tennis videos. The model integrates prior information about tennis content and editing rules. The basic temporal unit is the video shot. Visual features are used to characterize the type of shot view. Audio features describe the audio events within a video shot. Two sets of audio features are used in this study: the first is extracted from a manual segmentation of the soundtrack and is more reliable; the second is provided by an automatic segmentation and classification process. As a result of the overall HMM process, typical tennis scenes are simultaneously segmented and identified. The experiments illustrate the improvement of HMM-based fusion over indexing using only the best single medium, when both media are of similar quality.
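The fusion idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes that each shot is summarized by one visual label and one audio label, that the two cues are conditionally independent given the scene, and that the most likely scene sequence is found by Viterbi decoding. The scene names (rally, replay, break), the cue labels and all probabilities are hypothetical values chosen only for illustration.

```python
import math

# Hypothetical scene labels and toy probabilities (illustration only,
# not taken from the paper).
STATES = ["rally", "replay", "break"]

START = {"rally": 0.6, "replay": 0.1, "break": 0.3}
TRANS = {  # P(next scene | current scene): stands in for editing rules
    "rally":  {"rally": 0.5, "replay": 0.3, "break": 0.2},
    "replay": {"rally": 0.6, "replay": 0.2, "break": 0.2},
    "break":  {"rally": 0.5, "replay": 0.1, "break": 0.4},
}
VISUAL = {  # P(type of shot view | scene)
    "rally":  {"global": 0.8, "closeup": 0.2},
    "replay": {"global": 0.3, "closeup": 0.7},
    "break":  {"global": 0.1, "closeup": 0.9},
}
AUDIO = {  # P(audio event | scene)
    "rally":  {"ball_hit": 0.7, "applause": 0.2, "other": 0.1},
    "replay": {"ball_hit": 0.2, "applause": 0.3, "other": 0.5},
    "break":  {"ball_hit": 0.05, "applause": 0.35, "other": 0.6},
}


def emission_logp(state, shot):
    """Fuse both cues: log P(visual | state) + log P(audio | state).

    Assumes the cues are conditionally independent given the scene.
    """
    visual, audio = shot
    return math.log(VISUAL[state][visual]) + math.log(AUDIO[state][audio])


def viterbi(shots):
    """Return the most likely scene label for each (visual, audio) shot."""
    if not shots:
        return []
    # Log-score of the best path ending in each state after the first shot.
    score = {s: math.log(START[s]) + emission_logp(s, shots[0]) for s in STATES}
    backpointers = []
    for shot in shots[1:]:
        new_score, pointer = {}, {}
        for s in STATES:
            # Best predecessor for state s at this shot.
            prev = max(STATES, key=lambda p: score[p] + math.log(TRANS[p][s]))
            new_score[s] = (score[prev] + math.log(TRANS[prev][s])
                            + emission_logp(s, shot))
            pointer[s] = prev
        backpointers.append(pointer)
        score = new_score
    # Trace back from the best final state.
    path = [max(STATES, key=lambda s: score[s])]
    for pointer in reversed(backpointers):
        path.append(pointer[path[-1]])
    path.reverse()
    return path


shots = [("global", "ball_hit"), ("closeup", "other")]
print(viterbi(shots))
```

Because decoding assigns one label per shot, runs of identical labels delimit scenes, so segmentation and identification come out of the same pass. Setting one cue's table to uniform values turns the same function into a single-medium baseline for comparison.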
Document type: Journal articles
Contributor: Patrick Gros
Submitted on: Tuesday, February 22, 2011 - 5:46:35 PM
Last modification on: Thursday, January 20, 2022 - 5:30:25 PM

Ewa Kijak, Guillaume Gravier, Lionel Oisel, Patrick Gros. Audiovisual integration for sport broadcast structuring. Multimedia Tools and Applications, 2006, 30 (3), pp.289-312. ⟨10.1007/s11042-006-0031-5⟩. ⟨inria-00568183⟩