Audiovisual integration for sport broadcast structuring - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue Multimedia Tools and Applications Année : 2006

Audiovisual integration for sport broadcast structuring

Ewa Kijak
Lionel Oisel
  • Fonction : Auteur
  • PersonId : 892537
Patrick Gros

Résumé

This paper focuses on the integration of multimodal features for sport video structure analysis. The method relies on a statistical model which takes into account both the shot content and the interleaving of shots. This stochastic modelling is performed in the global framework of Hidden Markov Models (HMMs) that can be efficiently applied to merge audio and visual cues. Our approach is validated in the particular domain of tennis videos. The model integrates prior information about tennis content and editing rules. The basic temporal unit is the video shot. Visual features are used to characterize the type of shot view. Audio features describe the audio events within a video shot. Two sets of audio features are used in this study: the first one is extracted from a manual segmentation of the soundtrack and is more reliable. The second one is provided by an automatic segmentation and classification process. As a result of the overall HMM process, typical tennis scenes are simultaneously segmented and identified. The experiments illustrate the improvement of HMM-based fusion over indexing using only the best single media, when both media are of similar quality.

Dates et versions

inria-00568183 , version 1 (22-02-2011)

Identifiants

Citer

Ewa Kijak, Guillaume Gravier, Lionel Oisel, Patrick Gros. Audiovisual integration for sport broadcast structuring. Multimedia Tools and Applications, 2006, 30 (3), pp.289-312. ⟨10.1007/s11042-006-0031-5⟩. ⟨inria-00568183⟩
148 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More