Conference paper, Year: 2003

Extraction of information from video sound tracks - Can we detect simultaneous events?

Abstract

Detecting and tracking broad sound classes in audio documents is an important step toward structuring them. In complex audio scenes, such as the sound track of a TV broadcast, several classes of sound may be present simultaneously, so it is important to detect such superimposed events. Most methods would require estimating a model for each combination of sound classes to be detected, which is intractable in practice because it requires a great deal of manual labelling. In this paper, we propose and compare several approaches to detecting simultaneous events using only the models of the base classes of interest. Two main approaches are compared: model combination and binary hypothesis tests. The results show that the model combination approach performs best.
Main file
betser-cbmi03.pdf (94.31 KB)
Origin: Files produced by the author(s)

Dates and versions

inria-00576209, version 1 (13-03-2011)

Identifiers

  • HAL Id: inria-00576209, version 1

Cite

Michaël A. Betser, Guillaume Gravier, Rémi Gribonval. Extraction of information from video sound tracks - Can we detect simultaneous events?. Third Int. Workshop on Content-Based Multimedia Indexing, IRISA, Sep 2003, Rennes, France. pp. 71-78. ⟨inria-00576209⟩
