Extraction of information from video sound tracks - Can we detect simultaneous events?

Michaël A. Betser 1 Guillaume Gravier 1 Rémi Gribonval 1
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract: Detecting and tracking broad sound classes in audio documents is an important step toward structuring them. In complex audio scenes, such as the sound track of a TV broadcast, several classes of sound may be present simultaneously, so it is important to detect such superimposed events. Most methods would require estimating a model for each combination of sound classes to be detected, which is intractable in practice because it requires a large amount of manual labelling. In this paper, we propose and compare several approaches to detecting simultaneous events using only the models of the base classes of interest. Two main approaches are compared: model combination and binary hypothesis tests. The best results are obtained with the model combination approach.
Document type:
Conference paper
Third Int. Workshop on Content-Based Multimedia Indexing, Sep 2003, Rennes, France. pp.71--78, 2003

https://hal.inria.fr/inria-00576209
Contributor: Rémi Gribonval
Submitted on: Sunday, 13 March 2011 - 16:27:13
Last modified on: Thursday, 11 January 2018 - 06:20:09
Document(s) archived on: Tuesday, 14 June 2011 - 02:30:39

File

betser-cbmi03.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00576209, version 1

Citation

Michaël A. Betser, Guillaume Gravier, Rémi Gribonval. Extraction of information from video sound tracks - Can we detect simultaneous events?. Third Int. Workshop on Content-Based Multimedia Indexing, Sep 2003, Rennes, France. pp.71--78, 2003. 〈inria-00576209〉
