An efficient method for the unsupervised discovery of signalling motifs in large audio streams

Armando Muscariello 1 Guillaume Gravier 1 Frédéric Bimbot 1
1 METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract : Providing effective tools to navigate and access through long audio archives, or monitor and classify broadcast streams, proves to be an extremely challenging task. Main issues originate from the varied nature of patterns of interest in a composite audio environment, the massive size of such databases, and the capability of performing when prior knowledge on audio content is scarce or absent. This paper proposes a computational architecture aimed at discovering occurrences of repeating patterns in audio streams by means of unsupervised learning. The targeted repetitions (or motifs) are called signalling, by analogy with a biological nomenclature, as referring to a broad class of audio patterns (as jingles, songs, advertisements, etc...) frequently occurring in broadcast audio. This paper evaluates the applicability of a pattern discovery architecture to a near-duplicate discovery task, that is the task of discovering and collecting occurrences of repeating audio patterns (also called motifs) bearing limited variability, in the absence of prior acoustic or linguistic cues. We adapt a system originally developed for word discovery applications, and demonstrate its effectiveness in a song discovery scenario. The adaption consists in speeding up critical parts of the computations, mostly based on audio feature coarsening, to deal with the large occurrence period of repeating songs in radio streams.
Type de document :
Communication dans un congrès
International Workshop on Content-Based Multimedia Indexing, Jun 2011, Madrid, Spain. 2011
Liste complète des métadonnées

https://hal.inria.fr/inria-00572817
Contributeur : Armando Muscariello <>
Soumis le : mercredi 2 mars 2011 - 11:41:01
Dernière modification le : mercredi 11 avril 2018 - 01:53:37

Identifiants

  • HAL Id : inria-00572817, version 1

Citation

Armando Muscariello, Guillaume Gravier, Frédéric Bimbot. An efficient method for the unsupervised discovery of signalling motifs in large audio streams. International Workshop on Content-Based Multimedia Indexing, Jun 2011, Madrid, Spain. 2011. 〈inria-00572817〉

Partager

Métriques

Consultations de la notice

228