Towards robust word discovery by self similarity matrix comparison

Armando Muscariello (1), Guillaume Gravier (1), Frédéric Bimbot (1)
(1) METISS - Speech and sound data modeling and processing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract: Word discovery is the task of discovering and collecting occurrences of repeating words in the absence of prior acoustic and linguistic knowledge or training material. The capability of extracting such patterns (or motifs) represents a preliminary step towards automatic mining of contentful information in spoken documents. The absence of models and training data forces the use of direct pattern matching of speech templates, which, in turn, is sensitive to speech variability, such as inter-speaker variability. In the present work, a variability-tolerant pattern recognition technique is proposed that relies on the comparison of self-similarity matrices of speech sequences. The joint use of this technique and a dynamic time warping (DTW) dissimilarity measure is shown to account for more variability than the DTW-based system alone, as demonstrated on several hours of broadcast news shows.
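The two measures named in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the Euclidean frame distance, the nearest-neighbour resizing of the matrices to a common size, the mean absolute difference used to compare them, and all function names are assumptions made for illustration. The sketch only shows why a self-similarity matrix, which depends on distances between frames of the same sequence, can tolerate a shift that affects every frame equally, while DTW on the raw features cannot.

```python
import numpy as np

def self_similarity_matrix(frames):
    """Pairwise Euclidean distances between all frames of one sequence."""
    diff = frames[:, None, :] - frames[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

def resize_square(matrix, size):
    """Nearest-neighbour resampling of a square matrix to size x size
    (assumed here so that sequences of different lengths are comparable)."""
    n = matrix.shape[0]
    idx = np.round(np.linspace(0, n - 1, size)).astype(int)
    return matrix[np.ix_(idx, idx)]

def ssm_dissimilarity(a, b, size=16):
    """Mean absolute difference between the resized self-similarity matrices."""
    ssm_a = resize_square(self_similarity_matrix(a), size)
    ssm_b = resize_square(self_similarity_matrix(b), size)
    return float(np.abs(ssm_a - ssm_b).mean())

def dtw_dissimilarity(a, b):
    """Classic DTW alignment cost, normalised by the summed sequence lengths."""
    n, m = len(a), len(b)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = np.linalg.norm(a[i - 1] - b[j - 1])
            cost[i, j] = d + min(cost[i - 1, j], cost[i, j - 1], cost[i - 1, j - 1])
    return float(cost[n, m] / (n + m))

# Toy data: 20 frames of 13 hypothetical features, and a copy with a constant
# offset added to every frame (a crude stand-in for a speaker-dependent shift).
rng = np.random.default_rng(0)
template = rng.normal(size=(20, 13))
shifted = template + 5.0

ssm_score = ssm_dissimilarity(template, shifted)
dtw_score = dtw_dissimilarity(template, shifted)
```

In this toy case the self-similarity comparison returns a score near zero because the offset cancels in every within-sequence distance, whereas the DTW score is clearly positive. How the two scores are combined in the actual system is not stated in the abstract and is left open here.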
Document type:
Conference paper
IEEE International Conference on Acoustics, Speech and Signal Processing, May 2011, Prague, Czech Republic. 2011

https://hal.inria.fr/inria-00563418
Contributor: Armando Muscariello
Submitted on: Friday, 4 February 2011 - 19:58:40
Last modified on: Wednesday, 11 April 2018 - 01:56:25

Identifiers

  • HAL Id: inria-00563418, version 1

Citation

Armando Muscariello, Guillaume Gravier, Frédéric Bimbot. Towards robust word discovery by self similarity matrix comparison. IEEE International Conference on Acoustics, Speech and Signal Processing, May 2011, Prague, Czech Republic. 2011. 〈inria-00563418〉
