Phonetic segmentation of speech signal using local singularity analysis

Abstract : This paper presents the application of a radically novel approach, called the Microcanonical Multiscale Formalism (MMF) to speech analysis. MMF is based on precise estimation of local scaling parameters that describe the inter-scale correlations at each point in the signal domain and provides e cient means for studying local non-linear dynamics of complex signals. In this paper we introduce an e cient way for estimation of these parameters and then, we show that they convey relevant information about local dynamics of the speech signal that can be used for the task of phonetic segmentation. We thus develop a two-stage segmentation algorithm: for the first step, we introduce a new dynamic programming technique to e ciently generate an initial list of phoneme-boundary candidates and in the second step, we use hypothesis testing to refine the initial list of candidates. We present extensive experiments on the full TIMIT database. The results show that our algorithm is significantly more accurate than state-of-the-art ones.
Type de document :
Article dans une revue
Digital Signal Processing, Elsevier, 2014
Liste complète des métadonnées

Littérature citée [34 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01059348
Contributeur : Khalid Daoudi <>
Soumis le : vendredi 29 août 2014 - 18:03:21
Dernière modification le : mercredi 3 janvier 2018 - 14:18:08
Document(s) archivé(s) le : dimanche 30 novembre 2014 - 11:50:41

Fichier

DSPsegmRevised.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01059348, version 1

Collections

Citation

Vahid Khanagha, Khalid Daoudi, Oriol Pont, Hussein Yahia. Phonetic segmentation of speech signal using local singularity analysis. Digital Signal Processing, Elsevier, 2014. 〈hal-01059348〉

Partager

Métriques

Consultations de la notice

353

Téléchargements de fichiers

450