Phonetic segmentation of speech signal using local singularity analysis

Abstract : This paper presents the application of a radically novel approach, called the Microcanonical Multiscale Formalism (MMF) to speech analysis. MMF is based on precise estimation of local scaling parameters that describe the inter-scale correlations at each point in the signal domain and provides e cient means for studying local non-linear dynamics of complex signals. In this paper we introduce an e cient way for estimation of these parameters and then, we show that they convey relevant information about local dynamics of the speech signal that can be used for the task of phonetic segmentation. We thus develop a two-stage segmentation algorithm: for the first step, we introduce a new dynamic programming technique to e ciently generate an initial list of phoneme-boundary candidates and in the second step, we use hypothesis testing to refine the initial list of candidates. We present extensive experiments on the full TIMIT database. The results show that our algorithm is significantly more accurate than state-of-the-art ones.
Liste complète des métadonnées

Cited literature [34 references]  Display  Hide  Download

https://hal.inria.fr/hal-01059348
Contributor : Khalid Daoudi <>
Submitted on : Friday, August 29, 2014 - 6:03:21 PM
Last modification on : Wednesday, November 14, 2018 - 1:54:02 PM
Document(s) archivé(s) le : Sunday, November 30, 2014 - 11:50:41 AM

File

DSPsegmRevised.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01059348, version 1

Collections

Citation

Vahid Khanagha, Khalid Daoudi, Oriol Pont, Hussein Yahia. Phonetic segmentation of speech signal using local singularity analysis. Digital Signal Processing, Elsevier, 2014. ⟨hal-01059348⟩

Share

Metrics

Record views

457

Files downloads

593