Phonetic segmentation of speech signal using local singularity analysis

Vahid Khanagha; Khalid Daoudi; Oriol Pont; Hussein Yahia

Article Dans Une Revue Digital Signal Processing Année : 2014

Phonetic segmentation of speech signal using local singularity analysis

(1) , (1) , (1) , (1)

Vahid Khanagha

Fonction : Auteur
PersonId : 865238

Geometry and Statistics in acquisition data

Khalid Daoudi

Fonction : Auteur
PersonId : 1329075
ORCID : 0000-0003-3536-1060
IdRef : 115483500

Geometry and Statistics in acquisition data

Oriol Pont

Fonction : Auteur
PersonId : 1986
IdHAL : oriolpont
IdRef : 253134374

Geometry and Statistics in acquisition data

Hussein Yahia

Fonction : Auteur
PersonId : 16847
IdHAL : hussein-yahia
ORCID : 0000-0002-4284-096X
IdRef : 031827543

Geometry and Statistics in acquisition data

Résumé

This paper presents the application of a radically novel approach, called the Microcanonical Multiscale Formalism (MMF) to speech analysis. MMF is based on precise estimation of local scaling parameters that describe the inter-scale correlations at each point in the signal domain and provides e cient means for studying local non-linear dynamics of complex signals. In this paper we introduce an e cient way for estimation of these parameters and then, we show that they convey relevant information about local dynamics of the speech signal that can be used for the task of phonetic segmentation. We thus develop a two-stage segmentation algorithm: for the first step, we introduce a new dynamic programming technique to e ciently generate an initial list of phoneme-boundary candidates and in the second step, we use hypothesis testing to refine the initial list of candidates. We present extensive experiments on the full TIMIT database. The results show that our algorithm is significantly more accurate than state-of-the-art ones.

Domaines

Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP]

Fichier principal

DSPsegmRevised.pdf (207.79 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Khalid Daoudi : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01059348

Soumis le : vendredi 29 août 2014-18:03:21

Dernière modification le : jeudi 1 février 2024-10:06:16

Archivage à long terme le : dimanche 30 novembre 2014-11:50:41

Dates et versions

hal-01059348 , version 1 (29-08-2014)

Identifiants

HAL Id : hal-01059348 , version 1

Citer

Vahid Khanagha, Khalid Daoudi, Oriol Pont, Hussein Yahia. Phonetic segmentation of speech signal using local singularity analysis. Digital Signal Processing, 2014. ⟨hal-01059348⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 INRIA IRISA INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM

278 Consultations

650 Téléchargements

Phonetic segmentation of speech signal using local singularity analysis

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager