Efficient multipulse approximation of speech excitation using the most singular manifold

Vahid Khanagha; Daoudi Khalid

Autre Publication Année : 2012

Efficient multipulse approximation of speech excitation using the most singular manifold

(1) , (1)

Vahid Khanagha

Fonction : Auteur
PersonId : 865238

Geometry and Statistics in acquisition data

Daoudi Khalid

Fonction : Auteur

Geometry and Statistics in acquisition data

Résumé

We propose a novel approach to find the locations of the multipulse sequence that approximates the speech source excitation. This approach is based on the notion of Most Singular Manifold (MSM) which is associated to the set of less predictable events. The MSM is formed by identifying (directly from the speech waveform) multiscale singularities which may correspond to significant impulsive excitations of the vocal tract. This identification is done through a multiscale measure of local predictability and the estimation of its associated singularity exponents. Once the pulse locations are found using the MSM, their amplitudes are computed using the second stage of the classical MultiPulse Excitation (MPE) coder. The multipulse sequence is then fed to the classical LPC synthesizer to reconstruct speech. The resulting MSM-based algorithm is shown to be significantly more efficient than MPE. We evaluate our algorithm using 1 hour of speech from the TIMIT database and compare its performances to MPE and a recent approach based on compressed sensing (CS). The results show that our algorithm yields similar perceptual quality as MPE and outperforms the CS method when the number of pulses is low.

Domaines

Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP]

Fichier principal

is2012.pdf (98.33 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Vahid Khanagha : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00684895

Soumis le : lundi 18 juin 2012-09:46:09

Dernière modification le : mercredi 15 mars 2023-08:50:07

Archivage à long terme le : mercredi 19 septembre 2012-02:20:51

Dates et versions

hal-00684895 , version 1 (18-06-2012)

Identifiants

HAL Id : hal-00684895 , version 1

Citer

Vahid Khanagha, Daoudi Khalid. Efficient multipulse approximation of speech excitation using the most singular manifold. 2012. ⟨hal-00684895⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

INRIA INRIA2

119 Consultations

129 Téléchargements

Efficient multipulse approximation of speech excitation using the most singular manifold

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager