Skip to Main content Skip to Navigation
Other publications

Efficient multipulse approximation of speech excitation using the most singular manifold

Abstract : We propose a novel approach to find the locations of the multipulse sequence that approximates the speech source excitation. This approach is based on the notion of Most Singular Manifold (MSM) which is associated to the set of less predictable events. The MSM is formed by identifying (directly from the speech waveform) multiscale singularities which may correspond to significant impulsive excitations of the vocal tract. This identification is done through a multiscale measure of local predictability and the estimation of its associated singularity exponents. Once the pulse locations are found using the MSM, their amplitudes are computed using the second stage of the classical MultiPulse Excitation (MPE) coder. The multipulse sequence is then fed to the classical LPC synthesizer to reconstruct speech. The resulting MSM-based algorithm is shown to be significantly more efficient than MPE. We evaluate our algorithm using 1 hour of speech from the TIMIT database and compare its performances to MPE and a recent approach based on compressed sensing (CS). The results show that our algorithm yields similar perceptual quality as MPE and outperforms the CS method when the number of pulses is low.
Complete list of metadatas

https://hal.inria.fr/hal-00684895
Contributor : Vahid Khanagha <>
Submitted on : Monday, June 18, 2012 - 9:46:09 AM
Last modification on : Thursday, March 5, 2020 - 4:49:38 PM
Document(s) archivé(s) le : Wednesday, September 19, 2012 - 2:20:51 AM

File

is2012.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00684895, version 1

Collections

Citation

Vahid Khanagha, Daoudi Khalid. Efficient multipulse approximation of speech excitation using the most singular manifold. 2012. ⟨hal-00684895⟩

Share

Metrics

Record views

280

Files downloads

197