Skip to Main content Skip to Navigation
Conference papers

Prediction of Cepstral Excitation Pulses for Voice Conversion

Abstract : Voice conversion is one of useful techniques to enhance pathological speech to be perceived as normal speech, although it concerns also the modifications of normal source speaker's speech to be perceived as if a target speaker had uttered it. The parameters to be converted are obtained by matching the spectral envelope of the vocal tract for the source and the target speech. Gaussian Mixture Models (GMMs) parameters are determined for providing conversion functions. The main contribution of our study consists in the prediction of Fourier cepstrum coefficients related to the excitation signal. Such a prediction leads to a satisfactory voice conversion system. Subjective perceptual results indicate that the proposed approach yields significant improvements in quality of the converted voice.
Complete list of metadata

Cited literature [19 references]  Display  Hide  Download
Contributor : Joseph Di Martino Connect in order to contact the contributor
Submitted on : Monday, December 10, 2012 - 9:43:12 AM
Last modification on : Tuesday, June 22, 2021 - 3:02:01 PM
Long-term archiving on: : Monday, March 11, 2013 - 11:21:25 AM


Files produced by the author(s)


  • HAL Id : hal-00761776, version 1



Fadoua Bahja, Joseph Di Martino, El Hassan Ibn Elhaj, Driss Aboutajdine. Prediction of Cepstral Excitation Pulses for Voice Conversion. 5th. International Conference on Information Systems and Economic Intelligence - SIIE ̓ 2012, Feb 2012, Djerba, Tunisia. ⟨hal-00761776⟩



Les métriques sont temporairement indisponibles