Enhancement of esophageal speech obtained by a voice conversion technique using time dilated Fourier cepstra - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue International Journal of Speech Technology Année : 2018

Enhancement of esophageal speech obtained by a voice conversion technique using time dilated Fourier cepstra

Résumé

This paper presents a novel speaking-aid system for enhancing esophageal speech (ES). The method adopted in this paper aims to improve the quality of esophageal speech using a combination of a voice conversion technique and a time dilation algorithm. In the proposed system, a Deep Neural Network (DNN) is used as a nonlinear mapping function for vocal tract vector transformation. Then the converted frames are used to determine realistic excitation and phase vectors from the target training space using a frame selection algorithm. Next, in order to preserve speaker identity of the esophageal speakers, we use the source vocal tract features and propose to apply on them a time dilation algorithm to reduce the unpleasant esophageal noises. Finally the converted speech is reconstructed using the dilated source vocal tract frames and the predicted excitation and phase. Deep Neural Network (DNN) and Gaussian Mixture model (GMM) based voice conversion systems have been evaluated using objective and subjective measures. Such an experimental study has been realized also in order to evaluate the changes in speech quality and intelligibility of the transformed signals. Experimental results demonstrate that the proposed methods provide considerable improvement in intelligibility and naturalness of the converted esophageal speech.
Fichier non déposé

Dates et versions

hal-01954096 , version 1 (13-12-2018)

Identifiants

Citer

Imen Ben Othmane, Joseph Di Martino, Kaïs Ouni. Enhancement of esophageal speech obtained by a voice conversion technique using time dilated Fourier cepstra. International Journal of Speech Technology, 2018, 22 (1), pp.99-110. ⟨10.1007/s10772-018-09579-1⟩. ⟨hal-01954096⟩
72 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More