Improving the computational performance of standard GMM-based voice conversion systems used in real-time applications
Conference paper, Year: 2018

Abstract

Voice conversion (VC) can be described as finding a mapping function that transforms the features extracted from a source speaker into those of a target speaker. Gaussian mixture model (GMM) based conversion is the most commonly used technique in VC, but it is prone to overfitting and oversmoothing. To address these issues, we propose a secondary classification: K-means is applied within each class obtained by a primary classification, in order to obtain more precise local conversion functions. This proposal avoids the need for complex training algorithms because the local mapping functions are determined at the same time. The proposed approach consists of a Fourier cepstral analysis, followed by a training phase that finds the local mapping functions which transform the vocal tract characteristics of the source speaker into those of the target speaker. The converted parameters, together with excitation and phase extracted from the target training space using frame index selection, are used in the synthesis step to generate converted speech with target speech characteristics. Objective and subjective experiments show that the proposed technique outperforms the baseline GMM approach while greatly reducing the training and transformation computation times.
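
The two-level classification described in the abstract can be illustrated with a minimal sketch. The abstract does not state which algorithm performs the primary classification or what form the local mapping functions take, so the sketch assumes K-means at both levels and one affine least-squares map per sub-class. All function names (`kmeans`, `nearest`, `fit_affine`, `train`, `convert`) are hypothetical, and the cepstral analysis, frame alignment, and synthesis steps are omitted. It should be read as an illustration under those assumptions, not as the authors' implementation.

```python
import numpy as np

def nearest(X, centroids):
    """Index of the closest centroid (squared Euclidean distance) per row of X."""
    d = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return d.argmin(axis=1)

def kmeans(X, k, n_iter=20, seed=0):
    """Plain Lloyd's K-means; returns only centroids that own at least one point."""
    rng = np.random.default_rng(seed)
    centroids = X[rng.choice(len(X), size=k, replace=False)].copy()
    for _ in range(n_iter):
        labels = nearest(X, centroids)
        for j in range(k):
            if np.any(labels == j):
                centroids[j] = X[labels == j].mean(axis=0)
    return centroids[np.unique(nearest(X, centroids))]

def fit_affine(X, Y):
    """Least-squares affine map W such that [X, 1] @ W approximates Y."""
    Xa = np.hstack([X, np.ones((len(X), 1))])
    W, *_ = np.linalg.lstsq(Xa, Y, rcond=None)
    return W

def train(src, tgt, k1=2, k2=2):
    """src, tgt: time-aligned feature frames of shape (n_frames, dim)."""
    c1 = kmeans(src, k1)                      # primary classification (assumed K-means)
    l1 = nearest(src, c1)
    model = []
    for i in range(len(c1)):
        Xi, Yi = src[l1 == i], tgt[l1 == i]
        c2 = kmeans(Xi, min(k2, len(Xi)), seed=i + 1)   # secondary K-means in class i
        l2 = nearest(Xi, c2)
        # One local mapping function per sub-class (assumed affine).
        maps = [fit_affine(Xi[l2 == j], Yi[l2 == j]) for j in range(len(c2))]
        model.append((c2, maps))
    return c1, model

def convert(frames, c1, model):
    """Map each source frame with the local function of its (class, sub-class)."""
    out = np.empty((len(frames), model[0][1][0].shape[1]))
    l1 = nearest(frames, c1)
    for n, x in enumerate(frames):
        c2, maps = model[l1[n]]
        j = nearest(x[None, :], c2)[0]
        out[n] = np.append(x, 1.0) @ maps[j]
    return out
```

In this sketch, conversion needs only two nearest-centroid lookups and one small matrix product per frame, which is one plausible reading of why such a scheme could be cheaper than evaluating a full GMM posterior for every frame.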

Dates and versions

hal-01886099, version 1 (02-10-2018)

Cite

Imen Ben Othmane, Joseph Di Martino, Kaïs Ouni. Improving the computational performance of standard GMM-based voice conversion systems used in real-time applications. ICECOCS’18 - 1st International Conference on Electronics, Control, Optimization and Computer Science, Dec 2018, Kenitra, Morocco. ⟨10.1109/ICECOCS.2018.8610514⟩. ⟨hal-01886099⟩