Acoustic-to-articulatory inversion by analysis-by-synthesis using cepstral coefficients

Julie Busset 1 Yves Laprie 1
1 PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This paper deals with acoustic to articulatory inversion of speech by using an analysis by synthesis approach. We used old X-ray films of one speaker to (i) the develop a linear articulatory model presenting a small geometric mismatch with the subject's vocal tract mid sagittal images (ii) and design an adaptation procedure of cepstral vectors used as input data. The adaptation exploits the bilinear transform to warp the frequency scale in order to compensate for deviation between synthetic and natural speech. This enables the comparison of natural speech against synthetic speech without using cepstral liftering. A codebook is used to represent the forward articulatory to acoustic mapping and we designed a loose matching algorithm using spectral peaks to access it. This algorithm, based on dynamic programming, allows some peaks in either synthetic spectra (stored in the codebook) or natural spectra (to be inverted) to be omitted. Quadratic programming is used to improve the acoustic proximity near each good candidate found during codebook exploration. The inversion has been tested on speech signals corresponding to the X-ray films. It achieves a very good geometric precision of 1.5 mm over the whole tongue shape unlike similar works evaluating the error at 3 or 4 points corresponding to sensors located at the front of the tongue.
Type de document :
Communication dans un congrès
ICA - 21st International Congress on Acoustics - 2013, Jun 2013, Montréal, Canada. 2013
Liste complète des métadonnées

Littérature citée [12 références]  Voir  Masquer  Télécharger
Contributeur : Yves Laprie <>
Soumis le : vendredi 21 juin 2013 - 15:13:50
Dernière modification le : jeudi 11 janvier 2018 - 06:25:24
Document(s) archivé(s) le : mercredi 5 avril 2017 - 01:39:08


Fichiers produits par l'(les) auteur(s)


  • HAL Id : hal-00836808, version 1



Julie Busset, Yves Laprie. Acoustic-to-articulatory inversion by analysis-by-synthesis using cepstral coefficients. ICA - 21st International Congress on Acoustics - 2013, Jun 2013, Montréal, Canada. 2013. 〈hal-00836808〉



Consultations de la notice


Téléchargements de fichiers