Skip to Main content Skip to Navigation
Conference papers

Acoustic-to-articulatory inversion by analysis-by-synthesis using cepstral coefficients

Julie Busset 1 Yves Laprie 1
1 PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This paper deals with acoustic to articulatory inversion of speech by using an analysis by synthesis approach. We used old X-ray films of one speaker to (i) the develop a linear articulatory model presenting a small geometric mismatch with the subject's vocal tract mid sagittal images (ii) and design an adaptation procedure of cepstral vectors used as input data. The adaptation exploits the bilinear transform to warp the frequency scale in order to compensate for deviation between synthetic and natural speech. This enables the comparison of natural speech against synthetic speech without using cepstral liftering. A codebook is used to represent the forward articulatory to acoustic mapping and we designed a loose matching algorithm using spectral peaks to access it. This algorithm, based on dynamic programming, allows some peaks in either synthetic spectra (stored in the codebook) or natural spectra (to be inverted) to be omitted. Quadratic programming is used to improve the acoustic proximity near each good candidate found during codebook exploration. The inversion has been tested on speech signals corresponding to the X-ray films. It achieves a very good geometric precision of 1.5 mm over the whole tongue shape unlike similar works evaluating the error at 3 or 4 points corresponding to sensors located at the front of the tongue.
Complete list of metadata

Cited literature [12 references]  Display  Hide  Download
Contributor : Yves Laprie Connect in order to contact the contributor
Submitted on : Friday, June 21, 2013 - 3:13:50 PM
Last modification on : Saturday, October 16, 2021 - 11:26:08 AM
Long-term archiving on: : Wednesday, April 5, 2017 - 1:39:08 AM


Files produced by the author(s)


  • HAL Id : hal-00836808, version 1



Julie Busset, Yves Laprie. Acoustic-to-articulatory inversion by analysis-by-synthesis using cepstral coefficients. ICA - 21st International Congress on Acoustics - 2013, Jun 2013, Montréal, Canada. ⟨hal-00836808⟩



Les métriques sont temporairement indisponibles