Acoustic-to-articulatory inversion by analysis-by-synthesis using cepstral coefficients

Julie Busset 1, Yves Laprie 1
1 PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This paper deals with acoustic-to-articulatory inversion of speech using an analysis-by-synthesis approach. We used old X-ray films of one speaker to (i) develop a linear articulatory model that presents only a small geometric mismatch with the subject's mid-sagittal vocal tract images and (ii) design an adaptation procedure for the cepstral vectors used as input data. The adaptation exploits the bilinear transform to warp the frequency scale in order to compensate for deviations between synthetic and natural speech. This enables natural speech to be compared against synthetic speech without cepstral liftering. A codebook represents the forward articulatory-to-acoustic mapping, and we designed a loose matching algorithm using spectral peaks to access it. This algorithm, based on dynamic programming, allows some peaks to be omitted in either the synthetic spectra (stored in the codebook) or the natural spectra (to be inverted). Quadratic programming is then used to improve the acoustic proximity near each good candidate found during codebook exploration. The inversion has been tested on the speech signals corresponding to the X-ray films. It achieves a very good geometric precision of 1.5 mm over the whole tongue shape, unlike similar works, which evaluate the error at only 3 or 4 points corresponding to sensors located at the front of the tongue.
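
The loose peak matching mentioned in the abstract can be illustrated with a generic dynamic-programming alignment. The sketch below is not the authors' algorithm: the abstract gives neither the cost function nor the omission penalty, so the absolute frequency difference, the `skip_cost` value, and the function name `loose_peak_match` are all assumptions chosen for illustration.

```python
def loose_peak_match(synthetic_peaks, natural_peaks, skip_cost=300.0):
    """Align two ascending lists of peak frequencies (Hz), allowing omissions.

    Hypothetical sketch: matching two peaks costs their absolute frequency
    difference, and omitting a peak on either side costs `skip_cost`
    (an assumed penalty, not a value taken from the paper).
    Returns (total_cost, pairs), where pairs holds (synthetic, natural) matches.
    """
    n, m = len(synthetic_peaks), len(natural_peaks)
    # cost[i][j]: best cost of aligning the first i synthetic peaks
    # with the first j natural peaks.
    cost = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i * skip_cost  # every synthetic peak omitted
    for j in range(1, m + 1):
        cost[0][j] = j * skip_cost  # every natural peak omitted

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diff = abs(synthetic_peaks[i - 1] - natural_peaks[j - 1])
            cost[i][j] = min(
                cost[i - 1][j - 1] + diff,      # match the two peaks
                cost[i - 1][j] + skip_cost,     # omit a synthetic peak
                cost[i][j - 1] + skip_cost,     # omit a natural peak
            )

    # Backtrack to recover which peaks were matched.
    pairs = []
    i, j = n, m
    while i > 0 and j > 0:
        diff = abs(synthetic_peaks[i - 1] - natural_peaks[j - 1])
        if cost[i][j] == cost[i - 1][j - 1] + diff:
            pairs.append((synthetic_peaks[i - 1], natural_peaks[j - 1]))
            i -= 1
            j -= 1
        elif cost[i][j] == cost[i - 1][j] + skip_cost:
            i -= 1
        else:
            j -= 1
    pairs.reverse()
    return cost[n][m], pairs


# Illustrative peak lists (made-up values, not data from the paper).
total, pairs = loose_peak_match([500, 1500, 2500, 3500], [520, 2450, 3600])
```

With these made-up peak lists, the synthetic peak at 1500 Hz has no natural counterpart and is omitted, while the other three peaks are paired. In a codebook lookup, a low total cost would flag the stored entry as a candidate for further refinement.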

Cited literature [12 references]

https://hal.inria.fr/hal-00836808
Contributor : Yves Laprie
Submitted on : Friday, June 21, 2013 - 3:13:50 PM
Last modification on : Tuesday, December 18, 2018 - 4:38:02 PM
Document(s) archived on : Wednesday, April 5, 2017 - 1:39:08 AM

File

inversioWithAbstract.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00836808, version 1

Citation

Julie Busset, Yves Laprie. Acoustic-to-articulatory inversion by analysis-by-synthesis using cepstral coefficients. ICA - 21st International Congress on Acoustics - 2013, Jun 2013, Montréal, Canada. ⟨hal-00836808⟩

Metrics

Record views : 409
File downloads : 227