Skip to Main content Skip to Navigation
New interface
Conference papers

Adaptation of cepstral coefficients for acoustic-to-articulatory inversion

Julie Busset 1 Yves Laprie 1 
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : Acoustic-to-articulatory inversion of speech signals via an analysis-by-synthesis method requires the comparison of natural and synthetic speech spectra either indirectly via formant frequencies, or directly via cepstral coefficients. This paper investigates several strategies of cepstral adaptation (affine transformation of cepstral coefficients, bilinear or piecewise linear frequency warping) when X-ray images of the speaker's vocal tract are available. These images enable the articulatory synthesis of a speech signal which fits the natural signal at best. It is thus possible to investigate the behavior of several cepstral adaptation procedures in order to select the best method, i.e. that which minimizes the deviation between synthetic and natural spectra. Our results show that the affine cepstral adaptation tends to flatten the spectral peaks, i.e. formants. Frequency warping techniques are thus more efficient all the more they can be supplemented by taking into account the spectral tilt.
Complete list of metadata

Cited literature [11 references]  Display  Hide  Download
Contributor : Yves Laprie Connect in order to contact the contributor
Submitted on : Wednesday, June 8, 2011 - 2:58:01 PM
Last modification on : Wednesday, February 2, 2022 - 3:51:32 PM
Long-term archiving on: : Friday, November 9, 2012 - 2:52:03 PM


Files produced by the author(s)


  • HAL Id : inria-00599108, version 1



Julie Busset, Yves Laprie. Adaptation of cepstral coefficients for acoustic-to-articulatory inversion. International Seminar on Speech Production 2011 - ISSP'11, Jun 2011, Montréal, Canada. ⟨inria-00599108⟩



Record views


Files downloads