Adaptation of cepstral coefficients for acoustic-to-articulatory inversion

Julie Busset 1, Yves Laprie 1
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : Acoustic-to-articulatory inversion of speech signals via an analysis-by-synthesis method requires comparing natural and synthetic speech spectra, either indirectly via formant frequencies or directly via cepstral coefficients. This paper investigates several strategies of cepstral adaptation (affine transformation of cepstral coefficients, bilinear or piecewise linear frequency warping) when X-ray images of the speaker's vocal tract are available. These images enable the articulatory synthesis of a speech signal that fits the natural signal as closely as possible. It is thus possible to investigate the behavior of several cepstral adaptation procedures in order to select the best method, i.e. the one that minimizes the deviation between synthetic and natural spectra. Our results show that affine cepstral adaptation tends to flatten the spectral peaks, i.e. the formants. Frequency warping techniques are thus more effective, all the more so as they can be supplemented by taking the spectral tilt into account.
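
To make two of the adaptation strategies named in the abstract more concrete, here is a minimal Python sketch. It is not the authors' implementation: the record does not give the exact formulation used in the paper. The sketch assumes the standard all-pass (bilinear) frequency warping with a single hypothetical parameter `alpha`, and a per-coefficient affine map `c_nat[k] ≈ a[k] * c_syn[k] + b[k]` fitted by least squares. The function names `bilinear_warp` and `fit_affine_per_coefficient` are illustrative only.

```python
import numpy as np


def bilinear_warp(omega, alpha):
    """Warp normalized angular frequencies (radians, 0..pi) with the
    standard first-order all-pass (bilinear) mapping.

    Assumption: |alpha| < 1, so the mapping is monotonic and keeps
    0 and pi fixed. alpha = 0 gives the identity.
    """
    omega = np.asarray(omega, dtype=float)
    return omega + 2.0 * np.arctan2(
        alpha * np.sin(omega), 1.0 - alpha * np.cos(omega)
    )


def fit_affine_per_coefficient(c_syn, c_nat):
    """Least-squares fit of c_nat[:, k] ~= a[k] * c_syn[:, k] + b[k].

    c_syn, c_nat: arrays of shape (n_frames, n_coeffs) holding synthetic
    and natural cepstral vectors for time-aligned frames (assumed layout).
    Returns the slope vector a and the offset vector b.
    """
    c_syn = np.asarray(c_syn, dtype=float)
    c_nat = np.asarray(c_nat, dtype=float)
    mean_syn = c_syn.mean(axis=0)
    mean_nat = c_nat.mean(axis=0)
    # Closed-form simple linear regression, one coefficient index at a time.
    var_syn = ((c_syn - mean_syn) ** 2).mean(axis=0)
    cov = ((c_syn - mean_syn) * (c_nat - mean_nat)).mean(axis=0)
    a = cov / var_syn
    b = mean_nat - a * mean_syn
    return a, b


def apply_affine(c_syn, a, b):
    """Map synthetic cepstral vectors toward the natural speaker's space."""
    return a * np.asarray(c_syn, dtype=float) + b
```

In such a setup, `alpha` (or the pair `a`, `b`) would be chosen to minimize the deviation between adapted synthetic spectra and natural spectra on frames where the vocal tract shape is known, which is the selection criterion the abstract describes.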
Cited literature [11 references]

https://hal.inria.fr/inria-00599108
Contributor : Yves Laprie
Submitted on : Wednesday, June 8, 2011 - 2:58:01 PM
Last modification on : Thursday, January 11, 2018 - 6:19:57 AM
Long-term archiving on : Friday, November 9, 2012 - 2:52:03 PM

File

cepstralAdaptation.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00599108, version 1

Citation

Julie Busset, Yves Laprie. Adaptation of cepstral coefficients for acoustic-to-articulatory inversion. International Seminar on Speech Production 2011 - ISSP'11, Jun 2011, Montréal, Canada. ⟨inria-00599108⟩
