Audiovisual to area and length functions inversion of human tract

Benjamin Elie (1), Yves Laprie (1)
(1) PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This paper proposes a multimodal approach to estimating the area function and the length of the vocal tract for oral vowels. The method is based on an iterative technique that deforms an initial area function until the output acoustic vector matches a specified target. The chosen acoustic vector is the formant frequency pattern. To regularize this ill-posed problem, several constraints are added to the algorithm. First, the lip termination area is estimated with facial capture software. Second, the area function is constrained so that it neither strays too far from a neutral position nor, in dynamic inversion, changes too quickly from one temporal frame to the next. The method proves effective at approximating the area function and the length of the vocal tract for French oral vowels, in both static and dynamic configurations.
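
The idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation, and everything specific in it is an assumption made for illustration: the lossless concatenated-tube acoustic model with an ideal open end at the lips, the log parametrization of areas and length, the finite-difference gradient descent with backtracking, the weights `lam_neutral` and `lam_time`, and the helper names `formants`, `unpack`, `cost`, and `invert`. The lip section area is held fixed, standing in for the value obtained from facial capture.

```python
import numpy as np

C = 35000.0  # speed of sound in cm/s (assumed value)

def formants(areas, length, n_formants=3):
    """First resonances (Hz) of a lossless tube: closed glottis, ideal open lips."""
    freqs = np.arange(2.0, 6000.0, 5.0)
    theta = 2.0 * np.pi * freqs / C * (length / len(areas))
    cos, sin = np.cos(theta), np.sin(theta)
    # Chain-matrix entries C and D (factor j dropped from C), glottis to lips.
    c, d = np.zeros_like(freqs), np.ones_like(freqs)
    for area in areas:
        c, d = c * cos + d * sin * area, d * cos - c * sin / area
    # Formants are the zeros of D(f): find sign changes, refine by linear interpolation.
    i = np.where(d[:-1] * d[1:] < 0)[0][:n_formants]
    return freqs[i] - d[i] * (freqs[i + 1] - freqs[i]) / (d[i + 1] - d[i])

def unpack(x, lip_area):
    """x holds the log-areas of every section except the lips, then the log-length."""
    return np.append(np.exp(x[:-1]), lip_area), np.exp(x[-1])

def cost(x, target, lip_area, x_neutral, x_prev=None, lam_neutral=1e-3, lam_time=1e-3):
    """Relative formant error plus the two regularization terms (assumed weights)."""
    areas, length = unpack(x, lip_area)
    f = formants(areas, length, len(target))
    if len(f) < len(target):
        return np.inf  # too few resonances found: reject this shape
    j = np.sum(((f - target) / target) ** 2)
    j += lam_neutral * np.sum((x - x_neutral) ** 2)   # stay near the neutral shape
    if x_prev is not None:
        j += lam_time * np.sum((x - x_prev) ** 2)     # stay near the previous frame
    return j

def invert(target, lip_area, x_neutral, x_prev=None, n_iter=40, eps=1e-4):
    """Iteratively deform the shape; a step is kept only if it lowers the cost."""
    x = np.array(x_neutral if x_prev is None else x_prev, dtype=float)
    args = (target, lip_area, x_neutral, x_prev)
    j = cost(x, *args)
    for _ in range(n_iter):
        grad = np.array([(cost(x + eps * e, *args) - j) / eps for e in np.eye(len(x))])
        step = 0.5
        while step > 1e-8:  # backtracking line search
            x_new = x - step * grad
            j_new = cost(x_new, *args)
            if j_new < j:
                x, j = x_new, j_new
                break
            step *= 0.5
    return x

# Usage: match the formants of a hypothetical target tube (made-up areas in cm^2).
n = 10
lip_area = 4.0  # stands in for the facial-capture estimate
x_neutral = np.log(np.append(np.full(n - 1, 3.0), 17.5))
true_areas = np.array([1.0, 1.0, 1.5, 2.0, 3.0, 4.0, 5.0, 5.0, 4.5, lip_area])
target = formants(true_areas, 16.5)
x_hat = invert(target, lip_area, x_neutral)
areas_hat, length_hat = unpack(x_hat, lip_area)
```

For dynamic inversion, the same sketch would be called frame by frame, passing the previous frame's solution as `x_prev` so that the temporal term penalizes abrupt changes. Because many area functions share the same first formants, the recovered shape need not equal the one that generated the target; the regularization terms only select a plausible candidate.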

https://hal.inria.fr/hal-01096547
Contributor : Yves Laprie
Submitted on : Wednesday, December 17, 2014 - 4:25:22 PM
Last modification on : Tuesday, December 18, 2018 - 4:38:02 PM
Long-term archiving on : Monday, March 23, 2015 - 3:47:22 PM

File

Eusipco14.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01096547, version 1

Citation

Benjamin Elie, Yves Laprie. Audiovisual to area and length functions inversion of human tract. Eusipco 2014, Sep 2014, Lisbon, Portugal. ⟨hal-01096547⟩
