Audiovisual to area and length functions inversion of human tract

Benjamin Elie 1 Yves Laprie 1
1 PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This paper proposes a multimodal approach to estimate the area function and the length of the vocal tract of oral vowels. The method is based on an iterative technique consisting in deforming an initial area function so that the output acoustic vector matches a specified target. The chosen acoustic vector is the formant frequency pattern. In order to regularize the ill-problem, several constraints are added to the algorithm. First, the lip termination area is estimated via a facial capture software. Then, the area function is constrained in such a way that it does not get too far from a neutral position, and it does not change too quickly from a temporal frame to the next, when dealing with dynamic inversion. The method proves to be efficient to approximate the area function and the length of the vocal tract for oral french vowels, both in static and dynamic configurations.
Type de document :
Communication dans un congrès
Eusipco 2014, Sep 2014, Lisbonne, Portugal. 2014
Liste complète des métadonnées

Littérature citée [17 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01096547
Contributeur : Yves Laprie <>
Soumis le : mercredi 17 décembre 2014 - 16:25:22
Dernière modification le : jeudi 11 janvier 2018 - 06:25:24
Document(s) archivé(s) le : lundi 23 mars 2015 - 15:47:22

Fichier

Eusipco14.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01096547, version 1

Collections

Citation

Benjamin Elie, Yves Laprie. Audiovisual to area and length functions inversion of human tract. Eusipco 2014, Sep 2014, Lisbonne, Portugal. 2014. 〈hal-01096547〉

Partager

Métriques

Consultations de la notice

213

Téléchargements de fichiers

185