Articulatory copy synthesis from cine X-ray films

Abstract : This paper deals with articulatory copy synthesis from X-ray films. The underlying articulatory synthesizer uses an aerodynamic and an acoustic simulation using target area functions, F0 and transition patterns from one area function to the next as input data. The articulators, tongue in particular, have been delineated by hand or semi-automatically from the X-ray films. A specific attention has been paid on the determination of the centerline of the vocal tract from the image and on the coordination between glottal area and vocal tract constrictions since both aspects strongly impact on the acoustics. Experiments show that good quality speech can be resynthesized even if the interval between two images is 40\,ms. The same approach could be easily applied to cine MRI data.
Type de document :
Communication dans un congrès
InterSpeech - 14th Annual Conference of the International Speech Communication Association - 2013, Aug 2013, Lyon, France. 2013
Liste complète des métadonnées

Littérature citée [19 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-00836838
Contributeur : Yves Laprie <>
Soumis le : vendredi 21 juin 2013 - 15:33:45
Dernière modification le : mardi 18 décembre 2018 - 16:38:02
Document(s) archivé(s) le : mercredi 5 avril 2017 - 01:41:21

Fichier

acs.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-00836838, version 1

Citation

Yves Laprie, Matthieu Loosvelt, Shinji Maeda, Rudolph Sock, Fabrice Hirsch. Articulatory copy synthesis from cine X-ray films. InterSpeech - 14th Annual Conference of the International Speech Communication Association - 2013, Aug 2013, Lyon, France. 2013. 〈hal-00836838〉

Partager

Métriques

Consultations de la notice

719

Téléchargements de fichiers

222