Articulatory copy synthesis from cine X-ray films

Abstract : This paper deals with articulatory copy synthesis from X-ray films. The underlying articulatory synthesizer runs an aerodynamic and an acoustic simulation and takes as input target area functions, F0, and transition patterns from one area function to the next. The articulators, the tongue in particular, were delineated by hand or semi-automatically from the X-ray films. Particular attention was paid to determining the centerline of the vocal tract from the image and to the coordination between glottal area and vocal tract constrictions, since both aspects strongly affect the acoustics. Experiments show that good-quality speech can be resynthesized even when the interval between two images is 40 ms. The same approach could easily be applied to cine MRI data.
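
To make the idea of a transition between target area functions concrete, here is a minimal sketch that fills the 40 ms gap between successive X-ray frames with intermediate area functions. It assumes plain linear interpolation, which is an illustrative stand-in: the abstract does not specify the shape of the transition patterns the synthesizer actually uses. The function name, the 5 ms step, and the two-section example values are likewise hypothetical.

```python
FRAME_INTERVAL_S = 0.040  # interval between two X-ray images, as stated in the abstract


def interpolate_area_functions(targets, frame_interval_s=FRAME_INTERVAL_S, step_s=0.005):
    """Return area functions sampled every step_s seconds.

    targets: list of area functions, one per image; each is a list of
             cross-sectional areas along the vocal tract (all the same length).
    Assumption: linear transitions between successive targets. The paper's
    actual transition patterns may differ.
    """
    if len(targets) < 2:
        raise ValueError("need at least two target area functions")
    steps_per_frame = round(frame_interval_s / step_s)
    frames = []
    for current, following in zip(targets, targets[1:]):
        for i in range(steps_per_frame):
            w = i / steps_per_frame  # 0 at the current target, approaching 1 at the next
            frames.append([(1 - w) * a + w * b for a, b in zip(current, following)])
    frames.append(list(targets[-1]))  # close the sequence on the last target
    return frames


# Two hypothetical targets with two tube sections each (areas in cm^2).
frames = interpolate_area_functions([[1.0, 2.0], [3.0, 0.0]])
print(len(frames), frames[4])
```

With the default settings, each 40 ms interval is divided into eight 5 ms steps, so two targets yield nine area functions, and the middle one lies halfway between the two targets.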

Cited literature [19 references]

https://hal.inria.fr/hal-00836838
Contributor : Yves Laprie
Submitted on : Friday, June 21, 2013 - 3:33:45 PM
Last modification on : Thursday, June 6, 2019 - 2:48:13 PM
Long-term archiving on : Wednesday, April 5, 2017 - 1:41:21 AM

File

acs.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00836838, version 1

Citation

Yves Laprie, Matthieu Loosvelt, Shinji Maeda, Rudolph Sock, Fabrice Hirsch. Articulatory copy synthesis from cine X-ray films. InterSpeech - 14th Annual Conference of the International Speech Communication Association - 2013, Aug 2013, Lyon, France. ⟨hal-00836838⟩
