Multimodal acquisition of articulatory data: Geometrical and temporal registration

Michaël Aron 1, Marie-Odile Berger 2, Erwan Kerrien 2, Brigitte Wrobel-Dautcourt 2, Blaise Potard 3, Yves Laprie 4
2 MAGRIT - Visual Augmentation of Complex Environments
Inria Nancy - Grand Est, LORIA - ALGO - Department of Algorithms, Computation, Image and Geometry
3 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
4 MULTISPEECH - Speech Modeling for Facilitating Oral-Based Communication
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract: Acquisition of dynamic articulatory data is of major importance for studying speech production. A single technique is often not enough to cover the whole vocal tract at a sufficient sampling rate. Ultrasound (US) imaging has been proposed as a good acquisition technique for the tongue surface because it offers good temporal sampling, does not alter speech production, and is cheap and widely available. However, it cannot be used alone, and this paper describes a multimodal acquisition system which uses electromagnetography sensors to locate the US probe. The paper focuses in particular on the calibration of the ultrasound modality, which is the key point of the system. This approach enables ultrasound data to be merged with other data. The use of the system is illustrated via an experiment that measures the minimal tongue-to-palate distance in order to evaluate and design Magnetic Resonance Imaging protocols well suited to the acquisition of 3D images of the vocal tract. Compared to the manual registration of acquisition modalities, which is often used in the acquisition of articulatory data, the approach presented relies on automatic techniques that are well founded from geometrical and mathematical points of view.
Cited literature: 24 references

https://hal.inria.fr/hal-01269578
Contributor: Yves Laprie
Submitted on: Friday, February 5, 2016 - 9:33:59 AM
Last modification on: Tuesday, December 18, 2018 - 4:38:02 PM
Long-term archiving on: Saturday, November 12, 2016 - 10:24:03 AM

File

acquisitionSystemJASADepotHAL....
Files produced by the author(s)

Citation

Michaël Aron, Marie-Odile Berger, Erwan Kerrien, Brigitte Wrobel-Dautcourt, Blaise Potard, et al. Multimodal acquisition of articulatory data: Geometrical and temporal registration. Journal of the Acoustical Society of America, Acoustical Society of America, 2016, 139 (2), pp.13. ⟨10.1121/1.4940666⟩. ⟨hal-01269578⟩
