Weakly Supervised Construction of a Phonetizer for the Iban Language from Malay Resources

Abstract : This paper describes our experiments and results on using a locally dominant language in Malaysia (Malay) to bootstrap automatic speech recognition (ASR) for a very under-resourced language: Iban (also spoken in Malaysia, on the island of Borneo). Resources in Iban for building a speech recognition system were nonexistent. We therefore tried to take advantage of a language from the same family that shares several similarities with it. First, to deal with the pronunciation dictionary, we proposed a bootstrapping strategy to develop an Iban pronunciation lexicon from a Malay one. A hybrid version, a mix of Malay and Iban pronunciations, was also built and evaluated. We then experimented with three Iban ASR systems, each relying on one of the three pronunciation dictionaries: Malay, Iban, or hybrid.
Document type : Conference paper

https://hal.inria.fr/hal-01002921
Contributor : Laurent Besacier
Submitted on : Saturday, June 7, 2014 - 2:42:54 PM
Last modification on : Monday, July 8, 2019 - 3:08:22 PM
Long-term archiving on : Sunday, September 7, 2014 - 10:45:49 AM

File

jep2014-sarah.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01002921, version 1

Citation

Sarah Samson Juan, Laurent Besacier, Solange Rossato. Construction faiblement supervisée d'un phonétiseur pour la langue Iban à partir de ressources en Malais. Journées d'Etude sur la Parole (JEP), Jun 2014, Le Mans, France. ⟨hal-01002921⟩
