Comparison of approaches for an efficient phonetic decoding

Luiza Orosanu 1 Denis Jouvet 1
1 PAROLE - Analysis, perception and recognition of speech
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract: This article analyzes the phonetic decoding performance obtained with different choices of linguistic units. The intended application is a communication aid for deaf people, running on an embedded decoder in a portable terminal, which introduces constraints on the model size. As a first step, this paper presents and analyzes the performance of various approaches. Two baseline systems are considered: one relying on a large vocabulary speech recognizer, and another relying on a phonetic n-gram language model. Syllable-based lexicons and language models are then investigated. Various lexicon sizes are studied by setting thresholds on the frequency of occurrence of syllables in the training data. Evaluations are conducted on the ESTER and ETAPE speech corpora. Keeping only the most frequent syllables leads to a limited-size lexicon and language model, which nevertheless provide good phonetic decoding performance. The phone error rate is only 4% worse (absolute) than that obtained with the large vocabulary recognizer, and much better than that obtained with the phone n-gram language model.
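
As an illustration of two notions used in the abstract, the sketch below shows how a lexicon might be restricted to units above a frequency threshold, and how a phone error rate is commonly computed. It is not taken from the paper: the function names `select_frequent_units` and `phone_error_rate` are hypothetical, and the error rate is assumed to follow the usual definition (substitutions, deletions and insertions from a Levenshtein alignment, divided by the number of reference phones).

```python
from collections import Counter


def select_frequent_units(training_units, min_count):
    """Keep the units (e.g. syllables) seen at least min_count times.

    Hypothetical helper: the paper's actual selection procedure may differ.
    """
    counts = Counter(training_units)
    return {unit for unit, count in counts.items() if count >= min_count}


def phone_error_rate(reference, hypothesis):
    """Edit distance between two phone sequences, divided by reference length.

    Assumes the standard definition (S + D + I) / N with unit costs.
    """
    if not reference:
        raise ValueError("reference must contain at least one phone")
    # previous[j]: distance between the reference prefix seen so far
    # and the first j phones of the hypothesis.
    previous = list(range(len(hypothesis) + 1))
    for i, ref_phone in enumerate(reference, start=1):
        current = [i]
        for j, hyp_phone in enumerate(hypothesis, start=1):
            substitution = previous[j - 1] + (ref_phone != hyp_phone)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1] / len(reference)
```

Under these assumptions, an "absolute" difference between two systems is simply the difference between their two `phone_error_rate` values, expressed in percentage points.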
Document type:
Conference paper
InterSpeech - 14th Annual Conference of the International Speech Communication Association - 2013, Aug 2013, Lyon, France. 2013

https://hal.inria.fr/hal-00834284
Contributor: Denis Jouvet
Submitted on: Friday, 25 March 2016 - 17:39:25
Last modified on: Thursday, 11 January 2018 - 06:25:24
Document(s) archived on: Sunday, 26 June 2016 - 15:22:35

File

articleIS2013-Luiza-Orosanu-fi...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00834284, version 1

Citation

Luiza Orosanu, Denis Jouvet. Comparison of approaches for an efficient phonetic decoding. InterSpeech - 14th Annual Conference of the International Speech Communication Association - 2013, Aug 2013, Lyon, France. 2013. 〈hal-00834284〉
