Comparison and Analysis of Several Phonetic Decoding Approaches

Luiza Orosanu; Denis Jouvet

Communication Dans Un Congrès Année : 2013

Comparison and Analysis of Several Phonetic Decoding Approaches

(1) , (1)

Luiza Orosanu

Fonction : Auteur

Analysis, perception and recognition of speech

Denis Jouvet

Fonction : Auteur
PersonId : 15904
IdHAL : denis-jouvet
IdRef : 029418666

Analysis, perception and recognition of speech

Résumé

This article analyzes the phonetic decoding performance obtained with different choices of linguistic units. The context is to later use such an approach as a support for helping communication with deaf people, and to run it on an embedded decoder on a portable terminal, which introduces constrains on the model size. As a first step, this paper compares the performance of various approaches on the ESTER2 and ETAPE speech corpora. Two baseline systems are considered, one relying on a large vocabulary speech recognizer, and another one relying on a phonetic n-gram language model. The third model which relies on a syllable-based lexicon and a trigram language model, provides a good tradeoff between model size and phonetic decoding performance. The phone error rate is only 4% worse (absolute) than the phone error rate obtained with the large vocabulary recognizer, and much better than the phone error rate obtained with the phone n-gram language model. Phone error rates are then analyzed with respect to SNR and speaking rate.

Mots clés

syllables deaf speech recognition embedded system

Domaines

Traitement du signal et de l'image [eess.SP] Traitement du signal et de l'image [eess.SP]

Fichier principal

articleTSD2013_#98-Luiza-Orosanu-final.pdf (165.12 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Denis Jouvet : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00834313

Soumis le : vendredi 25 mars 2016-17:35:58

Dernière modification le : lundi 11 septembre 2023-17:41:19

Archivage à long terme le : dimanche 26 juin 2016-15:22:40

Dates et versions

hal-00834313 , version 1 (25-03-2016)

Identifiants

HAL Id : hal-00834313 , version 1

Citer

Luiza Orosanu, Denis Jouvet. Comparison and Analysis of Several Phonetic Decoding Approaches. TSD - 16th International Conference on Text, Speech and Dialogue - 2013, Sep 2013, Pilsen, Czech Republic. pp.161-168. ⟨hal-00834313⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD

173 Consultations

88 Téléchargements

Comparison and Analysis of Several Phonetic Decoding Approaches

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager