Statistical Feature Language Model

Kamel Smaïli; Salma Jamoussi; David Langlois; Jean-Paul Haton

Conference paper. Year: 2004

Kamel Smaïli

Role: Author
PersonId : 2521
IdHAL : kamel-smaili
IdRef : 034429700

Analysis, perception and recognition of speech

Salma Jamoussi

Role: Author

Analysis, perception and recognition of speech

David Langlois

Role: Author
PersonId : 298
IdHAL : david-langlois
IdRef : 070239509

Analysis, perception and recognition of speech

Jean-Paul Haton

Role: Author
PersonId : 830987

Analysis, perception and recognition of speech

Abstract

Statistical language models are widely used in automatic speech recognition to constrain the decoding of a sentence. Most of these models derive from the classical n-gram paradigm. However, the production of a word depends on a large set of linguistic features: lexical, syntactic, semantic, etc. Moreover, in some natural languages, the gender and number of the left context affect the production of the next word. It therefore seems attractive to design a language model based on a variety of word features. In this paper we present a new statistical language model based on this idea, called the Statistical Feature Language Model (SFLM). In SFLM, a word is treated as an array of linguistic features, and the model is defined in a way similar to the n-gram model. Experiments carried out for French show an improvement in terms of perplexity and predicted words.
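As a rough illustration of the idea in the abstract, the sketch below treats each word as a tuple of features and estimates bigram probabilities over those tuples by relative frequency. The feature set (word form, part of speech, gender, number), the function names `train_bigram` and `bigram_prob`, and the unsmoothed estimator are all assumptions made for this example; the record does not describe the features or the estimation method the authors actually use.

```python
from collections import Counter
from typing import List, Tuple

# Hypothetical feature array: (word form, part of speech, gender, number).
# This particular choice of features is an assumption, not taken from the paper.
Feature = Tuple[str, str, str, str]


def train_bigram(sentences: List[List[Feature]]) -> Tuple[Counter, Counter]:
    """Count feature-array histories and feature-array bigrams."""
    history_counts: Counter = Counter()
    bigram_counts: Counter = Counter()
    for sentence in sentences:
        # Pair each feature array with the one that follows it.
        for prev, cur in zip(sentence, sentence[1:]):
            history_counts[prev] += 1
            bigram_counts[(prev, cur)] += 1
    return history_counts, bigram_counts


def bigram_prob(prev: Feature, cur: Feature,
                history_counts: Counter, bigram_counts: Counter) -> float:
    """Relative-frequency estimate of P(cur | prev); 0.0 for an unseen history."""
    if history_counts[prev] == 0:
        return 0.0
    return bigram_counts[(prev, cur)] / history_counts[prev]


# Toy French corpus, invented for illustration only.
LA = ("la", "DET", "f", "sg")
PETITE = ("petite", "ADJ", "f", "sg")
MAISON = ("maison", "NOUN", "f", "sg")
FILLE = ("fille", "NOUN", "f", "sg")
corpus = [[LA, PETITE, MAISON], [LA, PETITE, FILLE]]

history_counts, bigram_counts = train_bigram(corpus)
print(bigram_prob(LA, PETITE, history_counts, bigram_counts))
print(bigram_prob(PETITE, MAISON, history_counts, bigram_counts))
```

Because the history is a whole feature array rather than a bare word form, gender and number in the left context condition the next word directly. A practical model would also need smoothing or back-off for unseen feature combinations, which this sketch omits.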

Keywords

automatic speech recognition; statistical language modeling

Domains

Other [cs.OH]

Main file

salma1.pdf (51.28 KB)

Origin: Files produced by the author(s)

https://inria.hal.science/inria-00100021

Submitted on: Tuesday, 21 November 2017, 23:46:30

Last modified on: Friday, 24 March 2023, 14:53:05

Dates and versions

inria-00100021 , version 1 (21-11-2017)

Identifiers

HAL Id : inria-00100021 , version 1

Cite

Kamel Smaïli, Salma Jamoussi, David Langlois, Jean-Paul Haton. Statistical Feature Language Model. 8th International Conference on Spoken Language Processing - ICSLP' 2004, 2004, Jeju, South Korea. 4 p. ⟨inria-00100021⟩

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA
