Efficient linear combination for distant n-gram models

David Langlois; Kamel Smaïli; Jean-Paul Haton

Communication Dans Un Congrès Année : 2003

Efficient linear combination for distant n-gram models

(1) , (1) , (1)

David Langlois

Fonction : Auteur
PersonId : 298
IdHAL : david-langlois
IdRef : 070239509

Analysis, perception and recognition of speech

Kamel Smaïli

Fonction : Auteur
PersonId : 2521
IdHAL : kamel-smaili
IdRef : 034429700

Analysis, perception and recognition of speech

Jean-Paul Haton

Fonction : Auteur
PersonId : 830987

Analysis, perception and recognition of speech

Résumé

The objective of this paper is to present a large study concerning the use of distant language models. In order to combine efficiently distant and classical models, an adaptation of the back-off principle is made. Also, we show the importance of each part of a history for the prediction. In fact, each sub-history is analyzed in order to estimate its importance in terms of prediction and then a weight is associated to each class of sub-histories. Therefore, the combined models take into account the features of each history's part and not the whole history as made in other works. The contribution of distant n-gram models in terms of perplexity is significant and improves the results by 12.8%. Making the linear combination depending on sub-histories achieves an improvement of $5.3\%$ in comparison to classical linear combination.

Mots clés

combinaison linéaire modèles distants modélisation statistique du langage distant models statistical language modelisation linear combination

Domaines

Autre [cs.OH]

Fichier principal

eurospeech2003_2.pdf (119.44 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Publications Loria : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00099595

Soumis le : mercredi 22 novembre 2017-11:53:48

Dernière modification le : vendredi 24 mars 2023-14:53:05

Dates et versions

inria-00099595 , version 1 (22-11-2017)

Identifiants

HAL Id : inria-00099595 , version 1

Citer

David Langlois, Kamel Smaïli, Jean-Paul Haton. Efficient linear combination for distant n-gram models. 8th European Conference on Speech Communication and Technology - Eurospeech'03, Sep 2003, Genève, Switzerland. pp.409-412. ⟨inria-00099595⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA

121 Consultations

73 Téléchargements

Efficient linear combination for distant n-gram models

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager