Skip to Main content Skip to Navigation
Conference papers

A New Based Distance Language Model for a Dictation Machine: application to MAUD

David Langlois 1 Kamel Smaïli 1 
1 PAROLE - Analysis, perception and recognition of speech
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : This paper deals with the use of a stochastic language model based on the split of the words history into d words where d is the length of the history. One of our aims is to modelise the semantic and syntactic relationships between words. This model can be considered as a first step for this goal. We experimented our model through the Shannon game (on 10 000 truncated sentences) and implemented it in MAUD, our dictation machine. Tests on MAUD have been done on 300 sentences pronounced by several women and men. This model predicts more words (in the Shannon game) than any other methods we developed before in our team. However, these models are sophisticated in contrast to the one we describe. Moreover, when including unknown words, the results are better than the model ones we presented in a recent work in terms of mean rank, ranks from 1 to 5 and perplexity. This work has needed to use two interpolation methods inspired from Markov model. Also, we discuss the problem of the unknown word modelling.
Document type :
Conference papers
Complete list of metadata
Contributor : Publications Loria Connect in order to contact the contributor
Submitted on : Tuesday, September 26, 2006 - 8:41:00 AM
Last modification on : Saturday, June 25, 2022 - 7:43:03 PM


  • HAL Id : inria-00098984, version 1



David Langlois, Kamel Smaïli. A New Based Distance Language Model for a Dictation Machine: application to MAUD. 6th European Conference on Speech Communication & Technology - EUROSPEECH'99, 1999, Budapest, Hungary, pp.1779-1782. ⟨inria-00098984⟩



Record views