Skip to Main content Skip to Navigation
Conference papers

Language Independent Statistical Models for on-Line Handwriting Recognition

Abstract : This paper deals with a language modeling approach that is dedicated to an on-line handwriting recognition system. Three main goals are set: i) performances, ii) versatility, and iii) resources. To achieve these goals we propose a statistical word n-class approach, which uses a learning stage to cluster words in classes and defines an estimation of the probability distribution of sequences of classes. Very large corpora from three different languages (English, French and Italian) have been used to train and test the language models. The efficiency of these models are evaluated not only from a linguistic point of view, using perplexity measurements, but also combined inside the recognition system on real ink signals corresponding to written sentences. Using a tri-class model allows a word error rate reduction ranging from to 50 to 60% according to the language.
Complete list of metadata
Contributor : Anne Jaigu <>
Submitted on : Thursday, October 5, 2006 - 1:47:22 PM
Last modification on : Wednesday, November 25, 2020 - 10:28:02 AM
Long-term archiving on: : Tuesday, April 6, 2010 - 5:28:45 PM


  • HAL Id : inria-00103860, version 1


Freddy Perraud, Christian Viard-Gaudin, Emmanuel Morin. Language Independent Statistical Models for on-Line Handwriting Recognition. Tenth International Workshop on Frontiers in Handwriting Recognition, Université de Rennes 1, Oct 2006, La Baule (France). ⟨inria-00103860⟩



Record views


Files downloads