HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Conference papers

Language Independent Statistical Models for on-Line Handwriting Recognition

Abstract : This paper deals with a language modeling approach that is dedicated to an on-line handwriting recognition system. Three main goals are set: i) performances, ii) versatility, and iii) resources. To achieve these goals we propose a statistical word n-class approach, which uses a learning stage to cluster words in classes and defines an estimation of the probability distribution of sequences of classes. Very large corpora from three different languages (English, French and Italian) have been used to train and test the language models. The efficiency of these models are evaluated not only from a linguistic point of view, using perplexity measurements, but also combined inside the recognition system on real ink signals corresponding to written sentences. Using a tri-class model allows a word error rate reduction ranging from to 50 to 60% according to the language.
Complete list of metadata

Contributor : Anne Jaigu Connect in order to contact the contributor
Submitted on : Thursday, October 5, 2006 - 1:47:22 PM
Last modification on : Wednesday, April 27, 2022 - 3:52:05 AM
Long-term archiving on: : Tuesday, April 6, 2010 - 5:28:45 PM


  • HAL Id : inria-00103860, version 1


Freddy Perraud, Christian Viard-Gaudin, Emmanuel Morin. Language Independent Statistical Models for on-Line Handwriting Recognition. Tenth International Workshop on Frontiers in Handwriting Recognition, Université de Rennes 1, Oct 2006, La Baule (France). ⟨inria-00103860⟩



Record views


Files downloads