Language Independent Statistical Models for on-Line Handwriting Recognition

Freddy Perraud; Christian Viard-Gaudin; Emmanuel Morin

Communication Dans Un Congrès Année : 2006

Language Independent Statistical Models for on-Line Handwriting Recognition

(1) , (2) , (3)

1
2
3

Freddy Perraud

Fonction : Auteur
PersonId : 835674

Vision Objects

Christian Viard-Gaudin

Fonction : Auteur
PersonId : 16788
IdHAL : christian-viard-gaudin
IdRef : 098591045

Institut de Recherche en Communications et en Cybernétique de Nantes

Emmanuel Morin

Fonction : Auteur
PersonId : 3632
IdHAL : emmanuel-morin
ORCID : 0000-0001-8208-7039
IdRef : 14379373X

Laboratoire d'Informatique de Nantes Atlantique

Résumé

This paper deals with a language modeling approach that is dedicated to an on-line handwriting recognition system. Three main goals are set: i) performances, ii) versatility, and iii) resources. To achieve these goals we propose a statistical word n-class approach, which uses a learning stage to cluster words in classes and defines an estimation of the probability distribution of sequences of classes. Very large corpora from three different languages (English, French and Italian) have been used to train and test the language models. The efficiency of these models are evaluated not only from a linguistic point of view, using perplexity measurements, but also combined inside the recognition system on real ink signals corresponding to written sentences. Using a tri-class model allows a word error rate reduction ranging from to 50 to 60% according to the language.

Mots clés

Handwriting recognition language modeling n-gram n-class perplexity

Domaines

Vision par ordinateur et reconnaissance de formes [cs.CV] Traitement du texte et du document

Fichier principal

cr1020174712354.pdf (280.36 Ko)

Anne Jaigu : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00103860

Soumis le : jeudi 5 octobre 2006-13:47:22

Dernière modification le : vendredi 5 janvier 2024-03:23:24

Archivage à long terme le : mardi 6 avril 2010-17:28:45

Dates et versions

inria-00103860 , version 1 (05-10-2006)

Identifiants

HAL Id : inria-00103860 , version 1

Citer

Freddy Perraud, Christian Viard-Gaudin, Emmanuel Morin. Language Independent Statistical Models for on-Line Handwriting Recognition. Tenth International Workshop on Frontiers in Handwriting Recognition, Université de Rennes 1, Oct 2006, La Baule (France). ⟨inria-00103860⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-NANTES CNRS EC-NANTES LINA LINA-TALN IWFHR10 IRCCYN UNAM LS2N NANTES-UNIVERSITE

445 Consultations

186 Téléchargements

Language Independent Statistical Models for on-Line Handwriting Recognition

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager