Hidden Markov models in text recognition

J.-C. Anigbogu; Abdel Belaïd

doi:10.1142/S0218001495000389

Journal article. International Journal of Pattern Recognition and Artificial Intelligence. Year: 1995


J.-C. Anigbogu

Role: Author

Schlumberger Austin Systems Center

Abdel Belaïd

Role: Author
PersonId : 830137


Abstract

A multi-level multifont character recognition system is presented. The system proceeds by first delimiting the context of the characters. To enhance system performance, typographical information is extracted and used for font identification before actual character recognition is performed. This has the advantage of more reliable character identification as well as reproduction of the text in its original form. Font identification is based on decision trees in which the characters are automatically arranged into different confusion classes according to the physical characteristics of the fonts. The character recognizers are built around first- and second-order hidden Markov models (HMMs) as well as Euclidean distance measures. The HMMs use the Viterbi and Extended Viterbi algorithms, to which enhancements were made. Also included is a majority-vote system that polls the other recognizers for advice before deciding on the identity of a character. This last system is shown to give better results than each of the other systems applied individually. Finally, the system uses combinations of stochastic and dictionary verification methods for word recognition and error correction.
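The abstract names two building blocks that are easy to illustrate: Viterbi decoding of an HMM and a majority vote over several recognizers. The sketch below is a minimal textbook version of both, written only to make the ideas concrete. It is not the authors' implementation: it covers the first-order case only, it does not reproduce their enhancements or the Extended Viterbi algorithm, and the function names, the dictionary-based parameter layout, and the tie-breaking rule (`fallback`) are assumptions made for this illustration.

```python
import math
from collections import Counter


def viterbi(obs, states, log_start, log_trans, log_emit):
    """Most likely state path of a first-order HMM, computed in log space.

    Hypothetical layout: log_start[s], log_trans[prev][s], log_emit[s][symbol].
    """
    # best[t][s] is the best log-score of any path that ends in state s at time t.
    best = [{s: log_start[s] + log_emit[s][obs[0]] for s in states}]
    back = []  # back[t-1][s] is the predecessor of s on that best path
    for symbol in obs[1:]:
        scores, pointers = {}, {}
        for s in states:
            prev = max(states, key=lambda p: best[-1][p] + log_trans[p][s])
            scores[s] = best[-1][prev] + log_trans[prev][s] + log_emit[s][symbol]
            pointers[s] = prev
        best.append(scores)
        back.append(pointers)
    # Trace the best final state back to the start.
    last = max(states, key=lambda s: best[-1][s])
    path = [last]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return path, best[-1][last]


def majority_vote(labels, fallback=None):
    """Return the label proposed by the most recognizers.

    Assumption: on a tie for first place, return `fallback` if given,
    otherwise the tied label that appeared first in `labels`.
    """
    if not labels:
        raise ValueError("need at least one recognizer output")
    counts = Counter(labels).most_common()
    if len(counts) > 1 and counts[0][1] == counts[1][1] and fallback is not None:
        return fallback
    return counts[0][0]


def to_log(table):
    """Convert a nested dict of probabilities to log-probabilities."""
    return {k: {j: math.log(p) for j, p in row.items()} for k, row in table.items()}
```

In a setting like the one described, each recognizer (for example a first-order HMM, a second-order HMM, and a Euclidean-distance classifier) would propose one label per character, and `majority_vote` would pick among them. How the paper actually weights or resolves disagreements is not stated in the abstract.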

Keywords

Character recognition; Decision tree; Second order; Markov model; Error detection; Error correction; System performance; Information system; Multifont character recognition; Hidden Markov model; Euclidean distance; Viterbi algorithm; Majority vote

Domains

Digital Libraries [cs.DL]


https://inria.hal.science/inria-00533980

Submitted on: Monday, November 8, 2010, 15:50:33

Last modified on: Friday, March 24, 2023, 14:52:53

Dates and versions

inria-00533980 , version 1 (08-11-2010)

Identifiers

HAL Id: inria-00533980, version 1
DOI: 10.1142/S0218001495000389

Cite

J.-C. Anigbogu, Abdel Belaïd. Hidden Markov models in text recognition. International Journal of Pattern Recognition and Artificial Intelligence, 1995, 9 (6), pp.925-958. ⟨10.1142/S0218001495000389⟩. ⟨inria-00533980⟩


Collections

CNRS INRIA UNIV-LORRAINE LORIA

