Eyes Wide Open: an interactive learning method for the design of rule-based systems

Abstract: We present the Eyes Wide Open (EWO) method, a new general method for designing rule-based document recognition systems. Our contribution is a learning procedure that uses machine learning techniques, in interaction with the user, to design the recognition system. Unlike many manually designed approaches, ours can therefore easily adapt to a new type of document while retaining the expressiveness of rule-based systems and their ability to convey the hierarchical structure of a document. The EWO method is independent of any existing recognition system. An automatic analysis of an annotated corpus, guided by the user, helps adapt the recognition system to a new kind of document; the user then gives meaning to the automatically extracted information. We validate EWO by producing two rule-based systems: one for the Maurdor international competition, on a heterogeneous corpus of handwritten and printed documents written in different languages, and another for the RIMES competition corpus, a homogeneous corpus of French handwritten business letters. On the RIMES corpus, our method allows an assisted design of a grammatical description that gives better results than all the previously proposed statistical systems.
Document type:
Journal article
International Journal on Document Analysis and Recognition, Springer Verlag, 2017, 20 (2), pp.91-103. 〈10.1007/s10032-017-0282-x〉

https://hal.inria.fr/hal-01493442
Contributor: Aurélie Lemaitre
Submitted on: Tuesday, March 21, 2017 - 15:46:38
Last modified on: Tuesday, January 16, 2018 - 15:54:19
Document(s) archived on: Thursday, June 22, 2017 - 13:47:26

File

article_ijdar_pre_print.pdf
Files produced by the author(s)

Citation

Cérès Carton, Aurélie Lemaitre, Bertrand Coüasnon. Eyes Wide Open: an interactive learning method for the design of rule-based systems. International Journal on Document Analysis and Recognition, Springer Verlag, 2017, 20 (2), pp.91-103. 〈10.1007/s10032-017-0282-x〉. 〈hal-01493442〉
