Eyes Wide Open: an interactive learning method for the design of rule-based systems - Archive ouverte HAL Access content directly
Journal Articles International Journal on Document Analysis and Recognition Year : 2017

Eyes Wide Open: an interactive learning method for the design of rule-based systems

(1, 2) , (2, 3) , (1, 2)
1
2
3

Abstract

We present in this paper a new general method, the Eyes Wide Open method (EWO) for the design of rule-based document recognition systems. Our contribution is to introduce a learning procedure, through machine learning techniques, in interaction with the user to design the recognition system. Therefore, and unlike many approaches that are manually designed, ours can easily adapt to a new type of documents while taking advantage of the expressiveness of rule-based systems and their ability to convey the hierarchical structure of a document. The EWO method is independent of any existing recognition system. An automatic analysis of an annotated corpus, guided by the user, is made to help the adaption of the recognition system to a new kind of document. The user will then bring sense to the automatically extracted information. In this paper, we validate EWO by producing two rule-based systems: one for the Mau-rdor international competition, on a heterogeneous corpus of documents, containing handwritten and printed documents, written in different languages and another one for the RIMES competition corpus, a homogeneous corpus of French handwritten business letters. On the RIMES corpus, our method allows an assisted design of a grammatical description that gives better results than all the previously proposed statistical systems.
Fichier principal
Vignette du fichier
article_ijdar_pre_print.pdf (1.72 Mo) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

hal-01493442 , version 1 (21-03-2017)

Identifiers

Cite

Cérès Carton, Aurélie Lemaitre, Bertrand B. Coüasnon. Eyes Wide Open: an interactive learning method for the design of rule-based systems. International Journal on Document Analysis and Recognition, 2017, 20 (2), pp.91-103. ⟨10.1007/s10032-017-0282-x⟩. ⟨hal-01493442⟩
475 View
211 Download

Altmetric

Share

Gmail Facebook Twitter LinkedIn More