Skip to Main content Skip to Navigation
Journal articles

Bayesian networks for incomplete data analysis in form processing

Emilie Philippot 1 Santosh K.C. 1 Abdel Belaïd 1 yolande Belaïd 1 
1 READ - Recognition of writing and analysis of documents
LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : In this paper, we study Bayesian network (BN) for form identification based on partially filled fields. It uses electronic ink-tracing files without having any information about form structure. Given a form format, the ink-tracing files are used to build the BN by providing the possible relationships between corresponding fields using conditional probabilities, that goes from individual fields up to the complete model construction. To simplify the BN, we sub-divide a single form into three different areas: header, body and footer, and integrate them together, where we study three fundamental BN learning algorithms: Naive, Peter & Clark (PC) and maximum weighted spanning tree (MWST). Under this framework, we validate it with a real-world industrial problem i.e., electronic note-taking in form processing. The approach provides satisfactory results, attesting the interest of BN for exploiting the incomplete form analysis problems, in particular.
Complete list of metadata

Cited literature [43 references]  Display  Hide  Download
Contributor : Yolande Belaid Connect in order to contact the contributor
Submitted on : Monday, January 5, 2015 - 10:55:20 AM
Last modification on : Friday, February 25, 2022 - 9:32:04 AM
Long-term archiving on: : Wednesday, June 3, 2015 - 1:06:13 PM


Files produced by the author(s)




Emilie Philippot, Santosh K.C., Abdel Belaïd, yolande Belaïd. Bayesian networks for incomplete data analysis in form processing. International journal of machine learning and cybernetics, Springer, 2014, pp.25. ⟨10.1007/s13042-014-0234-4⟩. ⟨hal-01099727⟩



Record views


Files downloads