Conference papers

Form item extraction based on line searching

Eric Turolla 1, Yolande Belaïd 1, Abdel Belaïd 1
1 LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : This paper presents an item searching method that has been applied to various kinds of forms. The approach is based on line detection through the Hough transform. Once the straight lines are obtained, the Hough directions are used to detect the real segments in the image. Segments can correspond either to continuous lines or to the black parts of dashed or dotted lines, so the segments are grouped together and classified between each pair of adjacent line crossing points. Items are located by searching for the minimum cycles of the graph constructed from the line intersection points. The last step consists of verifying the line classes, based on the hypothesis that the sides of an item are homogeneous. The method was applied to French tax forms and to tables from scientific publications. The experimental results demonstrate the robustness and reliability of the approach on various forms with different types of item delimiters.
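The line detection step mentioned in the abstract can be illustrated with a minimal sketch of standard Hough voting. This is not the authors' implementation: the function names `hough_votes` and `peaks`, the one-degree angular step, the rounding of rho to whole pixels, and the toy pixel set are all assumptions made for illustration.

```python
import math
from collections import Counter


def hough_votes(points, n_theta=180):
    """Accumulate votes in (rho, theta) space for a set of black pixels.

    Each pixel (x, y) votes for every line rho = x*cos(theta) + y*sin(theta)
    passing through it, with theta sampled in n_theta steps over [0, pi).
    (Hypothetical helper; the discretisation is an assumption.)
    """
    votes = Counter()
    for x, y in points:
        for t in range(n_theta):
            theta = math.pi * t / n_theta
            rho = round(x * math.cos(theta) + y * math.sin(theta))
            votes[(rho, t)] += 1
    return votes


def peaks(votes, min_votes):
    """Return the (rho, theta_index) bins that gathered at least min_votes."""
    return {bin_ for bin_, count in votes.items() if count >= min_votes}


# Toy image: one horizontal rule (y = 3) crossing one vertical rule (x = 5).
pixels = {(x, 3) for x in range(10)} | {(5, y) for y in range(10)}
votes = hough_votes(pixels)
strong = peaks(votes, min_votes=10)
```

With the default of 180 angular steps, the theta index equals the angle in degrees, so the horizontal rule shows up in bin (3, 90) and the vertical rule in bin (5, 0). Neighbouring angle bins also collect many votes for the same rule, so a practical detector would likely need some form of peak grouping before moving on to segment detection.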
Document type :
Conference papers
Contributor : Yolande Belaid
Submitted on : Thursday, November 18, 2010 - 11:26:07 AM
Last modification on : Friday, February 26, 2021 - 3:28:07 PM

Eric Turolla, Yolande Belaïd, Abdel Belaïd. Form item extraction based on line searching. International Workshop on Graphics Recognition - GRCE, Aug 1995, University Park, PA, United States. pp.69-79, ⟨10.1007/3-540-61226-2_7⟩. ⟨inria-00537324⟩


