Administrative Document Analysis and Structure - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Chapitre D'ouvrage Année : 2011

Administrative Document Analysis and Structure

Abdel Belaïd
  • Fonction : Auteur
  • PersonId : 830137
Hatem Hamza
  • Fonction : Auteur
Yolande Belaïd

Résumé

This chapter reports our knowledge about the analysis and recognition of scanned administrative documents. Regarding essentially the administrative paper flow with new and continuous arrivals, all the conventional techniques reserved to static databases modeling and recognition are doomed to failure. For this purpose, a new technique based on the experience was investigated giving very promising results. This technique is related to the case-based reasoning already used in data mining and various problems of machine learning. After the presentation of the context related to the administrative document flow and its requirements in a real time processing, we present a case based reasonning for invoice processing. The case corresponds to the co-existence of a problem and its solution. The problem in an invoice corresponds to a local structure such as the keywords of an address or the line patterns in the amounts table, while the solution is related to their content. This problem is then compared to a document case base using graph probing. For this purpose, we proposed an improvement of an already existing neural network called Incremental Growing Neural Gas
Fichier principal
Vignette du fichier
Belaid-chapter.pdf (662.63 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

inria-00579833 , version 1 (25-03-2011)

Identifiants

Citer

Abdel Belaïd, Vincent Poulain d'Andecy, Hatem Hamza, Yolande Belaïd. Administrative Document Analysis and Structure. Marenglen Biba and Fatos Xhafa. Learning Structure and Schemas from Documents, 375, Springer Verlag, pp.51-72, 2011, Studies in Computational Intelligence, 978-3-642-22912-1. ⟨10.1007/978-3-642-22913-8⟩. ⟨inria-00579833⟩
194 Consultations
8864 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More