EXTRAFOR : automatic EXTRAction of mathematical FORmulas

Afef Kacem 1 Abdel Belaid 2 Mohamed Ben Ahmed 1
2 LORIASI - Loria in the Society of Information
LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : We present a method for automatic extraction of mathematical formulas from images of documents without character recognition. Formula extraction is first done by location of its most significant symbols, then extension to adjoining symbols using contextual rules until delimitation of the whole formula space. Mathematical symbols labelling is realised from models created at the learning stage using fuzzy logic. This paper reviews our current efforts to develop such a system, presents problems we have encountered and summarises our results. The average rate of preliminary labelling rate is about 95.3%. 90% of mathematical formulas are well extracted from documents printed with high quality.
Type de document :
Communication dans un congrès
International Conference on Document Analysis & Recognition - ICDAR'99, 1999, Bangalore, India. IEEE, pp.527-530, 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition, 1999. ICDAR '99. 〈10.1109/ICDAR.1999.791841〉
Liste complète des métadonnées

https://hal.inria.fr/inria-00098838
Contributeur : Publications Loria <>
Soumis le : mardi 26 septembre 2006 - 08:39:03
Dernière modification le : jeudi 11 janvier 2018 - 06:19:48

Lien texte intégral

Identifiants

Collections

Citation

Afef Kacem, Abdel Belaid, Mohamed Ben Ahmed. EXTRAFOR : automatic EXTRAction of mathematical FORmulas. International Conference on Document Analysis & Recognition - ICDAR'99, 1999, Bangalore, India. IEEE, pp.527-530, 1999, Proceedings of the Fifth International Conference on Document Analysis and Recognition, 1999. ICDAR '99. 〈10.1109/ICDAR.1999.791841〉. 〈inria-00098838〉

Partager

Métriques

Consultations de la notice

1445