Skip to Main content Skip to Navigation
Conference papers

Handwritten/Machine-printed and Arabic/Latin Mathematical Formula Discrimination and Recognition

Abstract : In this paper we mainly introduce a method for mathematical formula script and type identification based on handcrafted features. Arabic/Latin scripts are discriminated by detecting specific symbols based on their pixel density and distribution. Once the formula script identified, we proposed to separate between machine-printed and handwritten formulas. For that, we analyzed the spaces between some specific symbols and their neighbors and the pixel density of some other symbols presenting discriminative differences in their form when they are handwritten or machine-printed. The use of such structural characteristics makes the complexity of the method considerably lower than learning based methods. To finally recognize formulas, two main steps are followed: symbol recognition and formula structure analysis. For the first step, we used a combination of statistical features and an instance-based classifier. For the second step, we proceeded by top-down and bottom-up parsing scheme based on operator dominance. A set of replacement rules is defined by a coordinate grammar. Formula parsing consists of applying, from the dominant operator and its context, the appropriate rule to divide the formula into sub-formulas which will be recursively analyzed by the same way. Carried experiments on various mathematical formulas, show the efficiency of both script and type identification and recognition proposed methods.
Document type :
Conference papers
Complete list of metadata

Cited literature [9 references]  Display  Hide  Download

https://hal.inria.fr/hal-01981545
Contributor : Abdel Belaid <>
Submitted on : Tuesday, January 15, 2019 - 10:31:37 AM
Last modification on : Sunday, May 16, 2021 - 3:24:42 AM
Long-term archiving on: : Tuesday, April 16, 2019 - 1:08:48 PM

File

Kawther-paper.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01981545, version 1

Citation

Kawther Ayeb, Afef Echi. Handwritten/Machine-printed and Arabic/Latin Mathematical Formula Discrimination and Recognition. International Workshop on Arabic Script Analysis and Recognition, Apr 2017, NANCY, France. ⟨hal-01981545⟩

Share

Metrics

Record views

93

Files downloads

290