A complete character recognition and transliteration technique for Devanagari script

Abstract : Transliteration transforms text from one script into another based on phonetic similarities between the characters of the two scripts. In this paper, we present a novel technique for automatic transliteration of Devanagari script using character recognition. Segmentation is one of the first tasks performed to isolate the constituent characters. The line segmentation method presented here handles the case of overlapping lines. The character segmentation algorithm is designed to segment conjuncts and separate shadow characters. The shadow character segmentation scheme uses a connected component method to isolate each character while keeping the constituent characters intact. Statistical features, namely moments of different orders such as area, variance, skewness and kurtosis, are used together with structural features of the characters in a two-phase recognition process. After recognition, the constituent Devanagari characters are mapped to corresponding Roman letters in such a way that the resulting Roman text is pronounced similarly to the source characters.
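The final mapping step described in the abstract can be illustrated with a minimal sketch. It assumes the recognizer has already produced a string of Devanagari code points. The tables below are small, hypothetical examples and not the mapping used in the paper, and the function name `transliterate` is likewise an assumption. The sketch handles only the inherent vowel, dependent vowel signs (matras) and the virama; it does not attempt schwa deletion.

```python
# Hypothetical, partial mapping tables for illustration only;
# the abstract does not give the paper's actual tables.
CONSONANTS = {
    "क": "k", "ग": "g", "त": "t", "द": "d", "न": "n", "प": "p",
    "म": "m", "य": "y", "र": "r", "ल": "l", "व": "v", "स": "s", "ह": "h",
}
VOWELS = {"अ": "a", "आ": "aa", "इ": "i", "ई": "ee", "उ": "u", "ए": "e", "ओ": "o"}
MATRAS = {"ा": "aa", "ि": "i", "ी": "ee", "ु": "u", "े": "e", "ो": "o"}
VIRAMA = "्"  # suppresses the inherent vowel, as in conjuncts


def transliterate(text: str) -> str:
    """Map recognized Devanagari characters to Roman letters (sketch)."""
    out = []
    i = 0
    while i < len(text):
        ch = text[i]
        if ch in CONSONANTS:
            out.append(CONSONANTS[ch])
            nxt = text[i + 1] if i + 1 < len(text) else ""
            if nxt in MATRAS:
                # A dependent vowel sign replaces the inherent "a".
                out.append(MATRAS[nxt])
                i += 1
            elif nxt == VIRAMA:
                # Virama: the consonant carries no vowel.
                i += 1
            else:
                # Bare consonant: emit the inherent vowel.
                out.append("a")
        elif ch in VOWELS:
            out.append(VOWELS[ch])
        else:
            # Characters outside the tables pass through unchanged.
            out.append(ch)
        i += 1
    return "".join(out)


print(transliterate("नमस्ते"))  # namaste
```

Because the sketch always emits the inherent vowel for a bare consonant, a word such as कमल comes out as "kamala"; a fuller system would need pronunciation rules beyond a character table.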
Document type :
Preprints, Working Papers, ...

https://hal.inria.fr/hal-03020043
Contributor : Vinay Kumar
Submitted on : Monday, November 23, 2020 - 4:30:03 PM
Last modification on : Monday, November 23, 2020 - 4:30:03 PM

Identifiers

  • HAL Id : hal-03020043, version 1
  • ARXIV : 2009.13460

Citation

Jasmine Kaur, Vinay Kumar. A complete character recognition and transliteration technique for Devanagari script. 2020. ⟨hal-03020043⟩
