Inflectional morphology analyser for Sanskrit

Girish Nath Jha; Muktanand Agrawal; Sudhir K. Mishra; Diwakar Mani; Diwakar Mishra; Manji Bhadra; Surjit K. Singh; - Subash

Communication Dans Un Congrès Année : 2007

Inflectional morphology analyser for Sanskrit

(1) , (1) , (1) , (1) , (1) , (1) , (1) , (1)

Girish Nath Jha

Fonction : Auteur

Special Centre for Sanskrit Studies

Muktanand Agrawal

Fonction : Auteur

Special Centre for Sanskrit Studies

Sudhir K. Mishra

Fonction : Auteur

Special Centre for Sanskrit Studies

Diwakar Mani

Fonction : Auteur

Special Centre for Sanskrit Studies

Diwakar Mishra

Fonction : Auteur

Special Centre for Sanskrit Studies

Manji Bhadra

Fonction : Auteur

Special Centre for Sanskrit Studies

Surjit K. Singh

Fonction : Auteur

Special Centre for Sanskrit Studies

- Subash

Fonction : Auteur

Special Centre for Sanskrit Studies

Résumé

The paper describes a Sanskrit morphological analyzer that identifies and analyzes inflected nounforms and verb-forms in any given sandhi-free text. The system which has been developed as java servlet RDBMS can be tested at http://sanskrit.jnu.ac.in (Language Processing Tools > Sanskrit Tinanta Analyzer/Subanta Analyzer) with Sanskrit data as unicode text. Subsequently, the separate systems of subanta and ti_anta will be combined into a single system of sentence analysis with karaka interpretation. Currently, the system checks and labels each word as three basic POS categories - subanta, tinanta, and avyaya. Thereafter, each subanta is sent for subanta processing based on an example database and a rule database. The verbs are examined based on a database of verb roots and forms as well by reverse morphology based on Paninian techniques. Future enhancements include plugging in the amarakosha (http://sanskrit.jnu.ac.in/amara) and other noun lexicons with the subanta system. The tinanta will be enhanced by the krdanta analysis module being developed separately.

Domaines

Traitement du texte et du document

Fichier principal

Jha.pdf (623.23 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Brigitte Briot : Connectez-vous pour contacter le contributeur

https://inria.hal.science/inria-00203476

Soumis le : jeudi 10 janvier 2008-11:41:14

Dernière modification le : jeudi 28 février 2008-15:52:15

Archivage à long terme le : mardi 13 avril 2010-16:55:02

Dates et versions

inria-00203476 , version 1 (10-01-2008)

Identifiants

HAL Id : inria-00203476 , version 1

Citer

Girish Nath Jha, Muktanand Agrawal, Sudhir K. Mishra, Diwakar Mani, Diwakar Mishra, et al.. Inflectional morphology analyser for Sanskrit. First International Sanskrit Computational Linguistics Symposium, INRIA Paris-Rocquencourt, Oct 2007, Rocquencourt, France. ⟨inria-00203476⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

SANSKRIT

194 Consultations

578 Téléchargements

Inflectional morphology analyser for Sanskrit

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager