Inflectional morphology analyser for Sanskrit

Abstract : The paper describes a Sanskrit morphological analyzer that identifies and analyzes inflected nounforms and verb-forms in any given sandhi-free text. The system which has been developed as java servlet RDBMS can be tested at http://sanskrit.jnu.ac.in (Language Processing Tools > Sanskrit Tinanta Analyzer/Subanta Analyzer) with Sanskrit data as unicode text. Subsequently, the separate systems of subanta and ti_anta will be combined into a single system of sentence analysis with karaka interpretation. Currently, the system checks and labels each word as three basic POS categories - subanta, tinanta, and avyaya. Thereafter, each subanta is sent for subanta processing based on an example database and a rule database. The verbs are examined based on a database of verb roots and forms as well by reverse morphology based on Paninian techniques. Future enhancements include plugging in the amarakosha (http://sanskrit.jnu.ac.in/amara) and other noun lexicons with the subanta system. The tinanta will be enhanced by the krdanta analysis module being developed separately.
Type de document :
Communication dans un congrès
Gérard Huet and Amba Kulkarni. First International Sanskrit Computational Linguistics Symposium, Oct 2007, Rocquencourt, France. 2007, http://hal.inria.fr/SANSKRIT/fr/
Liste complète des métadonnées

Littérature citée [9 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00203476
Contributeur : Brigitte Briot <>
Soumis le : jeudi 10 janvier 2008 - 11:41:14
Dernière modification le : jeudi 28 février 2008 - 15:52:15
Document(s) archivé(s) le : mardi 13 avril 2010 - 16:55:02

Fichier

Jha.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00203476, version 1

Collections

Citation

Girish Nath Jha, Muktanand Agrawal, Sudhir Mishra, Diwakar Mani, Diwakar Mishra, et al.. Inflectional morphology analyser for Sanskrit. Gérard Huet and Amba Kulkarni. First International Sanskrit Computational Linguistics Symposium, Oct 2007, Rocquencourt, France. 2007, http://hal.inria.fr/SANSKRIT/fr/. 〈inria-00203476〉

Partager

Métriques

Consultations de la notice

305

Téléchargements de fichiers

441