Conference paper, 2010

Fast Development of Basic NLP Tools: Towards a Lexicon and a POS Tagger for Kurmanji Kurdish

Abstract

The development of basic NLP resources for minority languages remains a challenge for both formal and computational linguists. In this paper, we show how we developed a medium-scale morphological lexicon for Kurmanji Kurdish in a few days' time using only freely accessible resources. We also developed a preliminary POS tagger, based solely on raw text and on our morphological lexicon, which is intended to serve as a pre-annotation tool for developing a POS-annotated corpus.
Main file: clg10kmr.pdf (76.87 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-00510999, version 1 (23-08-2010)

Licence

Attribution

Identifiers

  • HAL Id: hal-00510999, version 1

Cite

Géraldine Walther, Benoît Sagot, Karen Fort. Fast Development of Basic NLP Tools: Towards a Lexicon and a POS Tagger for Kurmanji Kurdish. International Conference on Lexis and Grammar, Sep 2010, Belgrade, Serbia. ⟨hal-00510999⟩