Lemmatization of Polish Person Names - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2007

Lemmatization of Polish Person Names

Résumé

The paper presents two techniques for lemmatization of Polish person names. First, we apply a rule-based approach which relies on linguistic information and heuristics. Then, we investigate an alternative knowledge-poor method which employs string distance measures. We provide an evaluation of the adopted techniques using a set of newspaper texts.

Domaines

Linguistique
Fichier non déposé

Dates et versions

inria-00420996 , version 1 (30-09-2009)

Identifiants

  • HAL Id : inria-00420996 , version 1

Citer

Anna Kupść, Jakub Piskorski, Marcin Sydow. Lemmatization of Polish Person Names. ACL, Jul 2007, Prague, Czech Republic. ⟨inria-00420996⟩
65 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More