Lemmatization of Polish Person Names

Abstract : The paper presents two techniques for lemmatization of Polish person names. First, we apply a rule-based approach which relies on linguistic information and heuristics. Then, we investigate an alternative knowledge-poor method which employs string distance measures. We provide an evaluation of the adopted techniques using a set of newspaper texts.
Type de document :
Communication dans un congrès
ACL, Jul 2007, Prague, Czech Republic. 2007
Liste complète des métadonnées

https://hal.inria.fr/inria-00420996
Contributeur : Anna Kupsc <>
Soumis le : mercredi 30 septembre 2009 - 12:49:50
Dernière modification le : jeudi 11 janvier 2018 - 06:16:07

Identifiants

  • HAL Id : inria-00420996, version 1

Collections

Citation

Anna Kupsc, Jakub Piskorski, Marcin Sydow. Lemmatization of Polish Person Names. ACL, Jul 2007, Prague, Czech Republic. 2007. 〈inria-00420996〉

Partager

Métriques

Consultations de la notice

52