Resources for Named Entity Recognition and Resolution in News Wires

Abstract : In the applicative context of news wire enrichment with metadata, named entity recognition plays an important role, but requires to be followed by a resolution module that maps named entity mentions to entries in a reference database. In this paper, we describe NP, the named entity module embedded in the SXPipe shallow processing chain, that we used for extracting information from French news wires from the Agence France-Presse. We describe the construction of our reference database from freely available external resources, as well as our named entity detection, disambiguation and resolution modules. We also introduce a freely available and manually developped annotated corpus designed for the evaluation of named entity recognition and resolution tools, and provide evaluation figures for NP.
Type de document :
Communication dans un congrès
Entity 2010 Workshop at LREC 2010, May 2010, Valletta, Malta. 2010
Liste complète des métadonnées

Littérature citée [11 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00521240
Contributeur : Benoît Sagot <>
Soumis le : dimanche 26 septembre 2010 - 22:30:44
Dernière modification le : jeudi 15 novembre 2018 - 20:27:26
Document(s) archivé(s) le : jeudi 25 octobre 2012 - 16:00:10

Fichier

entity10np.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00521240, version 1

Collections

Citation

Rosa Stern, Benoît Sagot. Resources for Named Entity Recognition and Resolution in News Wires. Entity 2010 Workshop at LREC 2010, May 2010, Valletta, Malta. 2010. 〈inria-00521240〉

Partager

Métriques

Consultations de la notice

343

Téléchargements de fichiers

299