Resources for Named Entity Recognition and Resolution in News Wires

Abstract : In the applicative context of news wire enrichment with metadata, named entity recognition plays an important role, but requires to be followed by a resolution module that maps named entity mentions to entries in a reference database. In this paper, we describe NP, the named entity module embedded in the SXPipe shallow processing chain, that we used for extracting information from French news wires from the Agence France-Presse. We describe the construction of our reference database from freely available external resources, as well as our named entity detection, disambiguation and resolution modules. We also introduce a freely available and manually developped annotated corpus designed for the evaluation of named entity recognition and resolution tools, and provide evaluation figures for NP.
Document type :
Conference papers
Complete list of metadatas

Cited literature [11 references]  Display  Hide  Download

https://hal.inria.fr/inria-00521240
Contributor : Benoît Sagot <>
Submitted on : Sunday, September 26, 2010 - 10:30:44 PM
Last modification on : Thursday, August 29, 2019 - 2:24:09 PM
Long-term archiving on : Thursday, October 25, 2012 - 4:00:10 PM

File

entity10np.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : inria-00521240, version 1

Collections

Citation

Rosa Stern, Benoît Sagot. Resources for Named Entity Recognition and Resolution in News Wires. Entity 2010 Workshop at LREC 2010, May 2010, Valletta, Malta. ⟨inria-00521240⟩

Share

Metrics

Record views

373

Files downloads

358