Conference papers

Improving Language-Dependent Named Entity Detection

Abstract : Named Entity Recognition (NER) and Named Entity Linking (NEL) are two research areas that have advanced considerably in recent years. Most of this research is based on the English language, so some of these improvements are language-dependent and do not necessarily lead to better results when applied to other languages. This paper therefore discusses TOMO, an approach to language-aware named entity detection, and evaluates it for the German language. The evaluation required the development of a German gold standard dataset, which was based on the English dataset used by the OKE 2016 challenge. The named entity detection task was evaluated using the web-based platform GERBIL, and the results show that our approach produced higher F1 values than the other annotators. This indicates that language-dependent features improve the overall quality of the spotter.
Cited literature [54 references]

https://hal.inria.fr/hal-01677147
Contributor : Hal Ifip
Submitted on : Monday, January 8, 2018 - 9:50:23 AM
Last modification on : Wednesday, March 28, 2018 - 4:35:05 PM
Long-term archiving on : Thursday, May 3, 2018 - 4:57:44 PM

File

456304_1_En_22_Chapter.pdf
Files produced by the author(s)

Licence


Distributed under a Creative Commons Attribution 4.0 International License

Citation

Gerald Petz, Werner Wetzlinger, Dietmar Nedbal. Improving Language-Dependent Named Entity Detection. 1st International Cross-Domain Conference for Machine Learning and Knowledge Extraction (CD-MAKE), Aug 2017, Reggio, Italy. pp.330-345, ⟨10.1007/978-3-319-66808-6_22⟩. ⟨hal-01677147⟩
