Skip to Main content Skip to Navigation
Conference papers

Improving Language-Dependent Named Entity Detection

Abstract : Named Entity Recognition (NER) and Named Entity Linking (NEL) are two research areas that have shown big advancements in recent years. The majority of this research is based on the English language. Hence, some of these improvements are language-dependent and do not necessarily lead to better results when applied to other languages. Therefore, this paper discusses TOMO, an approach to language-aware named entity detection and evaluates it for the German language. This also required the development of a German gold standard dataset, which was based on the English dataset used by the OKE 2016 challenge. An evaluation of the named entity detection task using the web-based platform GERBIL was undertaken and results show that our approach produced higher F1 values than the other annotators did. This indicates that language-dependent features do improve the overall quality of the spotter.
Complete list of metadata

Cited literature [54 references]  Display  Hide  Download
Contributor : Hal Ifip Connect in order to contact the contributor
Submitted on : Monday, January 8, 2018 - 9:50:23 AM
Last modification on : Wednesday, March 28, 2018 - 4:35:05 PM
Long-term archiving on: : Thursday, May 3, 2018 - 4:57:44 PM


Files produced by the author(s)


Distributed under a Creative Commons Attribution 4.0 International License



Gerald Petz, Werner Wetzlinger, Dietmar Nedbal. Improving Language-Dependent Named Entity Detection. 1st International Cross-Domain Conference for Machine Learning and Knowledge Extraction (CD-MAKE), Aug 2017, Reggio, Italy. pp.330-345, ⟨10.1007/978-3-319-66808-6_22⟩. ⟨hal-01677147⟩



Record views


Files downloads