Conference papers

Weakly supervised named entity classification

Edouard Grave 1, 2, 3, *
* Corresponding author
2 SIERRA - Statistical Machine Learning and Parsimony
DI-ENS - Département d'informatique de l'École normale supérieure, Inria Paris-Rocquencourt, CNRS - Centre National de la Recherche Scientifique : UMR8548
Abstract : In this paper, we describe a new method for the problem of named entity classification for specialized or technical domains, using distant supervision. Our approach relies on a simple observation: in some specialized domains, named entities are almost unambiguous. Thus, given a seed list of names of entities, it is cheap and easy to obtain positive examples from unlabeled texts using a simple string match. Those positive examples can then be used to train a named entity classifier, by using the PU learning paradigm, which is learning from positive and unlabeled examples. We introduce a new convex formulation to solve this problem, and apply our technique in order to extract named entities from financial reports corresponding to healthcare companies.
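The pipeline outlined in the abstract (seed list, string match, then learning from positive and unlabeled examples) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the toy sentences, the seed names, the rule that every capitalized token is a candidate mention, and the neighbouring-word features. The learner is a plain logistic regression in which unlabeled examples are treated as down-weighted negatives, a common PU baseline; it is not the convex formulation introduced in the paper.

```python
import math
import re


def candidates(sentence):
    """Yield (mention, features) for each capitalized token.

    Assumed candidate rule for illustration only; features are the
    lowercased left and right neighbours of the token.
    """
    tokens = re.findall(r"\w+", sentence)
    for i, tok in enumerate(tokens):
        if tok[0].isupper():
            left = tokens[i - 1].lower() if i > 0 else "<s>"
            right = tokens[i + 1].lower() if i + 1 < len(tokens) else "</s>"
            yield tok, {"L=" + left, "R=" + right}


def build_pu_dataset(sentences, seeds):
    """Label a candidate 1 if it exactly matches a seed, else 0.

    Label 0 means "unlabeled", not "negative": unseen entities end up here.
    """
    data = []
    for sentence in sentences:
        for mention, feats in candidates(sentence):
            data.append((mention, feats, 1 if mention in seeds else 0))
    return data


def score(weights, bias, feats):
    """Probability-like score that a candidate with these features is an entity."""
    z = bias + sum(weights.get(f, 0.0) for f in feats)
    return 1.0 / (1.0 + math.exp(-z))


def train(data, unlabeled_weight=0.3, epochs=100, lr=0.2):
    """Stochastic gradient ascent on a weighted logistic log-likelihood.

    Unlabeled examples count as negatives with a smaller weight
    (unlabeled_weight is a hypothetical hyperparameter).
    """
    weights, bias = {}, 0.0
    for _ in range(epochs):
        for _, feats, y in data:
            p = score(weights, bias, feats)
            g = (1.0 if y == 1 else unlabeled_weight) * (y - p)
            bias += lr * g
            for f in feats:
                weights[f] = weights.get(f, 0.0) + lr * g
    return weights, bias


# Toy data, invented for this sketch.
sentences = [
    "The company markets Lipitor for cholesterol",
    "The company markets Zocor for cholesterol",
    "Sales of Nexium for reflux increased",
    "Sales of Crestor for reflux increased",
]
seeds = {"Lipitor", "Nexium"}

data = build_pu_dataset(sentences, seeds)
weights, bias = train(data)
```

In this toy setting, "Zocor" and "Crestor" are not in the seed list, so they stay unlabeled, yet they share their contexts with the seed matches. Because unlabeled examples carry less weight than positives, the classifier can still score such contexts higher than the contexts of sentence-initial words like "The".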

Cited literature [27 references]

https://hal.inria.fr/hal-01095596
Contributor : Edouard Grave
Submitted on : Monday, December 15, 2014 - 8:13:48 PM
Last modification on : Thursday, July 1, 2021 - 5:58:07 PM
Long-term archiving on : Monday, March 16, 2015 - 12:46:29 PM

File

grave2014weakly.pdf
Files produced by the author(s)

Licence


Copyright

Identifiers

  • HAL Id : hal-01095596, version 1

Citation

Edouard Grave. Weakly supervised named entity classification. Workshop on Automated Knowledge Base Construction (AKBC), Dec 2014, Montréal, Canada. ⟨hal-01095596⟩
