Extracting Part-Whole Relations from Online Encyclopedia

Abstract : Automatic discovery of part-whole relations is a fundamental problem in the area of information extraction. In this paper, we present an unsupervised approach to learning lexical patterns from online encyclopedia for extracting part-whole relations. The only input is a few part-whole instances. To tackle the term recognition problem, terms from the domain of the seeds are extracted taking use of the semantic information contained in the online encyclopedia. Instead of collecting sentences that contain relation instances from the seeds, we introduce a novel process to select sentences that may indicate part-whole relations. Patterns are produced from these sentences with terms replaced by Part and Whole tags. A similarity measurement based on a new edit distance is used and an algorithm is described to cluster similar patterns. We rank the pattern clusters according to their frequencies, and patterns from the top-k clusters are chosen to be applied to identify the new part-whole relations. Experimental results show that our method can extract abundant part-whole relations and achieve a preferable precision compared to the other state-of-the-art approaches.
Type de document :
Communication dans un congrès
Zhongzhi Shi; Zhaohui Wu; David Leake; Uli Sattler. 8th International Conference on Intelligent Information Processing (IIP), Oct 2014, Hangzhou, China. Springer, IFIP Advances in Information and Communication Technology, AICT-432, pp.57-66, 2014, Intelligent Information Processing VII. 〈10.1007/978-3-662-44980-6_7〉
Liste complète des métadonnées

Littérature citée [13 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01383317
Contributeur : Hal Ifip <>
Soumis le : mardi 18 octobre 2016 - 14:53:01
Dernière modification le : vendredi 3 novembre 2017 - 22:24:06

Fichier

978-3-662-44980-6_7_Chapter.pd...
Fichiers produits par l'(les) auteur(s)

Licence


Distributed under a Creative Commons Paternité 4.0 International License

Identifiants

Citation

Fei Xia, Cungen Cao. Extracting Part-Whole Relations from Online Encyclopedia. Zhongzhi Shi; Zhaohui Wu; David Leake; Uli Sattler. 8th International Conference on Intelligent Information Processing (IIP), Oct 2014, Hangzhou, China. Springer, IFIP Advances in Information and Communication Technology, AICT-432, pp.57-66, 2014, Intelligent Information Processing VII. 〈10.1007/978-3-662-44980-6_7〉. 〈hal-01383317〉

Partager

Métriques

Consultations de la notice

32

Téléchargements de fichiers

25