Service interruption on Monday 11 July from 12:30 to 13:00: all the sites of the CCSD (HAL, EpiSciences, SciencesConf, AureHAL) will be inaccessible (network hardware connection).
Abstract : Automatic discovery of part-whole relations is a fundamental problem in the area of information extraction. In this paper, we present an unsupervised approach to learning lexical patterns from online encyclopedia for extracting part-whole relations. The only input is a few part-whole instances. To tackle the term recognition problem, terms from the domain of the seeds are extracted taking use of the semantic information contained in the online encyclopedia. Instead of collecting sentences that contain relation instances from the seeds, we introduce a novel process to select sentences that may indicate part-whole relations. Patterns are produced from these sentences with terms replaced by Part and Whole tags. A similarity measurement based on a new edit distance is used and an algorithm is described to cluster similar patterns. We rank the pattern clusters according to their frequencies, and patterns from the top-k clusters are chosen to be applied to identify the new part-whole relations. Experimental results show that our method can extract abundant part-whole relations and achieve a preferable precision compared to the other state-of-the-art approaches.
https://hal.inria.fr/hal-01383317 Contributor : Hal IfipConnect in order to contact the contributor Submitted on : Tuesday, October 18, 2016 - 2:53:01 PM Last modification on : Thursday, March 5, 2020 - 5:41:03 PM
Fei Xia, Cungen Cao. Extracting Part-Whole Relations from Online Encyclopedia. 8th International Conference on Intelligent Information Processing (IIP), Oct 2014, Hangzhou, China. pp.57-66, ⟨10.1007/978-3-662-44980-6_7⟩. ⟨hal-01383317⟩