Adapting Semantic Spreading Activation to Entity Linking in text

Farhad Nooralahzadeh; Cédric Lopez; Elena Cabrio; Fabien Gandon; Frederique Segond

Communication Dans Un Congrès Année : 2016

Adapting Semantic Spreading Activation to Entity Linking in text

(1) , (2) , (1) , (1) , (2)

1
2

Farhad Nooralahzadeh

Fonction : Auteur

Web-Instrumented Man-Machine Interactions, Communities and Semantics

Cédric Lopez

Fonction : Auteur
PersonId : 960390
ORCID : 0000-0002-4933-5720
IdRef : 164704922

VISEO - Objet Direct

Elena Cabrio

Fonction : Auteur
PersonId : 973442

Web-Instrumented Man-Machine Interactions, Communities and Semantics

Fabien Gandon

Fonction : Auteur
PersonId : 3342
IdHAL : fabien-gandon
ORCID : 0000-0003-0543-1232
IdRef : 076340074

Web-Instrumented Man-Machine Interactions, Communities and Semantics

Frederique Segond

Fonction : Auteur
PersonId : 11539
IdHAL : frederique-segond
ORCID : 0000-0001-9420-9654
IdRef : 069068151

VISEO - Objet Direct

Résumé

The extraction and the disambiguation of knowledge guided by textual resources on the web is a crucial process to advance the Web of Linked Data. The goal of our work is to semantically enrich raw data by linking the mentions of named entities in the text to the corresponding known entities in knowledge bases. In our approach multiple aspects are considered: the prior knowledge of an entity in Wikipedia (i.e. the keyphraseness and commonness features that can be precomputed by crawling the Wikipedia dump), a set of features extracted from the input text and from the knowledge base, along with the correlation/relevancy among the resources in Linked Data. More precisely, this work explores the collective ranking approach formalized as a weighted graph model, in which the mentions in the input text and the candidate entities from knowledge bases are linked using the local compatibility and the global relatedness measures. Experiments on the datasets of the Open Knowledge Extraction (OKE) challenge with different configurations of our approach in each phase of the linking pipeline reveal its optimum mode. We investigate the notion of semantic relatedness between two entities represented as sets of neighbours in Linked Open Data that relies on an associative retrieval algorithm, with consideration of common neighbourhood. This measure improves the performance of prior link-based models and outperforms the explicit inter-link relevancy measure among entities (mostly Wikipedia-centric). Thus, our approach is resilient to non-existent or sparse links among related entities.

Mots clés

Entity linking Linked data Collective entity ranking Semantic spreading

Domaines

Traitement du texte et du document Web

Elena Cabrio : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01332626

Soumis le : jeudi 16 juin 2016-11:47:04

Dernière modification le : lundi 26 février 2024-11:22:08

Dates et versions

hal-01332626 , version 1 (16-06-2016)

Identifiants

HAL Id : hal-01332626 , version 1

Citer

Farhad Nooralahzadeh, Cédric Lopez, Elena Cabrio, Fabien Gandon, Frederique Segond. Adapting Semantic Spreading Activation to Entity Linking in text. Proceedings of NLDB 2016 - 21st International Conference on Applications of Natural Language to Information Systems, Jun 2016, Manchester, United Kingdom. ⟨hal-01332626⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-RENNES1 CNRS INRIA IRISA I3S WIMMICS INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-COTEDAZUR UNIV-RENNES UR1-MATH-NUM

224 Consultations

0 Téléchargements

Adapting Semantic Spreading Activation to Entity Linking in text

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager