Skip to Main content Skip to Navigation
Theses

Content and data linking leveraging ontological knowledge in data journalism

Cheikh Brahim El Vaigh 1, 2 
1 LinkMedia - Creating and exploiting explicit links between multimedia fragments
Inria Rennes – Bretagne Atlantique , IRISA-D6 - MEDIA ET INTERACTIONS
2 SHAMAN - A Symbolic and Human-centric view of dAta MANagement
IRISA-D7 - GESTION DES DONNÉES ET DE LA CONNAISSANCE
Abstract : This thesis is about the creation of links between textual content and ontological knowledge bases (KBs). It pertains several areas of research: natural language processing, information retrieval and semantic web, and in particular RDF-based KBs. We propose to study collective entity linking, which consists in linking at once mentions of entities present in a textual document to entities in a KB. To that end, we leverage semantic measures, i.e., entity relatedness measures which exploit the relationships between the entities in a KB. We contribute by the definition of well-founded entity relatedness measures that benefit to the extent possible from the properties of RDF KBs through (basic) reasoning, and thus allow to improve the state-of-the-art. Furthermore, we are also interested in the alignment of different KBs, based on KBs embedding techniques. This alignment not only allows to enrich the KBs at hand, but also to indirectly improve the collective entity linking. We contribute by an alignment criterion, based on the alignment of the dimensions of the KBs embedding spaces, which, notably does not need any prior knowledge to perform said KBs alignment.
Document type :
Theses
Complete list of metadata

https://hal.inria.fr/tel-03131484
Contributor : ABES STAR :  Contact
Submitted on : Thursday, September 9, 2021 - 2:01:11 PM
Last modification on : Friday, August 5, 2022 - 2:54:52 PM

File

EL_VAIGH_Cheikh_Brahim.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-03131484, version 2

Citation

Cheikh Brahim El Vaigh. Content and data linking leveraging ontological knowledge in data journalism. Computer Science [cs]. Université Rennes 1, 2021. English. ⟨NNT : 2021REN1S001⟩. ⟨tel-03131484v2⟩

Share

Metrics

Record views

193

Files downloads

181