ILP for Mining Linked Open Data: a biomedical Case Study

Gabin Personeni 1 Simon Daget 1 Céline Bonnet 2 Philippe Jonveaux 2 Marie-Dominique Devignes 1 Malika Smaïl-Tabbone 1 Adrien Coulet 1
1 ORPAILLEUR - Knowledge representation, reasonning
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : This paper is a summary of a recent work accepted (as a long paper) for publication in the tenth international conference on Data Integration in Life Sciences (DILS 2014) [1]. We briefly describe in the last section ongoing work for improving the rule-based prediction process. Increasing amounts of biomedical data provided as Linked Open Data (LOD) offer novel opportunities for knowledge discovery. LOD are represented in a standard format, partially integrated, and offer connections with domain knowledge available in semantic web ontologies. Relational data mining methods such as ILP are good candidates to consider together LOD and domain knowledge awareness. We propose in this paper an approach for collecting and mining LOD, using ILP, with the goal of characterizing and predicting genes responsible for a disease. An integration step enables to select and link together relevant pieces of LOD. It results from this integration a graph that is subsequently mined us-ing ILP. For this real-world use case, we design ILP experiments on two subsets of relational descriptors of genes responsible for intellectual dis-ability. We evaluate ILP results and assess the contribution of domain knowledge. Our ongoing efforts explore how the combination of rules coming from distinct theories can improve the prediction accuracy.
Document type :
Conference papers
Complete list of metadatas

Cited literature [23 references]  Display  Hide  Download

https://hal.inria.fr/hal-01095597
Contributor : Adrien Coulet <>
Submitted on : Monday, December 15, 2014 - 8:16:28 PM
Last modification on : Tuesday, December 18, 2018 - 4:38:02 PM
Long-term archiving on : Monday, March 16, 2015 - 12:46:39 PM

File

ilp2014.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01095597, version 1

Collections

Citation

Gabin Personeni, Simon Daget, Céline Bonnet, Philippe Jonveaux, Marie-Dominique Devignes, et al.. ILP for Mining Linked Open Data: a biomedical Case Study. The 24th International Conference on Inductive Logic Programming (ILP 2014), Sep 2014, Nancy, France. ⟨hal-01095597⟩

Share

Metrics

Record views

454

Files downloads

251