Mining Linked Open Data: A Case Study with Genes Responsible for Intellectual Disability

Gabin Personeni 1 Simon Daget 1 Céline Bonnet 2 Philippe Jonveaux 2 Marie-Dominique Devignes 1 Malika Smaïl-Tabbone 1 Adrien Coulet 1
1 ORPAILLEUR - Knowledge representation, reasonning
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract : Linked Open Data (LOD) constitute a unique dataset that is in a standard format, partially integrated, and facilitates connections with domain knowledge represented within semantic web ontologies. Increasing amounts of biomedical data provided as LOD consequently offer novel opportunities for knowledge discovery in biomedicine. However, most data mining methods are neither adapted to LOD format, nor adapted to consider domain knowledge. We propose in this paper an approach for selecting, integrating, and mining LOD with the goal of discovering genes responsible for a disease. Selection step relies on a set of choices made by a domain expert to isolate relevant pieces of LOD. Because these pieces are potentially not linked, an integration step is required to connect unlinked pieces. Resulting of LOD. Second, domain knowledge can be added to this input and be considered by ILP. We have implemented and applied this approach to the characterisation of genes responsible for intellectual disability. On the basis of this real world use case, we present an evaluation of our mining approach and discuss its advantages and drawbacks for the mining of biomedical LOD.
Type de document :
Communication dans un congrès
Helena Galhardas, Erhard Rahm. Data Integration in the Life Sciences - 10th International Conference, DILS 2014, Jul 2014, Lisbon, Portugal. Springer, 8574, pp.16 - 31, 2014, Lecture Notes in Computer Science. 〈10.1007/978-3-319-08590-6_2〉
Liste complète des métadonnées

Littérature citée [29 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01095591
Contributeur : Adrien Coulet <>
Soumis le : lundi 15 décembre 2014 - 20:00:18
Dernière modification le : jeudi 8 février 2018 - 16:54:03
Document(s) archivé(s) le : lundi 16 mars 2015 - 12:45:58

Fichier

personeni_et_al_dils14.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Gabin Personeni, Simon Daget, Céline Bonnet, Philippe Jonveaux, Marie-Dominique Devignes, et al.. Mining Linked Open Data: A Case Study with Genes Responsible for Intellectual Disability. Helena Galhardas, Erhard Rahm. Data Integration in the Life Sciences - 10th International Conference, DILS 2014, Jul 2014, Lisbon, Portugal. Springer, 8574, pp.16 - 31, 2014, Lecture Notes in Computer Science. 〈10.1007/978-3-319-08590-6_2〉. 〈hal-01095591〉

Partager

Métriques

Consultations de la notice

362

Téléchargements de fichiers

416