Suggesting valid pharmacogenes by mining linked data

Kevin Dalleau 1 Ndeye Coumba Ndiaye 2 Adrien Coulet 1, *
* Corresponding author
1 ORPAILLEUR - Knowledge representation, reasoning
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract: A standard task in pharmacogenomics research is identifying genes that may be involved in drug response variability, i.e., pharmacogenes. Because genomic experiments tend to generate many false positives, computational approaches that exploit background knowledge have been proposed. Until now, these have used only molecular networks or the biomedical literature. Here we propose a novel method that consumes an eclectic set of linked data sources to help validate uncertain drug–gene relationships. One advantage of this approach is that linked data are implemented in a standard framework that facilitates the joint use of various sources, making it easy to consider features of diverse origins. Accordingly, we propose an initial selection of linked data sources relevant to pharmacogenomics. We formatted these data to train a random forest algorithm, producing a model that classifies drug–gene pairs as related or not, thus confirming the validity of candidate pharmacogenes. Our model achieves an F-measure of 0.92 in 100-fold cross-validation. A list of top candidates is provided, and how they were obtained is discussed.
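To make the evaluation protocol mentioned in the abstract concrete, the following is a minimal sketch of k-fold cross-validation scored with the F-measure, under several assumptions. It is not the authors' pipeline: the random forest is replaced by a trivial one-feature threshold classifier (`train_threshold`, a hypothetical stand-in), the toy data are invented for illustration, and the F-measure is computed on predictions pooled across folds, which may differ from how the paper aggregated its score.

```python
from typing import Callable, List, Sequence, Tuple

# One drug-gene pair: (feature vector, label), label 1 = related, 0 = not related.
Pair = Tuple[Sequence[float], int]
Model = Callable[[Sequence[float]], int]


def f_measure(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """F-measure (harmonic mean of precision and recall) for the positive class."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def k_fold_indices(n: int, k: int) -> List[List[int]]:
    """Split indices 0..n-1 into k folds by round-robin assignment."""
    return [list(range(i, n, k)) for i in range(k)]


def cross_validated_f_measure(
    data: Sequence[Pair], k: int, train: Callable[[List[Pair]], Model]
) -> float:
    """Train on k-1 folds, predict the held-out fold, pool predictions, score."""
    y_true: List[int] = []
    y_pred: List[int] = []
    for held_out in k_fold_indices(len(data), k):
        held = set(held_out)
        training = [pair for i, pair in enumerate(data) if i not in held]
        model = train(training)
        for i in held_out:
            features, label = data[i]
            y_true.append(label)
            y_pred.append(model(features))
    return f_measure(y_true, y_pred)


def train_threshold(training: List[Pair]) -> Model:
    """Hypothetical stand-in for the random forest: threshold on feature 0,
    placed midway between the two class means."""
    pos = [f[0] for f, y in training if y == 1]
    neg = [f[0] for f, y in training if y == 0]
    threshold = (sum(pos) / len(pos) + sum(neg) / len(neg)) / 2
    return lambda features: 1 if features[0] > threshold else 0


# Invented toy data: a single feature that separates the two classes.
toy_pairs: List[Pair] = [
    ([0.1], 0), ([0.9], 1), ([0.2], 0), ([0.8], 1),
    ([0.15], 0), ([0.85], 1), ([0.3], 0), ([0.7], 1),
]
score = cross_validated_f_measure(toy_pairs, k=4, train=train_threshold)
```

Passing a different `train` function, for example one wrapping a random forest library, would reuse the same evaluation loop; setting `k=100` would mirror the fold count reported in the abstract, provided the data set contains at least that many pairs.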
Document type:
Conference paper
Semantic Web Applications and Tools for Life Sciences (SWAT4LS) 2015, Dec 2015, Cambridge, United Kingdom. 2015, Proceedings of the Semantic Web Applications and Tools for Life Sciences (SWAT4LS) 2015

Cited literature [36 references]

https://hal.inria.fr/hal-01239568
Contributor: Adrien Coulet
Submitted on: Tuesday, 8 December 2015 - 17:54:29
Last modified on: Monday, 23 April 2018 - 15:07:42
Document(s) archived on: Saturday, 29 April 2017 - 09:21:19

File

SWAT4LS_2015_paper_5.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: hal-01239568, version 1

Citation

Kevin Dalleau, Ndeye Coumba Ndiaye, Adrien Coulet. Suggesting valid pharmacogenes by mining linked data. Semantic Web Applications and Tools for Life Sciences (SWAT4LS) 2015, Dec 2015, Cambridge, United Kingdom. 2015, Proceedings of the Semantic Web Applications and Tools for Life Sciences (SWAT4LS) 2015. 〈hal-01239568〉
