Integrative relational machine-learning for understanding drug side-effect profiles

Emmanuel Bresso 1, 2, * Renaud Grisoni 1 Gino Marchetti 1 Arnaud-Sinan Karaboga 2 Michel Souchet 2 Marie-Dominique Devignes 1 Malika Smaïl-Tabbone 1, *
* Corresponding author
1 ORPAILLEUR - Knowledge representation, reasoning
Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery
Abstract: Background
Drug side effects represent a common reason for stopping drug development during clinical trials. Improving our ability to understand drug side effects is necessary to reduce attrition rates during drug development as well as the risk of discovering novel side effects in available drugs. Today, most investigations deal with isolated side effects and overlook their possible redundancy and frequent co-occurrence.
Results
In this work, drug annotations are collected from the SIDER and DrugBank databases. Terms describing individual side effects reported in SIDER are clustered with a semantic similarity measure into term clusters (TCs). Maximal frequent itemsets are extracted from the resulting drug × TC binary table, leading to the identification of what we call side-effect profiles (SEPs). A SEP is defined as the longest combination of TCs that is shared by a significant number of drugs. Frequent SEPs are explored on the basis of integrated drug and target descriptors using two machine-learning methods: decision trees and inductive-logic programming. Although both methods yield explicit models, the inductive-logic-programming method performs relational learning and is able to exploit not only drug properties but also background knowledge. Learning efficiency is evaluated by cross-validation and direct testing with new molecules. Comparison of the two machine-learning methods shows that the inductive-logic-programming method displays a greater sensitivity than decision trees and successfully exploits background knowledge such as functional annotations and pathways of drug targets, thereby producing rich and expressive rules. All models and theories are available on a dedicated web site.
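To make the notion of a side-effect profile concrete, the extraction of maximal frequent itemsets from a drug × TC binary table can be sketched as follows. This is only a minimal brute-force illustration, not the procedure used in the paper: the drug names, TC labels, support threshold, and the helper functions `support` and `maximal_frequent_itemsets` are all hypothetical, and a real study would rely on a dedicated itemset-mining algorithm.

```python
from itertools import combinations

# Hypothetical toy data: each drug maps to the set of term clusters (TCs)
# it is annotated with. Names are illustrative, not taken from SIDER.
drug_tcs = {
    "drugA": {"TC1", "TC2", "TC3"},
    "drugB": {"TC1", "TC2", "TC3"},
    "drugC": {"TC1", "TC2"},
    "drugD": {"TC2", "TC4"},
    "drugE": {"TC4"},
}


def support(itemset, table):
    """Count the drugs annotated with every TC in the itemset."""
    return sum(1 for tcs in table.values() if itemset <= tcs)


def maximal_frequent_itemsets(table, min_support):
    """Brute force: enumerate all TC combinations, keep the frequent ones,
    then keep only those with no frequent proper superset."""
    items = sorted(set().union(*table.values()))
    frequent = [
        frozenset(candidate)
        for size in range(1, len(items) + 1)
        for candidate in combinations(items, size)
        if support(frozenset(candidate), table) >= min_support
    ]
    return [s for s in frequent if not any(s < t for t in frequent)]


# Assumed threshold: a profile must be shared by at least 2 drugs.
seps = maximal_frequent_itemsets(drug_tcs, min_support=2)
for sep in sorted(seps, key=sorted):
    print(sorted(sep), "shared by", support(sep, drug_tcs), "drugs")
```

In this toy table, smaller combinations such as TC1 with TC2 are frequent but are absorbed by a longer frequent combination, which is why only the maximal itemsets are retained as candidate SEPs. The exhaustive enumeration is exponential in the number of TCs and is only workable at this scale.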
Conclusions
Side-effect profiles covering a significant number of drugs have been extracted from a drug × side-effect association table. Integration of background knowledge concerning both chemical and biological spaces has been combined with a relational learning method for discovering rules which explicitly characterize drug-SEP associations. These rules are successfully used for predicting SEPs associated with new drugs.

https://hal.inria.fr/hal-00843914
Contributor: Ed. Bmc
Submitted on: Friday, July 12, 2013 - 13:49:31
Last modified on: Thursday, January 11, 2018 - 06:25:24
Document(s) archived on: Monday, October 14, 2013 - 11:05:56


Citation

Emmanuel Bresso, Renaud Grisoni, Gino Marchetti, Arnaud-Sinan Karaboga, Michel Souchet, et al. Integrative relational machine-learning for understanding drug side-effect profiles. BMC Bioinformatics, BioMed Central, 2013, 14 (1), pp.207. 〈10.1186/1471-2105-14-207〉. 〈hal-00843914〉
