Learning a Markov Logic network for supervised gene regulatory network inference

Céline Brouard 1, * Christel Vrain 2 Julie Dubois 1, 2 David Castel 3 Marie-Anne Debily 3, 4 Florence D'Alché-Buc 1, 5, *
* Auteur correspondant
5 AMIB - Algorithms and Models for Integrative Biology
CNRS - Centre National de la Recherche Scientifique : UMR8623, X - École polytechnique, Inria Saclay - Ile de France, UP11 - Université Paris-Sud - Paris 11, LRI - Laboratoire de Recherche en Informatique, LIX - Laboratoire d'informatique de l'École polytechnique [Palaiseau]
Abstract : Background
Gene regulatory network inference remains a challenging problem in systems biology despite the numerous approaches that have been proposed. When substantial knowledge on a gene regulatory network is already available, supervised network inference is appropriate. Such a method builds a binary classifier able to assign a class (Regulation/No regulation) to an ordered pair of genes. Once learnt, the pairwise classifier can be used to predict new regulations. In this work, we explore the framework of Markov Logic Networks (MLN) that combine features of probabilistic graphical models with the expressivity of first-order logic rules.
Results
We propose to learn a Markov Logic network, e.g. a set of weighted rules that conclude on the predicate "regulates", starting from a known gene regulatory network involved in the switch proliferation/differentiation of keratinocyte cells, a set of experimental transcriptomic data and various descriptions of genes all encoded into first-order logic. As training data are unbalanced, we use asymmetric bagging to learn a set of MLNs. The prediction of a new regulation can then be obtained by averaging predictions of individual MLNs. As a side contribution, we propose three in silico tests to assess the performance of any pairwise classifier in various network inference tasks on real datasets. A first test consists of measuring the average performance on balanced edge prediction problem; a second one deals with the ability of the classifier, once enhanced by asymmetric bagging, to update a given network. Finally our main result concerns a third test that measures the ability of the method to predict regulations with a new set of genes. As expected, MLN, when provided with only numerical discretized gene expression data, does not perform as well as a pairwise SVM in terms of AUPR. However, when a more complete description of gene properties is provided by heterogeneous sources, MLN achieves the same performance as a black-box model such as a pairwise SVM while providing relevant insights on the predictions.
Conclusions
The numerical studies show that MLN achieves very good predictive performance while opening the door to some interpretability of the decisions. Besides the ability to suggest new regulations, such an approach allows to cross-validate experimental data with existing knowledge.
Liste complète des métadonnées

https://hal.inria.fr/hal-00868767
Contributeur : Ed. Bmc <>
Soumis le : mardi 1 octobre 2013 - 22:00:24
Dernière modification le : jeudi 12 avril 2018 - 01:42:57
Document(s) archivé(s) le : lundi 6 janvier 2014 - 09:41:15

Identifiants

Citation

Céline Brouard, Christel Vrain, Julie Dubois, David Castel, Marie-Anne Debily, et al.. Learning a Markov Logic network for supervised gene regulatory network inference. BMC Bioinformatics, BioMed Central, 2013, 14 (1), pp.273. 〈10.1186/1471-2105-14-273〉. 〈hal-00868767〉

Partager

Métriques

Consultations de la notice

472

Téléchargements de fichiers

401