Identification of metabolic network models from incomplete high-throughput datasets

Sara Berthoumieux 1 Matteo Brilli 2, 3 Hidde Jong 1, * Daniel Kahn 3 Eugenio Cinquemani 1
* Auteur correspondant
1 IBIS - Modeling, simulation, measurement, and control of bacterial regulatory networks
LAPM - Laboratoire Adaptation et pathogénie des micro-organismes [Grenoble], Inria Grenoble - Rhône-Alpes, Institut Jean Roget
2 BAMBOO - An algorithmic view on genomes, cells, and environments
Inria Grenoble - Rhône-Alpes, LBBE - Laboratoire de Biométrie et Biologie Evolutive
Abstract : Motivation: High-throughput measurement techniques for metabolism and gene expression provide a wealth of information for the identification of metabolic network models. Yet, missing observations scattered over the dataset restrict the number of effectively available datapoints and make classical regression techniques inaccurate or inapplicable. Thorough exploitation of the data by identification techniques that explicitly cope with missing observations is therefore of major importance. Results: We develop a maximum-likelihood approach for the estimation of unknown parameters of metabolic network models that relies on the integration of statistical priors to compensate for the missing data. In the context of the linlog metabolic modeling framework, we implement the identification method by an Expectation-Maximization (EM) algorithm and by a simpler direct numerical optimization method. We evaluate performance of our methods by comparison to existing approaches, and show that our EM method provides the best results over a variety of simulated scenarios. We then apply the EM algorithm to a real problem, the identification of a model for the Escherichia coli central carbon metabolism, based on challenging experimental data from the literature. This leads to promising results and allows us to highlight critical identification issues.
Liste complète des métadonnées

https://hal.inria.fr/hal-00793039
Contributeur : Gaëlle Rivérieux <>
Soumis le : jeudi 21 février 2013 - 14:33:33
Dernière modification le : jeudi 28 juin 2018 - 14:35:59

Lien texte intégral

Identifiants

Collections

Citation

Sara Berthoumieux, Matteo Brilli, Hidde Jong, Daniel Kahn, Eugenio Cinquemani. Identification of metabolic network models from incomplete high-throughput datasets. Bioinformatics, Oxford University Press (OUP), 2011, 27, pp.i186-i195. 〈10.1093/bioinformatics/btr225〉. 〈hal-00793039〉

Partager

Métriques

Consultations de la notice

297