inria-00628038, version 1
Using Kendall-Tau Meta-Bagging to Improve Protein-Protein Docking Predictions
PRIB 2011 (2011) 284-295
Abstract: Predicting the three-dimensional (3D) structures of macromolecular protein-protein complexes from the structures of the individual partners (docking) is a major challenge for computational biology. Most docking algorithms use two largely independent stages. First, a fast sampling stage generates a large number (millions or even billions) of candidate conformations. Then, a scoring stage evaluates these conformations and extracts a small ensemble amongst which a good solution is assumed to exist. Several strategies have been proposed for the scoring stage. However, correctly distinguishing native biological interfaces from false positives, and discarding the latter, remains a difficult task. Here, we introduce a new scoring algorithm based on learnt bootstrap aggregation ("bagging") models of protein shape complementarity. 3D Voronoi diagrams are used to describe and encode the surface shapes and physico-chemical properties of proteins. A bagging method based on Kendall-τ distances is then used to minimise the pairwise disagreements between the ranks of the elements obtained from several different bagging approaches. We apply this method to the protein docking problem using 51 protein complexes from the standard Protein Docking Benchmark. Overall, our approach improves the ranks of near-native conformations and results in more biologically relevant predictions.
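To make the Kendall-τ idea in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It counts the item pairs that two rankings order differently, then finds by brute force the ranking that minimises the total number of such disagreements with a set of input rankings. The function names and the conformation labels `c1` to `c4` are hypothetical, and the exhaustive search is only an illustration that works for very small inputs; the abstract does not say how the paper performs the minimisation.

```python
from itertools import combinations, permutations


def kendall_tau_distance(rank_a, rank_b):
    """Count the item pairs that the two rankings place in opposite order."""
    pos_a = {item: i for i, item in enumerate(rank_a)}
    pos_b = {item: i for i, item in enumerate(rank_b)}
    return sum(
        1
        for x, y in combinations(rank_a, 2)
        if (pos_a[x] - pos_a[y]) * (pos_b[x] - pos_b[y]) < 0
    )


def consensus_ranking(rankings):
    """Return the permutation with the smallest total Kendall-tau distance.

    Exhaustive search over all permutations: illustrative only, since the
    cost grows factorially with the number of ranked items.
    """
    items = rankings[0]
    return min(
        permutations(items),
        key=lambda cand: sum(kendall_tau_distance(cand, r) for r in rankings),
    )


# Hypothetical rankings of four candidate conformations, e.g. one ranking
# per scoring model (best candidate first).
rankings = [
    ("c1", "c2", "c3", "c4"),
    ("c2", "c1", "c3", "c4"),
    ("c1", "c3", "c2", "c4"),
]
best = consensus_ranking(rankings)
```

In this toy input, the first ranking disagrees with each of the other two on exactly one pair, and it is also the consensus returned by the search.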
- 1: CNRS : UMR8623 – Université Paris XI - Paris Sud
- 2: INRIA – Polytechnique - X – CNRS : UMR8623 – Université Paris XI - Paris Sud
- 3: INRIA – CNRS : UMR7503 – Université Henri Poincaré - Nancy I – Université Nancy II – Institut National Polytechnique de Lorraine (INPL)
- 4: Université de Montréal
- 5: CNRS : UMR6175 – Institut national de la recherche agronomique (INRA) : UR0085 – Université François Rabelais - Tours
- Domain : Computer Science/Learning – Life Sciences/Quantitative Methods – Computer Science/Bioinformatics
- Keywords : Bagging – Docking – Machine Learning – Kendall-Tau distance
- http://hal.inria.fr/inria-00628038
- oai:hal.inria.fr:inria-00628038
- From:
- Submitted on: Friday, 30 September 2011 11:35:27
- Updated on: Friday, 30 September 2011 11:35:27



