28605 articles – 22093 Notices  [english version]

inria-00536624, version 1

Maximum Contact Map Overlap Revisited

Rumen Andonov (Auteur à contacter de préférence) a1, Noël Malod-Dognin (), Nicola Yanev () 2

Journal of Computational Biology 18, 1 (2011) 1-15

Résumé : Among the measures for quantifying the similarity between three-dimensional (3D) protein structures, maximum contact map overlap (CMO) received sustained attention during the past decade. Despite this, the known algorithms exhibit modest performance and are not applicable for large-scale comparison. This article offers a clear advance in this respect. We present a new integer programming model for CMO and propose an exact branch-andbound algorithm with bounds obtained by a novel Lagrangian relaxation. The efficiency of the approach is demonstrated on a popular small benchmark (Skolnick set, 40 domains). On this set, our algorithm significantly outperforms the best existing exact algorithms. Many hard CMO instances have been solved for the first time. To further assess our approach, we constructed a large-scale set of 300 protein domains. Computing the similarity measure for any of the 44850 pairs, we obtained a classification in excellent agreement with SCOP. Supplementary Material is available at www.liebertonline.com/cmb.

  • a –  Université de Rennes I
  • 1 :  SYMBIOSE (INRIA - IRISA)
  • CNRS : UMR6074 – INRIA – Institut National des Sciences Appliquées (INSA) - Rennes – Université de Rennes 1
  • 2 :  University of Sofia
  • Bulgarian Academy of Sciences
  • Domaine : Informatique/Bio-informatique
    Sciences du Vivant/Bio-Informatique, Biologie Systémique
    Informatique/Recherche opérationnelle
  • Mots-clés : branch-and-bound – combinatorial optimization – contact map overlap – integer programming – Lagrangian relaxation – protein structure alignment
 
  • inria-00536624, version 1
  • oai:hal.inria.fr:inria-00536624
  • Contributeur : 
  • Soumis le : Mardi 16 Novembre 2010, 15:44:16
  • Dernière modification le : Vendredi 19 Novembre 2010, 10:26:08