28560 articles – 22057 references  [version française]

hal-00685824, version 1

Optimal DALI protein structure alignment

Inken Wohlers () b1, Rumen Andonov () a2, Gunnar W. Klau () b1

N° RR-7915 (2012)

Abstract: We present a mathematical model and exact algorithm for protein structure alignment using dali scoring, which is an NP-hard problem. dali scoring is based on comparing the inter-residue distance matrices of proteins and is the scoring model of the widely used heuristic dali program. Our model and algorithm extend an integer linear programming approach previously used for the related contact map overlap problem. To this end, we introduce a novel type of constraint that handles negative structure scores and relax it in a Lagrangian fashion. We also review options that allow to consider less pairs of inter-residue distances explicitly, because their large number makes it difficult to optimize dali scoring optimally. We use our exact algorithm dalix to compute many provably score-optimal dali alignments for the first time, using four data sets of varying structural similarity. Further, using our exact dalix alignments, it is for the very first time possible to qualitatively benchmark the heuristic dali program in sound mathematical terms. The results indicate that dali often computes optimal or close to optimal alignments, but also that in cases of aligning small proteins it tends to fail generating

  • a –  Université de Rennes I
  • b –  CWI
  • 1:  Life Sciences (MAC4)
  • Centrum Wiskunde & Informatica
  • 2:  GENSCALE (INRIA - IRISA)
  • INRIA – CNRS : UMR6074 – Université de Rennes 1 – École normale supérieure de Cachan - ENS Cachan
  • Domain : Computer Science/Bioinformatics
    Life Sciences/Quantitative Methods
  • Keywords : structure alignment – inter-residue distance matrix – exact algorithm – integer linear program – Lagrangian relaxation – DALI
  • Internal note : RR-7915
  • Available versions :  v1 (2012-04-06) v2 (2012-04-27)
 
  • hal-00685824, version 1
  • oai:hal.inria.fr:hal-00685824
  • From: 
  • Submitted on: Friday, 6 April 2012 10:19:32
  • Updated on: Friday, 6 April 2012 15:54:09