Back-translation for discovering distant protein homologies

Marta Gîrdea 1, 2 Laurent Noé 1, 2, * Gregory Kucherov 1, 2
* Auteur correspondant
2 SEQUOIA - Sequential Learning
LIFL - Laboratoire d'Informatique Fondamentale de Lille, Inria Lille - Nord Europe
Abstract : Frameshift mutations in protein-coding DNA sequences produce a drastic change in the resulting protein sequence, which prevents classic protein alignment methods from revealing the proteins' common origin. Moreover, when a large number of substitutions are additionally involved in the divergence, the homology detection becomes difficult even at the DNA level. To cope with this situation, we propose a novel method to infer distant homology relations of two proteins, that accounts for frameshift and point mutations that may have affected the coding sequences. We design a dynamic programming alignment algorithm over memory-efficient graph representations of the complete set of putative DNA sequences of each protein, with the goal of determining the two putative DNA sequences which have the best scoring alignment under a powerful scoring system designed to reflect the most probable evolutionary process. This allows us to uncover evolutionary information that is not captured by traditional alignment methods, which is confirmed by biologically significant examples.
Type de document :
Communication dans un congrès
the 9th International Workshop in Algorithms in Bioinformatics (WABI), Sep 2009, Philadelphia, United States. 5724, pp.108-120, 2009, Lecture Notes in Computer Science. 〈http://www.springerlink.com/content/3236004m84465n7j/〉. 〈10.1007/978-3-642-04241-6_10〉
Liste complète des métadonnées

Littérature citée [23 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00448741
Contributeur : Laurent Noé <>
Soumis le : lundi 25 janvier 2010 - 21:43:54
Dernière modification le : mardi 6 mars 2018 - 17:40:54
Document(s) archivé(s) le : jeudi 17 juin 2010 - 23:04:03

Fichiers

patharticle.pdf
Fichiers éditeurs autorisés sur une archive ouverte

Identifiants

Citation

Marta Gîrdea, Laurent Noé, Gregory Kucherov. Back-translation for discovering distant protein homologies. the 9th International Workshop in Algorithms in Bioinformatics (WABI), Sep 2009, Philadelphia, United States. 5724, pp.108-120, 2009, Lecture Notes in Computer Science. 〈http://www.springerlink.com/content/3236004m84465n7j/〉. 〈10.1007/978-3-642-04241-6_10〉. 〈inria-00448741〉

Partager

Métriques

Consultations de la notice

407

Téléchargements de fichiers

155