Amplitude spectrum distance: measuring the global shape divergence of protein fragments

Clovis Galiez 1, François Coste 1
1 Dyliss - Dynamics, Logics and Inference for biological Systems and Sequences
Abstract:

Background: In structural bioinformatics, there is increasing interest in identifying and understanding the evolution of local protein structures regarded as key structural or functional protein building blocks. A central need is then to compare these, possibly short, fragments by measuring their (dis)similarity efficiently and accurately. Progress towards this goal has produced scores that can assess strong similarity between fragments. Yet there is still a lack of more progressive scores, with meaningful intermediate values, for the comparison, retrieval or clustering of distantly related fragments.

Results: We introduce here the Amplitude Spectrum Distance (ASD), a novel way of comparing protein fragments based on the discrete Fourier transform of their Cα distance matrix. Defined as the distance between their amplitude spectra, ASD can be computed efficiently and provides a parameter-free measure of the global shape dissimilarity of two fragments. ASD has useful theoretical properties: it is tolerant to shifts, insertions, deletions, circular permutations or sequence reversals while satisfying the triangle inequality. The practical interest of ASD with respect to RMSD, RMSDd, BC and TM scores is illustrated through zinc finger retrieval experiments and concrete structure examples. The benefits of ASD are also illustrated by two additional clustering experiments: domain linker fragments and complementarity-determining regions of antibodies.

Conclusions: Taking advantage of the Fourier transform to compare fragments at a global shape level, ASD is an objective and progressive measure that takes the whole fragments into account. Its practical computation time and its properties make ASD particularly relevant for applications requiring meaningful measures on distantly related protein fragments, such as the retrieval of similar fragments where high recall is required, as shown in the experiments, or for any application that also takes advantage of the triangle inequality, such as fragment clustering. ASD program and source code are freely available at:
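The definition given in the abstract (distance between the amplitude spectra of the discrete Fourier transform of the Cα distance matrices) can be illustrated with a minimal Python sketch. This is not the authors' program. It assumes two fragments of equal length, a 2D DFT of the full distance matrix, and a plain Euclidean (Frobenius) norm between the two spectra; any normalization or handling of fragments of different lengths used in the published method is not reproduced here. The function names `distance_matrix`, `amplitude_spectrum` and `asd_sketch` are hypothetical.

```python
import numpy as np


def distance_matrix(coords):
    """Pairwise Euclidean distances between C-alpha coordinates (n x 3 array)."""
    coords = np.asarray(coords, dtype=float)
    diff = coords[:, None, :] - coords[None, :, :]
    return np.linalg.norm(diff, axis=-1)


def amplitude_spectrum(coords):
    """Modulus of the 2D discrete Fourier transform of the distance matrix."""
    return np.abs(np.fft.fft2(distance_matrix(coords)))


def asd_sketch(coords_a, coords_b):
    """Illustrative amplitude-spectrum distance for two equal-length fragments.

    Assumption: unnormalized Euclidean norm of the spectrum difference;
    the published ASD may normalize or pad differently.
    """
    spec_a = amplitude_spectrum(coords_a)
    spec_b = amplitude_spectrum(coords_b)
    if spec_a.shape != spec_b.shape:
        raise ValueError("this sketch only handles fragments of equal length")
    return float(np.linalg.norm(spec_a - spec_b))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    fragment = rng.normal(size=(12, 3))      # synthetic coordinates, not real data
    permuted = np.roll(fragment, 3, axis=0)  # circular permutation of residues
    reversed_ = fragment[::-1]               # sequence reversal
    print(asd_sketch(fragment, permuted))
    print(asd_sketch(fragment, reversed_))
```

Under these assumptions, a circular permutation or a reversal of the residue order only changes the phase of the Fourier coefficients, so the sketch returns a value that is zero up to floating-point error in both cases. Because the result is a norm of a difference between two spectra, the triangle inequality holds as well.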
Document type:
Journal article
BMC Bioinformatics, BioMed Central, 2015, 16 (1), pp.16. 〈10.1186/s12859-015-0693-y〉

Cited literature [16 references]
Contributor: François Coste
Submitted on: Monday, 12 October 2015 - 14:08:26
Last modified on: Wednesday, 16 May 2018 - 11:23:35
Document(s) archived on: Thursday, 27 April 2017 - 00:02:12


Publication funded by an institution



Clovis Galiez, François Coste. Amplitude spectrum distance: measuring the global shape divergence of protein fragments. BMC Bioinformatics, BioMed Central, 2015, 16 (1), pp.16. 〈10.1186/s12859-015-0693-y〉. 〈hal-01214482〉




