Associating Gene Ontology Terms with Pfam Protein Domains

Seyed Ziaeddin Alborzi 1 Marie-Dominique Devignes 1 David Ritchie 1
1 CAPSID - Computational Algorithms for Protein Structures and Interactions
Inria Nancy - Grand Est, LORIA - AIS - Department of Complex Systems, Artificial Intelligence & Robotics
Abstract : With the growing number of three-dimensional protein structures in the protein data bank (PDB), there is a need to annotate these structures at the domain level in order to relate protein structure to protein function. Thanks to the SIFTS database, many PDB chains are now cross-referenced with Pfam domains and Gene ontology (GO) terms. However, these annotations do not include any explicit relationship between individual Pfam domains and GO terms. Therefore, creating a direct mapping between GO terms and Pfam domains will provide a new and more detailed level of protein structure annotation. This article presents a novel content-based filtering method called GODM that can automatically infer associations between GO terms and Pfam domains directly from existing GO-chain/Pfam-chain associations from the SIFTS database and GO-sequence/Pfam-sequence associations from the UniProt databases. Overall, GODM finds a total of 20,318 non-redundant GO-Pfam associations with a F-measure of 0.98 with respect to the InterPro database, which is treated here as a “Gold Standard”. These associations could be used to annotate thousands of PDB chains or protein sequences for which their domain composition is known but which currently lack any GO annotation. The GODM database is publicly available at http://godm.loria.fr/
Type de document :
Communication dans un congrès
Ignacio Rojas; Francisco Ortuño. 5th International Work-Conference on Bioinformatics and Biomedical Engineering - IWBBIO 2017, Apr 2017, Granada, Spain. Springer, Lecture Notes in Computer Science, 10209, pp.127-138, 2017, Bioinformatics and Biomedical Engineering. 〈http://iwbbio.ugr.es/〉. 〈10.1007/978-3-319-56154-7_13〉
Liste complète des métadonnées

Littérature citée [18 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/hal-01531204
Contributeur : David Ritchie <>
Soumis le : vendredi 2 juin 2017 - 13:49:58
Dernière modification le : jeudi 11 janvier 2018 - 06:27:31
Document(s) archivé(s) le : mercredi 13 décembre 2017 - 10:14:34

Fichier

godm_02_feb_2017.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Citation

Seyed Ziaeddin Alborzi, Marie-Dominique Devignes, David Ritchie. Associating Gene Ontology Terms with Pfam Protein Domains. Ignacio Rojas; Francisco Ortuño. 5th International Work-Conference on Bioinformatics and Biomedical Engineering - IWBBIO 2017, Apr 2017, Granada, Spain. Springer, Lecture Notes in Computer Science, 10209, pp.127-138, 2017, Bioinformatics and Biomedical Engineering. 〈http://iwbbio.ugr.es/〉. 〈10.1007/978-3-319-56154-7_13〉. 〈hal-01531204〉

Partager

Métriques

Consultations de la notice

236

Téléchargements de fichiers

261