How to measure the topological quality of protein parse trees? - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Communication Dans Un Congrès Année : 2018

How to measure the topological quality of protein parse trees?

Résumé

Human readability and, consequently, interpretability is often considered a key advantage of grammatical descriptors. Beyond the natural language, this is also true in analyzing biological sequences of RNA, typically modeled by grammars of at least context-free level of expressiveness. However, in protein sequence analysis, the explanatory power of grammatical descriptors beyond regular has never been thoroughly assessed. Since the biological meaning of a protein molecule is directly related to its spatial structure, it is justified to expect that the parse tree of a protein sequence reflects the spatial structure of the protein. In this piece of research, we propose and assess quantitative measures for comparing topology of the parse tree of a context-free grammar with topology of the protein structure succinctly represented by a contact map. Our results are potentially interesting beyond its bioinformatic context wherever a reference matrix of dependencies between sequence constituents is available.
Fichier principal
Vignette du fichier
pyzik18.pdf (1.7 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01938608 , version 1 (28-11-2018)

Identifiants

  • HAL Id : hal-01938608 , version 1

Citer

Mateusz Pyzik, François Coste, Witold Dyrka. How to measure the topological quality of protein parse trees?. ICGI 2018 - 14th International Conference on Grammatical Inference, Sep 2018, Wroclaw, Poland. pp.118 - 138. ⟨hal-01938608⟩
62 Consultations
74 Téléchargements

Partager

Gmail Facebook X LinkedIn More