Stochastic Sampling of Structural Contexts Improves the Scalability and Accuracy of RNA 3D Modules Identification

Abstract : RNA structures possess multiple levels of structural organization. Secondary structures are made of canonical (i.e. Watson-Crick and wobble) helices, connected by loops whose local conformations are critical determinants of global 3D architectures. Such local 3D structures consist of conserved sets of non-canonical base pairs, called RNA modules. Their prediction from sequence data is thus a milestone toward 3D structure modelling. Unfortunately, the computational efficiency and scope of current 3D module identification methods are still too limited to benefit from all the knowledge accumulated in module databases. Here, we introduce BayesPairing 2, a new sequence search algorithm that leverages secondary structure tree decomposition to reduce computational complexity and improve predictions on new sequences. We benchmarked our methods on 75 modules and 6360 RNA sequences, and report accuracies that are comparable to the state of the art, with considerable running time improvements. When identifying 200 modules on a single sequence, BayesPairing 2 is over 100 times faster than its previous version, opening new doors for genome-wide applications.
Document type :
Conference papers

Cited literature [39 references]

https://hal.inria.fr/hal-02354733
Contributor : Yann Ponty
Submitted on : Thursday, November 7, 2019 - 9:30:01 PM
Last modification on : Thursday, July 30, 2020 - 11:12:36 AM
Document(s) archived on : Sunday, February 9, 2020 - 12:15:24 AM

File

BayesPairing2_recomb_submitted...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-02354733, version 1

Citation

Roman Sarrazin-Gendron, Hua-Ting Yao, Vladimir Reinharz, Carlos Oliver, Yann Ponty, et al. Stochastic Sampling of Structural Contexts Improves the Scalability and Accuracy of RNA 3D Modules Identification. RECOMB 2020 - 24th Annual International Conference on Research in Computational Molecular Biology, May 2020, Padova, Italy. ⟨hal-02354733⟩
