Using Structural and Evolutionary Information to Detect and Correct Pyrosequencing Errors in Noncoding RNAs.

Vladimir Reinharz 1 Yann Ponty 2, 3, * Jérôme Waldispühl 4, *
* Corresponding author
3 AMIB - Algorithms and Models for Integrative Biology
LIX - Laboratoire d'informatique de l'École polytechnique [Palaiseau], LRI - Laboratoire de Recherche en Informatique, UP11 - Université Paris-Sud - Paris 11, Inria Saclay - Ile de France
Abstract : The analysis of the sequence-structure relationship in RNA molecules is not only essential for evolutionary studies but also for concrete applications such as error-correction in next generation sequencing (NGS) technologies. The prohibitive sizes of the mutational and conformational landscapes, combined with the volume of data to process, require efficient algorithms to compute sequence-structure properties. In this article, we address the correction of NGS errors by calculating which mutations most increase the likelihood of a sequence to a given structure and RNA family. We introduce RNApyro, an efficient, linear time and space inside-outside algorithm that computes exact mutational probabilities under secondary structure and evolutionary constraints given as a multiple sequence alignment with a consensus structure. We develop a scoring scheme combining classical stacking base-pair energies to novel isostericity scores and apply our techniques to correct pointwise errors in 5s and 16s rRNA sequences. Our results suggest that RNApyro is a promising algorithm to complement existing tools in the NGS error-correction pipeline.
Liste complète des métadonnées

Cited literature [21 references]  Display  Hide  Download
Contributor : Yann Ponty <>
Submitted on : Thursday, May 30, 2013 - 11:14:17 AM
Last modification on : Wednesday, March 27, 2019 - 4:41:29 PM
Document(s) archivé(s) le : Tuesday, April 4, 2017 - 1:42:51 PM


Files produced by the author(s)




Vladimir Reinharz, Yann Ponty, Jérôme Waldispühl. Using Structural and Evolutionary Information to Detect and Correct Pyrosequencing Errors in Noncoding RNAs.. Journal of Computational Biology, Mary Ann Liebert, 2013, 20 (11), pp.905-19. ⟨10.1089/cmb.2013.0085⟩. ⟨hal-00828062⟩



Record views


Files downloads