Accurate prediction of the statistics of repetitions in random sequences : a case study in Archae genomes

Mireille Regnier 1, 2 Philippe Chassignet 1, 2
2 AMIB - Algorithms and Models for Integrative Biology
LIX - Laboratoire d'informatique de l'École polytechnique [Palaiseau], LRI - Laboratoire de Recherche en Informatique, UP11 - Université Paris-Sud - Paris 11, Inria Saclay - Ile de France
Abstract : Repetitive patterns in genomic sequences have great biological significance as well as algorithmic implications. This paper contributes to a precise knowledge of the length of repetitions in a random sequence. The results are easily computable and are shown to be accurate for common sequence lengths. As an application, the sample case of Archae genomes illustrates how biological sequences may differ from random sequences.
Document type : Preprints, Working Papers, ...

https://hal.inria.fr/hal-01253628
Contributor : Mireille Regnier
Submitted on : Monday, January 11, 2016 - 10:31:00 AM
Last modification on : Friday, April 26, 2019 - 2:56:45 PM

Identifiers

  • HAL Id : hal-01253628, version 1

Citation

Mireille Regnier, Philippe Chassignet. Accurate prediction of the statistics of repetitions in random sequences : a case study in Archae genomes. 2015. ⟨hal-01253628⟩
