Rare Events and Conditional Events on Random Strings - Archive ouverte HAL Access content directly
Journal Articles Discrete Mathematics and Theoretical Computer Science Year : 2004

Rare Events and Conditional Events on Random Strings

(1) , (2)
1
2
Mireille Régnier
  • Function : Author
Alain Denise

Abstract

Some strings -the texts- are assumed to be randomly generated, according to a probability model that is either a Bernoulli model or a Markov model. A rare event is the over or under-representation of a word or a set of words. The aim of this paper is twofold. First, a single word is given. One studies the tail distribution of the number of its occurrences. Sharp large deviation estimates are derived. Second, one assumes that a given word is overrepresented. The distribution of a second word is studied; formulae for the expectation and the variance are derived. In both cases, the formulae are accurate and actually computable. These results have applications in computational biology, where a genome is viewed as a text.
Fichier principal
Vignette du fichier
dm060203.pdf (173.33 Ko) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

hal-00959004 , version 1 (13-03-2014)

Identifiers

Cite

Mireille Régnier, Alain Denise. Rare Events and Conditional Events on Random Strings. Discrete Mathematics and Theoretical Computer Science, 2004, Vol. 6 no. 2 (2), pp.191-214. ⟨10.46298/dmtcs.310⟩. ⟨hal-00959004⟩
93 View
644 Download

Altmetric

Share

Gmail Facebook Twitter LinkedIn More