Rare Events and Conditional Events on Random Strings - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue Discrete Mathematics and Theoretical Computer Science Année : 2004

Rare Events and Conditional Events on Random Strings

Mireille Régnier
  • Fonction : Auteur
Alain Denise

Résumé

Some strings -the texts- are assumed to be randomly generated, according to a probability model that is either a Bernoulli model or a Markov model. A rare event is the over or under-representation of a word or a set of words. The aim of this paper is twofold. First, a single word is given. One studies the tail distribution of the number of its occurrences. Sharp large deviation estimates are derived. Second, one assumes that a given word is overrepresented. The distribution of a second word is studied; formulae for the expectation and the variance are derived. In both cases, the formulae are accurate and actually computable. These results have applications in computational biology, where a genome is viewed as a text.
Fichier principal
Vignette du fichier
dm060203.pdf (173.33 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-00959004 , version 1 (13-03-2014)

Identifiants

Citer

Mireille Régnier, Alain Denise. Rare Events and Conditional Events on Random Strings. Discrete Mathematics and Theoretical Computer Science, 2004, Vol. 6 no. 2 (2), pp.191-214. ⟨10.46298/dmtcs.310⟩. ⟨hal-00959004⟩
97 Consultations
802 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More