Skip to Main content Skip to Navigation
Conference papers

Weighted random generation of context-free languages: Analysis of collisions in random urn occupancy models

Danièle Gardy 1 Yann Ponty 2, 3, *
* Corresponding author
2 AMIB - Algorithms and Models for Integrative Biology
LIX - Laboratoire d'informatique de l'École polytechnique [Palaiseau], LRI - Laboratoire de Recherche en Informatique, UP11 - Université Paris-Sud - Paris 11, Inria Saclay - Ile de France
Abstract : The present work analyzes the redundancy of sets of combinatorial objects produced by a weighted random generation algorithm proposed by Denise et al. This scheme associates weights to the terminals symbols of a weighted context-free grammar, extends this weight definition multiplicatively on words, and draws words of length $n$ with probability proportional their weight. We investigate the level of redundancy within a sample of $k$ word, the proportion of the total probability covered by $k$ words (coverage), the time (number of generations) of the first collision, and the time of the full collection. For these four questions, we use an analytic urn analogy to derive asymptotic estimates and/or polynomially computable exact forms. We illustrate these tools by an analysis of an RNA secondary structure statistical sampling algorithm introduced by Ding et al.
Document type :
Conference papers
Complete list of metadata

Cited literature [25 references]  Display  Hide  Download
Contributor : Yann Ponty Connect in order to contact the contributor
Submitted on : Monday, December 6, 2010 - 9:25:47 AM
Last modification on : Wednesday, October 20, 2021 - 12:24:14 AM
Long-term archiving on: : Monday, March 7, 2011 - 2:31:33 AM


Files produced by the author(s)


  • HAL Id : inria-00543150, version 1
  • ARXIV : 1012.1129



Danièle Gardy, Yann Ponty. Weighted random generation of context-free languages: Analysis of collisions in random urn occupancy models. GASCOM - 8th conference on random generation of combinatorial structures - 2010, LACIM, UQAM, Sep 2010, Montréal, Canada. 14pp. ⟨inria-00543150⟩



Les métriques sont temporairement indisponibles