Counting, generating, analyzing and sampling tree alignments

Cedric Chauve; Julien Courtiel; Yann Ponty

Article Dans Une Revue International Journal of Foundations of Computer Science Année : 2018

Counting, generating, analyzing and sampling tree alignments

(1) , (2) , (3, 4, 5)

1
2
3
4
5

Cedric Chauve

Fonction : Auteur
PersonId : 846009
ORCID : 0000-0001-9837-1878

Department of Mathematics [Burnaby]

Julien Courtiel

Fonction : Auteur
PersonId : 169755
IdHAL : julien-courtiel
ORCID : 0000-0002-3441-2818
IdRef : 184309263

Laboratoire d'Informatique de Paris-Nord

Yann Ponty

Fonction : Auteur
PersonId : 3138
IdHAL : yann-ponty
ORCID : 0000-0002-7615-3930
IdRef : 113491611

Algorithms and Models for Integrative Biology

Laboratoire d'informatique de l'École polytechnique [Palaiseau]

Algorithms and Models for Integrative BIOlogy

Résumé

Pairwise ordered tree alignment are combinatorial objects that appear in important applications , such as RNA secondary structure comparison. However, the usual representation of tree alignments as supertrees is ambiguous, i.e. two distinct supertrees may induce identical sets of matches between identical pairs of trees. This ambiguity is uninformative, and detrimental to any probabilistic analysis. In this work, we consider tree alignments up to equivalence. Our first result is a precise asymptotic enumeration of tree alignments, obtained from a context-free grammar by mean of basic analytic combinatorics. Our second result focuses on alignments between two given ordered trees S and T. By refining our grammar to align specific trees, we obtain a decomposition scheme for the space of alignments, and use it to design an efficient dynamic programming algorithm for sampling alignments under the Gibbs-Boltzmann probability distribution. This generalizes existing tree alignment algorithms, and opens the door for a probabilistic analysis of the space of suboptimal alignments.

Mots clés

Tree alignments Analytic combinatorics Gibbs/Bolzmann sampling Average- case complexity analysis 1

Domaines

Algorithme et structure de données [cs.DS] Mathématique discrète [cs.DM]

Fichier principal

main.pdf (676.55 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Yann Ponty : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01500116

Soumis le : dimanche 2 avril 2017-12:50:39

Dernière modification le : mercredi 3 avril 2024-11:42:03

Archivage à long terme le : lundi 3 juillet 2017-13:20:51

Dates et versions

hal-01500116 , version 1 (02-04-2017)

Licence

Paternité - Pas d'utilisation commerciale

Identifiants

HAL Id : hal-01500116 , version 1

Citer

Cedric Chauve, Julien Courtiel, Yann Ponty. Counting, generating, analyzing and sampling tree alignments. International Journal of Foundations of Computer Science, 2018, 29 (5), pp.741--767. ⟨hal-01500116⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

X UNIV-PARIS13 CNRS INRIA LIX X-LIX X-DEP-INFO LIPN INRIA2 TDS-MACS USPC UNIV-PARIS-SACLAY GALILE SORBONNE-PARIS-NORD GS-COMPUTER-SCIENCE ACT-R

1914 Consultations

243 Téléchargements

Counting, generating, analyzing and sampling tree alignments

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Partager