Analysis of pattern overlaps and exact computation of P-values of pattern occurrences numbers: case of Hidden Markov Models - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue Algorithms for Molecular Biology Année : 2014

Analysis of pattern overlaps and exact computation of P-values of pattern occurrences numbers: case of Hidden Markov Models

Résumé

We present a novel algorithm, SufPref, computing an exact pvalue for Hidden Markov models (HMM). The algorithm inductively traverses specific data structure, the overlap graph. Nodes of the graph are associated with the overlaps of words from a given set H. Edges are associated to the prefix and suffix relations between ovelaps. An originality of our data structure is that pattern H need not be explicitly represented in nodes or leaves. The algorithm relies on the Cartesian product of the overlap graph and the graph of HMM states; the approach is analogous to a weighted automaton approach. The gain in size of SufPref data structure leads to significant space and time complexity improvements. We suppose that all words in the pattern H are of the same length m. The algorithm SufPref was implemented as a C++ program; it can be used both as Web-server and a stand alone program for Linux and Windows.
Cet article présente un nouvel algorithme, SufPref, qui calcule la pvaleur pour un ensemble de mots qui prend en compte les modèles de Markov cachés (HMM). Il est implémenté en C++ et peut être utilisé en ligne ou téléchargé.
Fichier principal
Vignette du fichier
jwtemplate.pdf (328.03 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-00858701 , version 1 (05-09-2013)

Identifiants

Citer

Mireille Régnier, Evgenia Furletova, Victor Yakovlev, Mikhail Roytberg. Analysis of pattern overlaps and exact computation of P-values of pattern occurrences numbers: case of Hidden Markov Models. Algorithms for Molecular Biology, 2014, 9 (1), ⟨10.1186/s13015-014-0025-1⟩. ⟨hal-00858701⟩
425 Consultations
250 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More