Local Decoding of Sequences and Alignment-Free Comparison - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue Journal of Computational Biology Année : 2006

Local Decoding of Sequences and Alignment-Free Comparison

Résumé

Subword composition plays an important role in a lot of analyses of sequences. Here we define and study the "local decoding of order N of sequences," an alternative that avoids some drawbacks of "subwords of length N" approaches while keeping informations about environments of length N in the sequences ("decoding" is taken here in the sense of hidden Markov modeling, i.e., associating some state to all positions of the sequence). We present an algorithm for computing the local decoding of order N of a given set of sequences. Its complexity is linear in the total length of the set (whatever the order N) both in time and memory space. In order to show a use of local decoding, we propose a very basic dissimilarity measure between sequences which can be computed both from local decoding of order N and composition in subwords of length N. The accuracies of these two dissimilarities are evaluated, over several datasets, by computing their linear correlations with a reference alignment-based distance. These accuracies are also compared to the one obtained from another recent alignment-free comparison.
Fichier non déposé

Dates et versions

inria-00289089 , version 1 (19-06-2008)

Identifiants

Citer

Gilles Didier, Ivan Laprevotte, Maude Pupin, Alain Hénaut. Local Decoding of Sequences and Alignment-Free Comparison. Journal of Computational Biology, 2006, 13 (8), pp.1465-1476. ⟨10.1089/cmb.2006.13.1465⟩. ⟨inria-00289089⟩
106 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More