inria-00179719, version 1
Indexing gapped-factors using a tree
Pierre Peterlongo
a, 1Julien Allali
b, 2Marie-France Sagot
a, 3
International Journal of Foundation of Computer Science (2007)
Résumé : We present a data structure to index a specific kind of factors, that is of substrings, called gapped-factors. A gapped-factor is a factor containing a gap that is ignored during the indexation. The data structure presented is based on the suffix tree and indexes all the gapped-factors of a text with a fixed size of gap, and only those. The construction of this data structure is done online in linear time and space. Such a data structure may play an important role in various pattern matching and motif inference problems, for instance in text filtration.
- a – INRIA
- b – CNRS
- 1 : SYMBIOSE (INRIA - IRISA)
- CNRS : UMR6074 – INRIA – INSA Rennes – Université de Rennes 1
- 2 : Laboratoire Bordelais de Recherche en Informatique (LaBRI)
- CNRS : UMR5800 – Université Sciences et Technologies - Bordeaux I – École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB) – Université Victor Segalen - Bordeaux II
- 3 : HELIX (INRIA Rhône-Alpes)
- INRIA – CNRS : UMR5558 – Université Claude Bernard - Lyon I
- Domaine : Informatique/Bio-informatique
Sciences du Vivant/Bio-Informatique, Biologie Systémique - Mots-clés : suffix tree – k-factor tree – string index – gapped-factor – gapped-factor tree
- inria-00179719, version 1
- http://hal.inria.fr/inria-00179719
- oai:hal.inria.fr:inria-00179719
- Contributeur : Pierre Peterlongo
- Soumis le : Mercredi 24 Octobre 2007, 12:19:38
- Dernière modification le : Mercredi 24 Octobre 2007, 14:18:48






Documents associés
Exporter