inria-00070629, version 1
A unifying framework for seed sensitivity and its application to subset seeds
Gregory Kucherov 1Laurent Noé
1Mikhail Roytberg
N° RR-5374 (2004)
Abstract: We propose a general approach to compute the seed sensitivity, that can be applied to different definitions of seeds. It treats separately three components of the seed sensitivity problem -- a set of target alignments, an associated probability distribution, and a seed model -- that are specified by distinct finite automata. The approach is then applied to a new concept of subset seeds for which we propose an efficient automaton construction. Experimental results confirm that sensitive subset seeds can be efficiently designed using our approach, and can then be used in similarity search producing better results than ordinary spaced seeds.
- 1: ADAGE (INRIA Lorraine - LORIA)
- INRIA – CNRS : UMR7503 – Université Henri Poincaré - Nancy I – Université Nancy II – Institut National Polytechnique de Lorraine (INPL)
- Domain : Computer Science/Other
- Keywords : local alignment – DNA – spaced seeds – subset seeds – transition-constrained seeds – finite automata – seed sensitivity – dynamic programming
- Internal note : RR-5374
- inria-00070629, version 1
- http://hal.inria.fr/inria-00070629
- oai:hal.inria.fr:inria-00070629
- From: Rapport De Recherche Inria
- Submitted on: Friday, 19 May 2006 21:03:08
- Updated on: Friday, 23 June 2006 09:42:38






Associated documents

Export