inria-00178325, version 1
Protein similarity search with subset seeds on a dedicated reconfigurable hardware
Pierre Peterlongo (a, 1)
Laurent Noé (2)
Dominique Lavenier (b, 1)
Gilles Georges (a, 1)
Julien Jacques (a, 1)
Gregory Kucherov (2)
Mathieu Giraud (2)
Parallel Bio-Computing (2007)
Abstract: Genome sequencing of numerous species raises the need for complete genome comparison with precise and fast similarity searches. Today, advanced seed-based techniques (spaced seeds, multiple seeds, subset seeds) provide better sensitivity/specificity ratios. We present an implementation of such a seed-based technique on parallel specialized hardware embedding a reconfigurable architecture (FPGA), where the FPGA is tightly connected to large-capacity Flash memories. This parallel system allows large databases to be fully indexed and rapidly accessed. Compared to traditional approaches such as the Blastp software, we obtain both a significant speed-up and better results. To our knowledge, this is the first attempt to exploit modern seed features for parallelizing similarity search.
- a – INRIA
- b – CNRS
- 1: SYMBIOSE (INRIA - IRISA)
- CNRS : UMR6074 – INRIA – INSA Rennes – Université de Rennes 1
- 2: Laboratoire d'Informatique Fondamentale de Lille (LIFL)
- CNRS : UMR8022 – INRIA – IRCICA – Université des Sciences et Technologies de Lille - Lille I
- Domain : Computer Science/Architecture – Computer Science/Bioinformatics – Life Sciences/Quantitative Methods
- Keywords : bioinformatics – subset seeds – sequence comparison – FLASH technology – FPGA
- inria-00178325, version 1
- http://hal.inria.fr/inria-00178325
- oai:hal.inria.fr:inria-00178325
- From: Dominique Lavenier
- Submitted on: Thursday, 11 October 2007 12:04:54
- Updated on: Friday, 12 October 2007 11:35:09





