inria-00178325, version 1
Protein similarity search with subset seeds on a dedicated reconfigurable hardware
Pierre Peterlongo
a, 1Laurent Noé
2Dominique Lavenier
b, 1Gilles Georges a, 1Julien Jacques a, 1Gregory Kucherov 2Mathieu Giraud
2
Parallel Bio-Computing (2007)
Résumé : Genome sequencing of numerous species raises the need of complete genome comparison with precise and fast similarity searches. Today, advanced seed-based techniques (spaced seeds, multiple seeds, subset seeds) provide better sensitivity/specificity ratios. We present an implementation of such a seed-based technique onto parallel specialized hardware embedding reconfigurable architecture (FPGA), where the FPGA is tightly connected to large capacity Flash memories. This parallel system allows large databases to be fully indexed and rapidly accessed. Compared to traditional approaches like the Blastp software, we obtain both significant speed-up and better results. As our knowledge, this is the first attempt to exploit modern seed features for parallelizing similarity search.
- a – INRIA
- b – CNRS
- 1 : SYMBIOSE (INRIA - IRISA)
- CNRS : UMR6074 – INRIA – INSA Rennes – Université de Rennes 1
- 2 : Laboratoire d'Informatique Fondamentale de Lille (LIFL)
- CNRS : UMR8022 – INRIA – IRCICA – Université Lille 1 - Sciences et Technologies
- Domaine : Informatique/Architecture
Informatique/Bio-informatique
Sciences du Vivant/Bio-Informatique, Biologie Systémique - Mots-clés : bioinformatics – subset seeds – sequence comparison – FLASH technology – FPGA
- inria-00178325, version 1
- http://hal.inria.fr/inria-00178325
- oai:hal.inria.fr:inria-00178325
- Contributeur : Dominique Lavenier
- Soumis le : Jeudi 11 Octobre 2007, 12:04:54
- Dernière modification le : Vendredi 12 Octobre 2007, 11:35:09






Documents associés
Exporter