Protein similarity search with subset seeds on a dedicated reconfigurable hardware

Abstract : Genome sequencing of numerous species raises the need for complete genome comparison with precise and fast similarity searches. Today, advanced seed-based techniques (spaced seeds, multiple seeds, subset seeds) provide better sensitivity/specificity ratios. We present an implementation of such a seed-based technique on specialized parallel hardware built around a reconfigurable architecture (FPGA), in which the FPGA is tightly coupled to large-capacity Flash memories. This parallel system allows large databases to be fully indexed and rapidly accessed. Compared to traditional approaches such as the Blastp software, we obtain both a significant speed-up and better results. To our knowledge, this is the first attempt to exploit modern seed features for parallelizing similarity search.
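
As an illustration of the subset-seed idea mentioned in the abstract, the following minimal software sketch indexes a small protein database and looks up seed hits for a query. It is not the implementation described in the paper: the seed span, the amino-acid groupings, and all function names (`seed_key`, `build_index`, `find_hits`) are assumptions chosen for illustration, and nothing here models the FPGA or Flash hardware. A subset seed is taken here to be a sequence of positions, each of which partitions the amino-acid alphabet into groups; two words hit when their residues fall in the same group at every position.

```python
from collections import defaultdict

# Hypothetical subset seed of span 3. Each position partitions the 20 standard
# amino acids into groups; these groupings are illustrative, not the paper's.
EXACT = [[a] for a in "ACDEFGHIKLMNPQRSTVWY"]   # every residue is its own group
COARSE = ["LIVM", "FWY", "KRH", "DE", "NQ", "ST", "AG", "C", "P"]  # similar residues merged
ANY = ["ACDEFGHIKLMNPQRSTVWY"]                  # "don't care" position

SEED = [EXACT, COARSE, ANY]


def build_lookup(groups):
    """Map each residue to the index of the group that contains it."""
    return {res: i for i, group in enumerate(groups) for res in group}


LOOKUPS = [build_lookup(groups) for groups in SEED]


def seed_key(word):
    """Project a word of len(SEED) residues onto its tuple of group indices.

    Assumes only the 20 standard residues occur; others raise KeyError.
    """
    return tuple(lookup[res] for lookup, res in zip(LOOKUPS, word))


def build_index(sequences):
    """Index every word of every sequence by its seed key."""
    index = defaultdict(list)
    span = len(SEED)
    for seq_id, seq in enumerate(sequences):
        for pos in range(len(seq) - span + 1):
            index[seed_key(seq[pos:pos + span])].append((seq_id, pos))
    return index


def find_hits(index, query):
    """Return (query_pos, seq_id, db_pos) for every word sharing a seed key."""
    span = len(SEED)
    hits = []
    for qpos in range(len(query) - span + 1):
        key = seed_key(query[qpos:qpos + span])
        for seq_id, dpos in index.get(key, []):
            hits.append((qpos, seq_id, dpos))
    return hits
```

In this sketch, the words `MKL` and `MRI` share a key because `K` and `R` belong to the same group at the second position and the third position accepts any residue. In a full search pipeline, each hit would then be passed to an extension step that scores the surrounding alignment.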

https://hal.inria.fr/inria-00178325
Contributor : Dominique Lavenier
Submitted on : Thursday, October 11, 2007 - 12:04:54 PM
Last modification on : Thursday, February 21, 2019 - 10:52:45 AM
Long-term archiving on : Friday, April 9, 2010 - 5:01:59 PM

File

Lav07cb.pdf
Publisher files allowed on an open archive

Identifiers

  • HAL Id : inria-00178325, version 1

Citation

Pierre Peterlongo, Laurent Noé, Dominique Lavenier, Gilles Georges, Julien Jacques, et al. Protein similarity search with subset seeds on a dedicated reconfigurable hardware. Parallel Bio-Computing, Sep 2007, Gdansk, Poland. ⟨inria-00178325⟩
