Manycore high-performance computing in bioinformatics

Mining the increasing amount of genomic data requires having very efficient tools. Increasing the efficiency can be obtained with better algorithms, but one could also take advantage of the hardware itself to reduce the application runtimes. Since a few years, issues with heat dissipation prevent the processors from having higher frequencies. One of the answers to maintain Moore's Law is parallel processing. Grid environments provide tools for effective implementation of coarse grain parallelization. Recently, another kind of hardware has attracted interest: multicore processors. Graphic processing units (GPUs) are a first step towards massively multicore processors. They allow everyone to have some teraflops of cheap computing power in its personal computer. The CUDA library (released in 2007) and the new standard OpenCL (specified in 2008) make programming of such devices very convenient. OpenCL is likely to gain a wide industrial support and to become a standard of choice for parallel programming. In all cases, the best speedups are obtained when combining precise algorithmic studies with a knowledge of the computing architectures. This is especially true with the memory hierarchy: the algorithms have to find a good balance between using large (and slow) global memories and some fast (but small) local memories. In this chapter, we will show how those manycore devices enable more efficient bioinformatics applications. We will first give some insights into architectures and parallelism. Then we will describe recent implementations specifically designed for manycore architectures, including algorithms on sequence alignment and RNA structure prediction. We will conclude with some thoughts about the dissemination of those algorithms and implementations: are they today available on the bookshelf for everyone?

Keywords

bioinfomatics manycore processors GPU parallelism

Domains

Bioinformatics [q-bio.QM] Quantitative Methods [q-bio.QM] Distributed, Parallel, and Cluster Computing [cs.DC]

Fichier principal

varre-manycore-bioinformatics.pdf (593.81 Ko)

Origin : Files produced by the author(s)

Mathieu Giraud : Connect in order to contact the contributor

https://hal.science/hal-00563408

Submitted on : Friday, February 4, 2011-6:11:55 PM

Last modification on : Friday, March 24, 2023-2:52:54 PM

Long-term archiving on: Tuesday, November 6, 2012-1:30:43 PM

Dates and versions

hal-00563408 , version 1 (04-02-2011)

Identifiers

HAL Id : hal-00563408 , version 1

Cite

Jean-Stéphane Varré, Bertil Schmidt, Stéphane Janot, Mathieu Giraud. Manycore high-performance computing in bioinformatics. Laura Elnitski, Helen Piontkivska, Lonnie R Welch. Advances in Genomic Sequence Analysis and Pattern Discovery, World Scientific, chapter 8, 2011. ⟨hal-00563408⟩

Export

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-LILLE3 CNRS INRIA LIFL CRISTAL INRIA2 CRISTAL-BONSAI

406 View

553 Download