High-Performance Haplotype Assembly

Abstract : The problem of Haplotype Assembly is an essential step in human genome analysis. It is typically formalised as the Minimum Error Correction (MEC) problem which is NP-hard. MEC has been approached using heuristics, integer linear programming, and fixed-parameter tractability (FPT), including approaches whose runtime is exponential in the length of the DNA fragments obtained by the sequencing process. Technological improvements are currently increasing fragment length, which drastically elevates computational costs for such methods. We present pWhatsHap, a multi-core parallelisation of WhatsHap, a recent FPT optimal approach to MEC. WhatsHap moves complexity from fragment length to fragment overlap and is hence of particular interest when considering sequencing technology's current trends. pWhat-sHap further improves the efficiency in solving the MEC problem, as shown by experiments performed on datasets with high coverage.
Document type :
Conference papers
Liste complète des métadonnées

Cited literature [20 references]  Display  Hide  Download

https://hal.inria.fr/hal-01526652
Contributor : Marie-France Sagot <>
Submitted on : Tuesday, May 23, 2017 - 12:33:46 PM
Last modification on : Thursday, March 21, 2019 - 2:51:22 PM
Document(s) archivé(s) le : Friday, August 25, 2017 - 12:34:19 AM

File

CIBB15.pdf
Files produced by the author(s)

Identifiers

Collections

Citation

Marco Aldinucci, Andrea Bracciali, Tobias Marschall, Murray Patterson, Nadia Pisanti, et al.. High-Performance Haplotype Assembly. Computational Intelligence Methods for Bioinformatics and Biostatistics (CIBB'15), 2015, Cambridge, United Kingdom. pp.281 - 258, ⟨10.1016/j.compbiolchem.2005.05.001⟩. ⟨hal-01526652⟩

Share

Metrics

Record views

143

Files downloads

124