Conference papers

Strain-Aware Assembly of Genomes from Mixed Samples Using Flow Variation Graphs

Abstract : The goal of haplotype-aware genome assembly is to reconstruct all individual haplotypes from a mixed sample and to provide corresponding abundance estimates. We provide a reference-genome-independent solution based on the construction of a variation graph, capturing all diversity present in the sample. We solve the contig abundance estimation problem and propose a greedy algorithm to efficiently build maximal-length haplotypes. Finally, we obtain accurate frequency estimates for the reconstructed haplotypes through linear programming techniques. Our method outperforms state-of-the-art approaches on viral quasispecies benchmarks and has the potential to assemble bacterial genomes in a strain-aware manner as well.
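
To make the greedy haplotype-building step more concrete, here is a minimal sketch, not the authors' implementation. It assumes the variation graph is acyclic, that abundances are already available as edge flows, and that a topological order is given. The function names (`widest_path`, `greedy_haplotypes`) and the toy graph are hypothetical. The sketch repeatedly extracts the source-to-sink path with the largest bottleneck abundance and subtracts it from the residual flow. The method in the paper additionally uses contig information and refines frequencies by linear programming, which is omitted here.

```python
def widest_path(edges, source, sink, order):
    """Return (bottleneck, path) of the max-bottleneck source-sink path in a DAG.

    edges: dict mapping node -> {successor: remaining abundance}
    order: nodes in topological order (assumed to be supplied by the caller)
    """
    best = {source: (float("inf"), [source])}
    for u in order:
        if u not in best:
            continue
        width, path = best[u]
        for v, flow in edges.get(u, {}).items():
            if flow <= 0:
                continue  # edge is exhausted
            cand = min(width, flow)
            if v not in best or cand > best[v][0]:
                best[v] = (cand, path + [v])
    return best.get(sink, (0, None))


def greedy_haplotypes(edges, source, sink, order):
    """Greedily peel off paths until no positive-abundance path remains."""
    residual = {u: dict(nbrs) for u, nbrs in edges.items()}
    haplotypes = []
    while True:
        width, path = widest_path(residual, source, sink, order)
        if path is None or width <= 0:
            break
        for u, v in zip(path, path[1:]):
            residual[u][v] -= width  # remove the abundance explained by this path
        haplotypes.append((path, width))
    return haplotypes


# Toy graph (hypothetical): two bubbles, abundances 70 and 30.
toy_edges = {
    "s": {"a": 70, "b": 30},
    "a": {"c": 70},
    "b": {"c": 30},
    "c": {"d": 70, "e": 30},
    "d": {"t": 70},
    "e": {"t": 30},
}
toy_order = ["s", "a", "b", "c", "d", "e", "t"]
haplotypes = greedy_haplotypes(toy_edges, "s", "t", toy_order)
```

On this toy input the sketch returns two paths whose abundances sum to the total flow. Note that edge flows alone cannot tell which variant in the first bubble belongs with which variant in the second. The sketch resolves this by abundance only, whereas a real assembler would rely on read or contig evidence spanning both bubbles.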
Document type : Conference papers

Cited literature [44 references]
Contributor : Marie-France Sagot
Submitted on : Friday, October 2, 2020 - 10:44:19 AM
Last modification on : Monday, December 7, 2020 - 2:14:36 PM
Long-term archiving on : Monday, January 4, 2021 - 8:53:24 AM


Files produced by the author(s)




Jasmijn Baaijens, Leen Stougie, Alexander Schönhuth. Strain-Aware Assembly of Genomes from Mixed Samples Using Flow Variation Graphs. RECOMB 2020 - 24th International Conference on Research in Computational Molecular Biology, May 2020, Padova (Virtual), Italy. pp.221-222, ⟨10.1007/978-3-030-45257-5_14⟩. ⟨hal-02955692⟩


