Evaluation of genome assembly software based on long reads

Abstract: During the last 30 years, genomics has been revolutionized by the development of first- and second-generation sequencing (SGS) technologies, enabling the completion of many remarkable projects such as the Human Genome Project, the 1000 Genomes Project and the Human Microbiome Project. In the last decade, SGS technologies based on massively parallel sequencing have dominated the market, thanks to their ability to produce enormous volumes of data cheaply. However, genes and regions of interest are often not completely or accurately assembled, which complicates analyses or requires additional cloning efforts to obtain the correct sequences. The fundamental obstacle to obtaining high-quality genome assemblies with SGS technologies is the presence of repeats in the sequences. A promising solution to this issue is the advent of third-generation sequencing (TGS) technologies based on long-read sequencing. TGS technologies have been used to produce highly accurate de novo assemblies of hundreds of microbial genomes and highly contiguous reconstructions of many dozens of plant and animal genomes, enabling new insights into evolution and sequence diversity. They have also been applied to resequencing analyses to create detailed maps of structural variations in many species, and they have been used to fill in many of the gaps in the human reference genome. In this report, we compare and evaluate several genome assembly programs based on TGS technology. The experiments were performed on four reference genomes, and the results were evaluated with the QUAST software.
Document type:
[Research Report] France Genomique. 2017

Contributor: Dominique Lavenier
Submitted on: Thursday, March 2, 2017 - 23:37:30
Last modified on: Wednesday, February 21, 2018 - 01:42:47




Laurent Bouri, Dominique Lavenier, Jean-Francois Gibrat, Victoria Fabia Dominguez del Angel. Evaluation of genome assembly software based on long reads. [Research Report] France Genomique. 2017. 〈hal-01481801〉


