Evaluation of genome assembly software based on long reads - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Rapport (Rapport De Recherche) Année : 2017

Evaluation of genome assembly software based on long reads

Résumé

During the last 30 years, Genomics has been revolutionized by the development of first- and second-generation sequencing (SGS) technologies, enabling the completion of many remarkable projects as the Human Genome Project, the 1000 Genomes Project and the Human Microbiome Project. In the last decade, SGS technologies based on massive parallel sequencing have dominated the market, thanks to their ability to produce enormous volumes of data cheaply. However, often genes and regions of interest are not completely or accurately assembled, complicating analyses or requiring additional cloning efforts for obtaining the correct sequences. The fundamental obstacle in SGS technologies for obtaining high quality genome assembly is the existence of repetitions in the sequences. A promising solution to this issue is the advent of Third-generation sequencing (TGS) technologies based on long read sequencing. TGS technologies have been used to produce highly accurate de novo assemblies of hundreds of microbial genomes and highly contiguous reconstructions of many dozens of plant and animal genomes, enabling new insights into evolution and sequence diversity. They have also been applied to resequencing analyses, to create detailed maps of structural variations in many species. Also, these new technologies have been used to fill in many of the gaps in the human reference genome. In this report, we compare and evaluate several genome assembly software based on TSG technology. The experimentation has been performed on 4 reference genomes and the results evaluated with the QUAST software.
Fichier non déposé

Dates et versions

hal-01481801 , version 1 (02-03-2017)

Identifiants

Citer

Laurent Bouri, Dominique Lavenier, Jean-Francois J.-F. Gibrat, Victoria Fabia Dominguez del Angel. Evaluation of genome assembly software based on long reads. [Research Report] France Genomique. 2017. ⟨hal-01481801⟩
986 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More