Mapsembler, targeted assembly of large genomes on a desktop computer

Pierre Peterlongo 1,*, Rayan Chikhi 1
* Corresponding author
1 SYMBIOSE - Biological systems and models, bioinformatics and sequences
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
Abstract:

Background: The analysis of next-generation sequencing data from large genomes is a timely research topic. Sequencers are producing billions of short sequence fragments from newly sequenced organisms. Computational methods for reconstructing sequences (whole-genome assemblers) are typically employed to process such data. However, one of the main drawbacks of these methods is their high memory requirement.

Results: We present Mapsembler, an iterative targeted assembler that processes large datasets of reads on commodity hardware. Mapsembler checks for the presence of given regions of interest in the reads and reconstructs their neighborhood, either as a plain sequence (consensus) or as a graph (full sequence structure). We introduce new algorithms to retrieve homologues of a sequence from reads and to construct an extension graph.

Conclusions: Mapsembler is the first software that enables de novo discovery, around a region of interest, of gene homologues, SNPs, exon skipping, and other structural events, directly from raw sequencing reads. Compared to traditional assembly software, Mapsembler's memory requirement and execution time are considerably lower, because data indexing is localized. Mapsembler can be used at http://mapsembler.genouest.org
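To make the idea of reconstructing the neighborhood of a region of interest more concrete, here is a minimal Python sketch of greedy rightward extension of a starter sequence from raw reads. It is an illustration only and is not Mapsembler's algorithm: the function name `extend_right`, the parameters `k`, `max_len`, and `min_support`, and the majority-vote rule are all assumptions made for this example. The abstract only states that reads are matched against a region of interest and used to extend it.

```python
from collections import Counter


def extend_right(starter, reads, k=5, max_len=100, min_support=1):
    """Greedily extend `starter` to the right, one base at a time.

    Hypothetical illustration: at each step, every read containing the
    last k bases of the current sequence votes for the base that follows.
    Extension stops when no read supports a next base, when support is
    below `min_support`, or when the top two candidates are tied
    (an ambiguity that a graph output could represent as a branch).
    """
    seq = starter
    while len(seq) < max_len:
        suffix = seq[-k:]
        votes = Counter()
        for read in reads:
            pos = read.find(suffix)
            while pos != -1:
                nxt = pos + k
                if nxt < len(read):
                    votes[read[nxt]] += 1
                pos = read.find(suffix, pos + 1)
        if not votes:
            break  # no read extends the current sequence
        (base, count), *rest = votes.most_common(2)
        if count < min_support or (rest and rest[0][1] == count):
            break  # unsupported or ambiguous extension
        seq += base
    return seq


# Toy data (invented for this example): overlapping reads from one sequence.
toy_reads = ["ACGTTGCA", "TTGCATGA", "CATGACCG", "TGACCGTA"]
consensus = extend_right("ACGTT", toy_reads, k=5)
```

On this toy input the starter `ACGTT` is extended read by read until no read continues the sequence. Only reads that overlap the current end of the sequence contribute, which loosely mirrors the abstract's point that work is localized around the region of interest rather than spread over the whole dataset.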
Document type:
Report
[Research Report] RR-7565, INRIA. 2011, pp.17
https://hal.inria.fr/inria-00577218
Contributor: Pierre Peterlongo
Submitted on: Wednesday, March 16, 2011 - 17:42:06
Last modified on: Wednesday, April 11, 2018 - 01:56:42
Document(s) archived on: Thursday, November 8, 2012 - 12:00:17

File

RR-7565.pdf
Files produced by the author(s)

Identifiers

  • HAL Id: inria-00577218, version 1

Citation

Pierre Peterlongo, Rayan Chikhi. Mapsembler, targeted assembly of large genomes on a desktop computer. [Research Report] RR-7565, INRIA. 2011, pp.17. 〈inria-00577218〉
