Genome sequence analysis with MonetDB - A case study on Ebola virus diversity

Abstract : Next-generation sequencing (NGS) technology has led the life sciences into the big data era. Today, sequencing genomes takes little time and cost, but results in terabytes of data to be stored and analysed. Biologists are often exposed to excessively time consuming and error-prone data management and analysis hurdles. In this paper, we propose a database management system (DBMS) based approach to accelerate and substantially simplify genome sequence analysis. We have extended MonetDB, an open-source column-based DBMS, with a BAM module, which enables easy, flexible, and rapid management and analysis of sequence alignment data stored as Sequence Alignment/Map (SAM/BAM) files. We describe the main features of MonetDB/BAM using a case study on Ebola virus genomes.
Document type :
Journal articles
Complete list of metadatas

Cited literature [15 references]  Display  Hide  Download

https://hal.inria.fr/hal-01248546
Contributor : Marie-France Sagot <>
Submitted on : Tuesday, May 30, 2017 - 2:10:00 PM
Last modification on : Wednesday, April 11, 2018 - 1:56:06 AM
Long-term archiving on : Wednesday, September 6, 2017 - 2:01:03 PM

File

Cijvat-Genome_sequence_analysi...
Files produced by the author(s)

Identifiers

Collections

Citation

Robin Cijvat, Stefan Manegold, Martin Kersten, Gunnar W. Klau, Alexander Schönhuth, et al.. Genome sequence analysis with MonetDB - A case study on Ebola virus diversity. Datenbank-Spektrum, Springer, 2015, 15 (3), pp.185-191. ⟨10.1007/s13222-015-0198-x⟩. ⟨hal-01248546⟩

Share

Metrics

Record views

239

Files downloads

135