Using cascading Bloom filters to improve the memory usage for de Brujin graphs

Kamil Salikhov; Gustavo Sacomoto; Gregory Kucherov

doi:10.1186/1748-7188-9-2

Article Dans Une Revue Algorithms for Molecular Biology Année : 2014

Using cascading Bloom filters to improve the memory usage for de Brujin graphs

(1) , (2, 3) , (4, 5)

1
2
3
4
5

Kamil Salikhov

Fonction : Auteur
PersonId : 954695

Lomonosov Moscow State University

Gustavo Sacomoto

Fonction : Auteur
PersonId : 936380

Baobab [LBBE]

An algorithmic view on genomes, cells, and environments

Gregory Kucherov

Fonction : Auteur correspondant
PersonId : 14903
IdHAL : gregory-kucherov
ORCID : 0000-0001-5899-5424
IdRef : 093602189

Connectez-vous pour contacter l'auteur

Department of Computer Science [Beer-Sheva]

Laboratoire d'Informatique Gaspard-Monge

Résumé

Background
De Brujin graphs are widely used in bioinformatics for processing next-generation sequencing data. Due to a very large size of NGS datasets, it is essential to represent de Bruijn graphs compactly, and several approaches to this problem have been proposed recently.
Results
In this work, we show how to reduce the memory required by the data structure of Chikhi and Rizk (WABI'12) that represents de Brujin graphs using Bloom filters. Our method requires 30% to 40% less memory with respect to their method, with insignificant impact on construction time. At the same time, our experiments showed a better query time compared to the method of Chikhi and Rizk.
Conclusion
The proposed data structure constitutes, to our knowledge, currently the most efficient practical representation of de Bruijn graphs.

Domaines

Biologie moléculaire

Fichier principal

1748-7188-9-2.pdf (570.35 Ko)

1748-7188-9-2.xml (102.31 Ko)

Origine : Fichiers éditeurs autorisés sur une archive ouverte

Format : Autre

Ed. BMC : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-00971576

Soumis le : jeudi 3 avril 2014-09:27:37

Dernière modification le : vendredi 17 mai 2024-17:12:03

Archivage à long terme le : jeudi 3 juillet 2014-11:06:09

Dates et versions

hal-00971576 , version 1 (03-04-2014)

Identifiants

HAL Id : hal-00971576 , version 1
DOI : 10.1186/1748-7188-9-2
PUBMED : 24565280

Citer

Kamil Salikhov, Gustavo Sacomoto, Gregory Kucherov. Using cascading Bloom filters to improve the memory usage for de Brujin graphs. Algorithms for Molecular Biology, 2014, 9 (1), pp.2. ⟨10.1186/1748-7188-9-2⟩. ⟨hal-00971576⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

ENPC CNRS INRIA UNIV-LYON1 PARISTECH LIGM BAMBOO LIGM_MOA BIOENVIS INRIA2 ESIEE-PARIS LBBE UDL ANR UNIV-EIFFEL LIGM_ADA

250 Consultations

132 Téléchargements

Using cascading Bloom filters to improve the memory usage for de Brujin graphs

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager