B. H. Bloom, Space/time trade-offs in hash coding with allowable errors, Commun. ACM, vol.13, issue.7, pp.422-426, 1970.

R. Chikhi and G. Rizk, Space-efficient and exact de Bruijn graph representation based on a Bloom filter, Algorithms for Molecular Biology, vol.8, p.22, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00868805

1000 Genomes Project Consortium, A global reference for human genetic variation, Nature, vol.526, issue.7571, p.68, 2015.

International HapMap Consortium, The International HapMap Project, Nature, vol.426, issue.6968, p.789, 2003.

The Computational Pan-Genomics Consortium, Computational pan-genomics: status, promises and challenges, Briefings in Bioinformatics, vol.19, issue.1, 2016.

UK10K Consortium, The UK10K project identifies rare variants in health and disease, Nature, vol.526, issue.7571, p.82, 2015.

G. Cormode and S. Muthukrishnan, An improved data stream summary: the count-min sketch and its applications, J. Algorithms, vol.55, issue.1, pp.58-75, 2005.

P. Danecek, A. Auton, G. Abecasis, C. A. Albers, E. Banks et al., The variant call format and VCFtools, Bioinformatics, vol.27, issue.15, pp.2156-2158, 2011.

M. A. DePristo, E. Banks, R. Poplin, K. V. Garimella, J. R. Maguire et al., A framework for variation discovery and genotyping using next-generation DNA sequencing data, Nature Genetics, vol.43, issue.5, p.491, 2011.

M. Długosz, M. Kokot, and S. Deorowicz, KMC 3: counting and manipulating k-mer statistics, Bioinformatics, vol.33, issue.17, pp.2759-2761, 2017.

H. P. Eggertsson, H. Jonsson, S. Kristmundsdottir, E. Hjartarson, B. Kehr et al., Graphtyper enables population-scale genotyping using pangenome graphs, Nature Genetics, vol.49, issue.11, p.1654, 2017.

S. Gog, T. Beller, A. Moffat, and M. Petri, From theory to practice: Plug and play with succinct data structures, 13th International Symposium on Experimental Algorithms, pp.326-337, 2014.

M. S. Hasan, X. Wu, and L. Zhang, Performance evaluation of indel calling tools using real short-read data, Human Genomics, vol.9, issue.1, p.20, 2015.

T. Kaneko, S. Tahara, and M. Matsuo, Non-linear accumulation of 8-hydroxy-2'-deoxyguanosine, a marker of oxidized DNA damage, during aging, Mutation Research/DNAging, vol.316, pp.277-285, 1996.

P. Krusche, L. Trigg, P. C. Boutros, C. E. Mason, F. M. De La Vega et al., Best practices for benchmarking germline small variant calls in human genomes, bioRxiv, p.270157, 2018.

C. S. Ku, E. Y. Loy, A. Salim, Y. Pawitan, and K. S. Chia, The discovery of human genetic variations and their use as disease markers: past, present and future, Journal of Human Genetics, vol.55, issue.7, p.403, 2010.

B. Langmead and S. L. Salzberg, Fast gapped-read alignment with Bowtie 2, Nature Methods, vol.9, issue.4, p.357, 2012.

H. Li, A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data, Bioinformatics, vol.27, issue.21, pp.2987-2993, 2011.

H. Li and R. Durbin, Fast and accurate short read alignment with Burrows-Wheeler transform, Bioinformatics, vol.25, issue.14, pp.1754-1760, 2009.

H. Li, B. Handsaker, A. Wysoker, T. Fennell, J. Ruan et al., The Sequence Alignment/Map format and SAMtools, Bioinformatics, vol.25, issue.16, pp.2078-2079, 2009.

D. G. MacArthur and C. Tyler-Smith, Loss-of-function variants in the genomes of healthy humans, Human Molecular Genetics, vol.19, issue.R2, pp.125-130, 2010.

A. McKenna, M. Hanna, E. Banks, A. Sivachenko, K. Cibulskis et al., The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data, Genome Research, vol.20, issue.9, pp.1297-1303, 2010.

P. Melsted and J. K. Pritchard, Efficient counting of k-mers in DNA sequences using a Bloom filter, BMC Bioinformatics, vol.12, p.333, 2011.

R. E. Mills, C. T. Luttig, C. E. Larkins, A. Beauchamp, C. Tsui et al., An initial map of insertion and deletion (INDEL) variation in the human genome, Genome Research, vol.16, issue.9, pp.1182-1190, 2006.

S. B. Montgomery, D. L. Goode, E. Kvikstad, C. A. Albers, Z. D. Zhang et al., The origin, evolution, and functional impact of short insertion-deletion variants identified in 179 human genomes, Genome Research, 2013.

J. M. Mullaney, R. E. Mills, W. S. Pittard, and S. E. Devine, Small insertions and deletions (INDELs) in human genomes, Human Molecular Genetics, vol.19, issue.R2, pp.131-136, 2010.

F. Pajuste, L. Kaplinski, M. Möls, T. Puurand, M. Lepamets et al., FastGT: an alignment-free method for calling common SNVs directly from raw sequencing reads, Scientific Reports, vol.7, issue.1, p.2537, 2017.

A. Rimmer, H. Phan, I. Mathieson, Z. Iqbal, S. R. F. Twigg et al., Integrating mapping-, assembly- and haplotype-based approaches for calling variants in clinical sequencing applications, Nature Genetics, vol.46, issue.8, p.912, 2014.

L. V. Sequist, R. G. Martins, D. Spigel, S. M. Grunberg, A. Spira et al., First-line gefitinib in patients with advanced non-small-cell lung cancer harboring somatic EGFR mutations, Journal of Clinical Oncology, vol.26, issue.15, pp.2442-2449, 2008.

A. Shajii, D. Yorukoglu, Y. W. Yu, and B. Berger, Fast genotyping of known SNPs through approximate k-mer matching, Bioinformatics, vol.32, issue.17, pp.538-544, 2016.

J. Sirén, Indexing variation graphs, 2017 Proceedings of the Nineteenth Workshop on Algorithm Engineering and Experiments (ALENEX), pp.13-27, 2017.

P. H. Sudmant, T. Rausch, E. J. Gardner, R. E. Handsaker, A. Abyzov et al., An integrated map of structural variation in 2,504 human genomes, Nature, vol.526, issue.7571, p.75, 2015.

C. Sun, R. S. Harris, R. Chikhi, and P. Medvedev, AllSome sequence Bloom trees, Research in Computational Molecular Biology - 21st Annual International Conference, pp.272-286, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01575350

C. Sun and P. Medvedev, Toward fast and accurate SNP genotyping from whole genome sequencing data for bedside diagnostics, Bioinformatics, p.641, 2018.

J. C. Venter, M. D. Adams, E. W. Myers, P. W. Li, R. J. Mural et al., The sequence of the human genome, Science, vol.291, pp.1304-1351, 2001.
URL : https://hal.archives-ouvertes.fr/hal-00465088

S. Vigna, Broadword implementation of rank/select queries, Experimental Algorithms, 7th International Workshop, pp.154-168, 2008.

S. T. Warren, F. Zhang, G. R. Licameli, and J. F. Peters, The fragile X site in somatic cell hybrids: an approach for molecular cloning of fragile sites, Science, vol.237, issue.4813, pp.420-423, 1987.

J. M. Zook, B. Chapman, J. Wang, D. Mittelman, O. Hofmann et al., Integrating human sequence data sets provides a resource of benchmark SNP and indel genotype calls, Nature Biotechnology, vol.32, issue.3, p.246, 2014.
