Space/time trade-offs in hash coding with allowable errors, Commun. ACM, vol.13, issue.7, pp.422-426, 1970. ,
Space-efficient and exact de bruijn graph representation based on a bloom filter, Algorithms for Molecular Biology, vol.8, p.22, 2013. ,
URL : https://hal.archives-ouvertes.fr/hal-00868805
A global reference for human genetic variation, Nature, vol.526, issue.7571, p.68, 2015. ,
, International HapMap Consortium et al. The international hapmap project, Nature, vol.426, issue.6968, p.789, 2003.
, The Computational Pan-Genomics Consortium. Computational pan-genomics: status, promises and challenges, Briefings in Bioinformatics, vol.19, issue.1, p.2016
The uk10k project identifies rare variants in health and disease, Nature, vol.526, issue.7571, p.82, 2015. ,
An improved data stream summary: the count-min sketch and its applications, J. Algorithms, vol.55, issue.1, pp.58-75, 2005. ,
Richard Durbin, and 1000 Genomes Project Analysis Group. The variant call format and VCFtools, Bioinformatics, vol.27, issue.15, pp.2156-2158, 2011. ,
A framework for variation discovery and genotyping using next-generation dna sequencing data, Nature genetics, vol.43, issue.5, p.491, 2011. ,
KMC 3: counting and manipulating k-mer statistics, Bioinformatics, vol.33, issue.17, pp.2759-2761, 2017. ,
Graphtyper enables population-scale genotyping using pangenome graphs, Nature genetics, vol.49, issue.11, p.1654, 2017. ,
From theory to practice: Plug and play with succinct data structures, 13th International Symposium on Experimental Algorithms, pp.326-337, 2014. ,
Performance evaluation of indel calling tools using real short-read data, Human genomics, vol.9, issue.1, p.20, 2015. ,
Non-linear accumulation of 8-hydroxy-2-deoxyguanosine, a marker of oxidized dna damage, during aging. Mutation Research/DNAging, vol.316, pp.277-285, 1996. ,
Best practices for benchmarking germline small variant calls in human genomes. bioRxiv, p.270157, 2018. ,
The discovery of human genetic variations and their use as disease markers: past, present and future, Journal of human genetics, vol.55, issue.7, p.403, 2010. ,
Fast gapped-read alignment with bowtie 2, Nature methods, vol.9, issue.4, p.357, 2012. ,
A statistical framework for snp calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data, Bioinformatics, vol.27, issue.21, pp.2987-2993, 2011. ,
Fast and accurate short read alignment with burrows-wheeler transform, Bioinformatics, vol.25, issue.14, pp.1754-1760, 2009. ,
The sequence alignment/map format and samtools, Bioinformatics, vol.25, issue.16, pp.2078-2079, 2009. ,
Loss-of-function variants in the genomes of healthy humans, Human molecular genetics, vol.19, issue.R2, pp.125-130, 2010. ,
The genome analysis toolkit: a mapreduce framework for analyzing next-generation dna sequencing data, Genome research, vol.20, issue.9, pp.1297-1303, 2010. ,
Efficient counting of k-mers in DNA sequences using a bloom filter, BMC Bioinformatics, vol.12, p.333, 2011. ,
An initial map of insertion and deletion (indel) variation in the human genome, Genome research, vol.16, issue.9, pp.1182-1190, 2006. ,
The origin, evolution, and functional impact of short insertion-deletion variants identified in 179 human genomes, Genome research, 2013. ,
Small insertions and deletions (indels) in human genomes, Human molecular genetics, vol.19, issue.R2, pp.131-136, 2010. ,
Fastgt: an alignment-free method for calling common snvs directly from raw sequencing reads, Scientific reports, vol.7, issue.1, p.2537, 2017. ,
Integrating mapping-, assembly-and haplotype-based approaches for calling variants in clinical sequencing applications, Nature genetics, vol.46, issue.8, p.912, 2014. ,
First-line gefitinib in patients with advanced non-small-cell lung cancer harboring somatic egfr mutations, Journal of clinical oncology, vol.26, issue.15, pp.2442-2449, 2008. ,
Fast genotyping of known snps through approximate k-mer matching, Bioinformatics, vol.32, issue.17, pp.538-544, 2016. ,
Indexing variation graphs, 2017 Proceedings of the ninteenth workshop on algorithm engineering and experiments (ALENEX), pp.13-27, 2017. ,
An integrated map of structural variation in 2,504 human genomes, Nature, vol.526, issue.7571, p.75, 2015. ,
Allsome sequence bloom trees, Research in Computational Molecular Biology -21st Annual International Conference, pp.272-286, 2017. ,
URL : https://hal.archives-ouvertes.fr/hal-01575350
Toward fast and accurate snp genotyping from whole genome sequencing data for bedside diagnostics, Bioinformatics, p.641, 2018. ,
The sequence of the human genome. science, vol.291, pp.1304-1351, 2001. ,
URL : https://hal.archives-ouvertes.fr/hal-00465088
Broadword implementation of rank/select queries, Experimental Algorithms, 7th International Workshop, pp.154-168, 2008. ,
The fragile x site in somatic cell hybrids: an approach for molecular cloning of fragile sites, Science, vol.237, issue.4813, pp.420-423, 1987. ,
was not peer-reviewed) is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. The copyright holder for this preprint, Nature biotechnology, vol.32, issue.3, p.246, 2014. ,
, HomoRef HetRef HomoAlt HetAlt Uncalled Called GT
, HomoRef HetRef HomoAlt HetAlt