S. F. Altschul, W. Gish, W. Miller, E. W. Myers, and D. J. Lipman, Basic local alignment search tool, Journal of Molecular Biology, vol.215, issue.3, pp.403-410, 1990.
DOI : 10.1016/S0022-2836(05)80360-2

A. S. Amend, K. A. Seifert, and T. D. Bruns, Quantifying microbial communities with 454 pyrosequencing: does read abundance count?, Molecular Ecology, vol.59, issue.24, pp.5555-5565, 2010.
DOI : 10.1111/j.1365-294X.2010.04898.x

H. J. Atkinson, J. H. Morris, T. E. Ferrin, and P. C. Babbitt, Using Sequence Similarity Networks for Visualization of Relationships Across Diverse Protein Superfamilies, PLoS ONE, vol.28, issue.69, p.4345, 2009.
DOI : 10.1371/journal.pone.0004345.s010

E. Bapteste, C. Bicep, and P. Lopez, Evolution of genetic diversity using networks: the human gut microbiome as a case study, Clinical Microbiology and Infection, vol.18, issue.4, pp.40-43, 2012.
DOI : 10.1111/j.1469-0691.2012.03856.x

D. Belazzougui, P. Boldi, G. Ottaviano, R. Venturini, and S. Vigna, Cache-Oblivious Peeling of Random Hypergraphs, 2014 Data Compression Conference, pp.352-361, 2014.
DOI : 10.1109/DCC.2014.48

D. Belazzougui and R. Venturini, Compressed Static Functions with Applications, Proceedings of the Twenty-Fourth Annual ACM-SIAM Symposium on Discrete Algorithms, SODA '13, pp.229-240
DOI : 10.1137/1.9781611973105.17

G. Benoit, P. Peterlongo, M. Mariadassou, E. Drezen, S. Schbath et al., Lemaitre: Multiple Comparative Metagenomics using Multiset k-mer Counting, pp.1-17, 2016.

E. Boon, S. Halary, E. Bapteste, and M. Hijri, Studying Genome Heterogeneity within the Arbuscular Mycorrhizal Fungal Cytoplasm, Genome Biology and Evolution, vol.7, issue.2, pp.505-521, 2015.
DOI : 10.1093/gbe/evv002

URL : https://hal.archives-ouvertes.fr/hal-01224215

D. Charles and K. Chellapilla, Bloomier Filters: A Second Look, Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) LNCS, vol.5193, pp.259-270, 2008.
DOI : 10.1007/978-3-540-87744-8_22

E. Corel, P. Lopez, R. Meheust, and E. Bapteste, Network-Thinking: Graphs to Analyze Microbial Complexity and Evolution, Trends in Microbiology, vol.24, issue.3, pp.224-237, 2016.
DOI : 10.1016/j.tim.2015.12.003

URL : https://hal.archives-ouvertes.fr/hal-01300043

V. B. Dubinkina, D. S. Ischenko, V. I. Ulyantsev, A. V. Tyakht, and D. G. , Alexeev: Assessment of k-mer spectrum applicability for metagenomic dissimilarity analysis, BMC Bioinformatics, vol.17, issue.1, pp.2016-2054

P. Ferragina and G. Manzini, Indexing compressed text, Journal of the ACM, vol.52, issue.4, pp.552-581, 2000.
DOI : 10.1145/1082036.1082039

M. Fondi, A. Karkman, M. Tamminen, E. Bosi, M. Virta et al., ???Every Gene Is Everywhere but the Environment Selects???: Global Geolocalization of Gene Sharing in Environmental Samples through Network Analysis, Genome Biology and Evolution, vol.8, issue.5, 2016.
DOI : 10.1093/gbe/evw077

D. Forster, L. Bittner, S. Karkar, M. Dunthorn, S. Romac et al., Testing ecological theories with sequence similarity networks: marine ciliates exhibit similar geographic dispersal patterns as multicellular organisms, BMC Biology, vol.163, issue.1, pp.2015-2031
DOI : 10.1186/s12915-015-0125-5

URL : https://hal.archives-ouvertes.fr/hal-01144173

M. G. Grabherr, B. J. Haas, M. Yassour, J. Z. Levin, D. A. Thompson et al., Full-length transcriptome assembly from RNA-Seq data without a reference genome, Nature Biotechnology, vol.30, issue.7, pp.29-644, 2011.
DOI : 10.1101/GR.229202. ARTICLE PUBLISHED ONLINE BEFORE MARCH 2002

. Banfield, A new view of the tree of life, Nature Microbiology, vol.1, p.16048, 2016.

S. W. Kembel, M. Wu, J. A. Eisen, and J. L. Green, Incorporating 16S Gene Copy Number Information Improves Estimates of Microbial Diversity and Abundance, PLoS Computational Biology, vol.9, issue.10, pp.2012-2013
DOI : 10.1371/journal.pcbi.1002743.s004

A. Kirsch and M. Mitzenmacher, Less hashing, same performance: Building a better Bloom filter, in Algorithms, pp.456-467, 2006.

V. Kunin, A. Engelbrektson, H. Ochman, and P. Hugenholtz, Wrinkles in the rare biosphere: pyrosequencing errors can lead to artificial inflation of diversity estimates, Environmental Microbiology, vol.64, issue.1, pp.118-123, 2010.
DOI : 10.1111/j.1462-2920.2009.02051.x

B. Langmead and S. L. , Fast gapped-read alignment with Bowtie 2, Nature Methods, vol.9, issue.4, pp.357-359, 2012.
DOI : 10.1093/bioinformatics/btp352

H. Li and R. Durbin, Fast and accurate short read alignment with Burrows-Wheeler transform, Bioinformatics, vol.25, issue.14, pp.1754-1760, 2009.
DOI : 10.1093/bioinformatics/btp324

P. Lopez, S. Halary, and E. Bapteste, Highly divergent ancient gene families in metagenomic samples are compatible with additional divisions of life, Biology Direct, vol.59, issue.3, p.64, 2015.
DOI : 10.1186/s13062-015-0092-3

URL : https://hal.archives-ouvertes.fr/hal-01257771

N. Maillet, G. Collet, T. Vannier, D. Lavenier, and P. Peterlongo, Commet: Comparing and combining multiple metagenomic datasets, 2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp.94-98, 2014.
DOI : 10.1109/BIBM.2014.6999135

URL : https://hal.archives-ouvertes.fr/hal-01080050

N. Maillet, C. Lemaitre, R. Chikhi, D. Lavenier, and P. Peterlongo, Compareads: comparing huge metagenomic experiments, BMC Bioinformatics, vol.13, issue.Suppl 19, pp.1-10, 2012.
DOI : 10.1371/journal.pbio.0050077

URL : https://hal.archives-ouvertes.fr/hal-00760332

G. Marsaglia, Xorshift RNGs, Journal of Statistical Software, vol.8, issue.14, pp.1-6, 2003.
DOI : 10.18637/jss.v008.i14

G. Rizk, D. Lavenier, and R. Chikhi, DSK: k-mer counting with very low memory usage, Bioinformatics, vol.29, issue.5, pp.652-653, 2013.
DOI : 10.1093/bioinformatics/btt020

URL : https://hal.archives-ouvertes.fr/hal-00778473

G. Robertson, J. Schein, R. Chiu, R. Corbett, M. Field et al., De novo assembly and analysis of RNA-seq data, Nature Methods, vol.7, issue.11, pp.909-912, 2010.
DOI : 10.1038/nbt0509-455

M. Schirmer, U. Z. Ijaz, R. D-'amore, N. Hall, W. T. Sloan et al., Quince: Insight into biases and sequencing errors for amplicon sequencing with the illumina miseq platform, Nucleic Acids Research, 2015.

D. Sharon, H. Tilgner, F. Grubert, and M. Snyder, A single-molecule long-read survey of the human transcriptome, Nature Biotechnology, vol.3, issue.11, pp.31-1009, 2013.
DOI : 10.1038/nature07672

H. Tilgner, F. Grubert, D. Sharon, and M. P. Snyder, Defining a personal, allelespecific , and single-molecule long-read transcriptome, Proceedings of the National Academy of Sciences, pp.9869-9874, 2014.

F. Völkel, E. Bapteste, M. Habib, P. Lopez, and C. Vigliotti, Read networks and k-laminar graphs. arXiv, pp.1-14, 2016.

E. Zorita, P. Cuscó, and G. J. Filion, Starcode: sequence clustering based on all-pairs search, Bioinformatics, vol.31, issue.12, pp.31-1913, 2015.
DOI : 10.1093/bioinformatics/btv053