Detection of mutated primers and impact on targeted metagenomics results

Abstract: High-throughput sequencing platforms are widely used in metabarcoding studies of environmental microbial diversity because they can quickly produce millions of reads. In these studies, the resulting reads are clustered into molecular operational taxonomic units (OTUs) and compared statistically. To limit the number of spurious OTUs retrieved from samples, eliminating reads with mutated primers has become the norm. This filtering requires no complex tools, since exact primers can be found with the simple regular expressions natively supported by many programming languages (Python, Perl, Ruby, etc.). However, the strategy may also eliminate correct sequences, which raises a question: does removing all reads with mutated primers cause information loss? From a more practical perspective: now that clustering tools such as Swarm [Mahé et al., 2015] can reject the less reliable sequences, is it worth searching for mutated primers? Does it recover new sequences? Are these sequences relevant? Does it help detect more species? We address these questions through a metagenomic analysis that estimates eukaryotic soil biodiversity, using data from tropical soils to study how keeping reads with mutated primers affects metabarcoding results. Main results: keeping such reads allows more sequences to be identified. The majority of the new sequences are quite similar to sequences with exact primers. A minority of the new sequences help validate new clusters as potential species signatures.
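The abstract contrasts exact primer search by regular expression with keeping reads whose primers are mutated. The sketch below illustrates that contrast in Python. It is only a sketch under stated assumptions: the primer sequence, the function names, and the mismatch threshold are hypothetical, and mutations are modelled as substitutions only (Hamming distance), which is a simplification and not necessarily the detection method used in the paper.

```python
import re

# Hypothetical forward primer, for illustration only (not taken from the study).
FORWARD_PRIMER = "GTACACACCGCCCGTC"


def has_exact_primer(read: str, primer: str = FORWARD_PRIMER) -> bool:
    """Return True if the read starts with the exact primer (plain regex match)."""
    return re.match(re.escape(primer), read) is not None


def hamming(a: str, b: str) -> int:
    """Count positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def has_mutated_primer(read: str, primer: str = FORWARD_PRIMER,
                       max_mismatches: int = 2) -> bool:
    """Return True if the read prefix differs from the primer by
    1..max_mismatches substitutions (assumed threshold; indels are ignored)."""
    prefix = read[:len(primer)]
    if len(prefix) < len(primer):
        return False
    return 0 < hamming(prefix, primer) <= max_mismatches


def classify(reads):
    """Tally reads as 'exact', 'mutated', or 'rejected'."""
    counts = {"exact": 0, "mutated": 0, "rejected": 0}
    for read in reads:
        if has_exact_primer(read):
            counts["exact"] += 1
        elif has_mutated_primer(read):
            counts["mutated"] += 1
        else:
            counts["rejected"] += 1
    return counts


if __name__ == "__main__":
    reads = [
        FORWARD_PRIMER + "ACGT",        # exact primer
        "GTACACACCGCCCGTA" + "ACGT",    # one substitution in the primer
        "TTTTTTTTTTTTTTTT" + "ACGT",    # no recognisable primer
    ]
    print(classify(reads))
```

A strict pipeline would keep only the "exact" reads. The question raised in the abstract is whether the "mutated" reads, once passed to a clustering step, add relevant sequences.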
Document type:
Conference paper
RCAM'16 "Recent Computational Advances in Metagenomics", Sep 2016, The Hague, Netherlands. 2016

https://hal.inria.fr/hal-01576304
Contributor: Catherine Belleannée
Submitted on: Tuesday, 22 August 2017 - 18:40:09
Last modified on: Thursday, 11 January 2018 - 06:28:15

File

RCAM2016_long_DetectionOfMutat...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01576304, version 1

Citation

Aymeric Antoine-Lorquin, Frédéric Mahé, Micah Dunthorn, Catherine Belleannée. Detection of mutated primers and impact on targeted metagenomics results. RCAM'16 "Recent Computational Advances in Metagenomics", Sep 2016, The Hague, Netherlands. 2016. 〈hal-01576304〉
