Detection of mutated primers and impact on targeted metagenomics results - Archive ouverte HAL Access content directly
Conference Papers Year : 2016

Detection of mutated primers and impact on targeted metagenomics results

(1, 2) , (3) , (4) , (1, 2)


High-throughout sequencing platforms are widely used in metabarcoding studies of environmental microbial diversity as they can quickly produce millions of reads. Resulting reads in these studies are clustered into molecular operational taxonomic units (OTUs) and compared statistically. In order to limit the number of spurious OTUs retrieved from samples, eliminating the reads with mutated primers has become the norm. This process has the advantage of not requiring the use of complex tools, since it is possible to search for exact primers with simple regular expressions natively supported by many programming languages (python, perl, ruby, etc.). However, this strategy may also eliminate correct sequences and this practice raises questions: Does the removal of all reads with mutated primers cause information loss? Or, in a more practical perspective : now there are tools to reject the less reliable sequences in clustering as SWARM [Mahe et al, 2015], is there an interest to seek mutated primers? Can it bring new sequences? Are these sequences relevant? Can it contribute to detect more species? Such are the questions that we tried to answer in this work, through a metagenomic analysis that estimates eukaryotic soil biodiversity. To answer these questions, we analysed data on tropical soils to study the impact on metabarcoding results of keeping reads with mutated primers. Main results: The study shows that keeping such reads allows identifying more sequences. The majority of the new sequences are quite similar to sequences with exact primers. A minority of the new sequences contributes to validate new clusters as potential species signatures.
Fichier principal
Vignette du fichier
RCAM2016_long_DetectionOfMutatedPrimers_20170822.pdf (520.21 Ko) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

hal-01576304 , version 1 (22-08-2017)


  • HAL Id : hal-01576304 , version 1


Aymeric Antoine-Lorquin, Frédéric Mahé, Micah Dunthorn, Catherine Belleannée. Detection of mutated primers and impact on targeted metagenomics results. RCAM'16 "Recent Computational Advances in Metagenomics", Sep 2016, The Hague, Netherlands. ⟨hal-01576304⟩
623 View
65 Download


Gmail Facebook Twitter LinkedIn More