SV-Bay: structural variant detection in cancer genomes using a Bayesian approach with correction for GC-content and read mappability

D Iakovishina 1, 2 Isabelle Janoueix-Lerosey 3 Emmanuel Barillot 4 Mireille Regnier 1, 2 Valentina Boeva 4
2 AMIB - Algorithms and Models for Integrative Biology
LIX - Laboratoire d'informatique de l'École polytechnique [Palaiseau], LRI - Laboratoire de Recherche en Informatique, UP11 - Université Paris-Sud - Paris 11, Inria Saclay - Ile de France, X - École polytechnique, CNRS - Centre National de la Recherche Scientifique : UMR8623
Abstract : Motivation: Whole genome sequencing of paired-end reads can be applied to characterize the landscape of large somatic rearrange-ments of cancer genomes. Several methods for detecting structural variants with whole genome sequencing data have been developed. So far, none of these methods has combined information about abnormally mapped read pairs connecting rearranged regions and associated global copy number changes. Our aim was to create a computational method that could use both types of information, i.e., normal and abnormal reads, and demonstrate that by doing so we can highly improve both sensitivity and specificity rates of structural variant prediction. Results: We developed a computational method, SV-Bay, to detect structural variants from whole genome sequencing mate-pair or paired-end data using a probabilistic Bayesian approach. This ap-proach takes into account depth of coverage by normal reads and abnormalities in read pair mappings. To estimate the model likeli-hood, SV-Bay considers GC-content and read mappability of the genome, thus making important corrections to the expected read count. For the detection of somatic variants, SV-Bay makes use of a matched normal sample when it is available. We validated SV-Bay on simulated datasets and an experimental mate-pair dataset for the CLB-GA neuroblastoma cell line. The comparison of SV-Bay with several other methods for structural variant detection demonstrated that SV-Bay has better prediction accuracy both in terms of sensitivi-ty and false positive detection rate.
Type de document :
Pré-publication, Document de travail
Liste complète des métadonnées
Contributeur : Mireille Regnier <>
Soumis le : jeudi 3 décembre 2015 - 16:36:49
Dernière modification le : samedi 23 juin 2018 - 01:21:40


  • HAL Id : hal-01217891, version 1


D Iakovishina, Isabelle Janoueix-Lerosey, Emmanuel Barillot, Mireille Regnier, Valentina Boeva. SV-Bay: structural variant detection in cancer genomes using a Bayesian approach with correction for GC-content and read mappability. 2015. 〈hal-01217891〉



Consultations de la notice