Decoding genetic markers of multiple phenotypic layers through biologically constrained Genome-to-Phenome Bayesian Sparse Regression - Archive ouverte HAL Access content directly
Journal Articles Frontiers in Molecular Medicine Year : 2021

Decoding genetic markers of multiple phenotypic layers through biologically constrained Genome-to-Phenome Bayesian Sparse Regression

(1, 2) , (1, 2) , (1, 2) , (1, 2)
1
2

Abstract

The applicability of multivariate approaches for the joint analysis of genomics and phenomics information is currently limited by the lack of scalability, and by the difficulty of interpreting the related findings from a biological perspective. To tackle these limitations, we present Bayesian Genome-to-Phenome Sparse Regression (G2PSR), a novel multivariate regression method based on sparse SNP-gene constraints. The statistical framework of G2PSR is based on a Bayesian neural network, were constraints on SNPs-genes associations are integrated by incorporating a priori knowledge linking variants to their respective genes, to then reconstruct the phenotypic data in the output layer. Interpretability is promoted by inducing sparsity on the genes through variational dropout, allowing to estimate the uncertainty associated with each gene, and related SNPs, in the reconstruction task. Ultimately, G2PSR is conceived to prevent multiple testing correction and to assess the combined effect of SNPs, thus increasing the statistical power in detecting genome-to-phenome associations. The effectiveness of G2P was demonstrated on synthetic and real data, with respect to state-of-the-art methods based on group-wise sparsity constraints. The application on real data consisted in an imaging-genetics analysis on the Alzheimer's Disease Neuroimaging Initiative data, relating SNPS from more than 3500 genes to clinical and multi-variate brain volumetric information. The experimental results show that our method can provide accurate selection of relevant genes in dataset with large SNPs-to-samples ratio, thus overcoming the main limitations of current genome-to-phenome association methods.
Fichier principal
Vignette du fichier
G2PSR_hal.pdf (8.61 Mo) Télécharger le fichier
Vignette du fichier
Figure_7.pdf (1.41 Mo) Télécharger le fichier
Origin : Files produced by the author(s)

Dates and versions

hal-03477486 , version 1 (17-12-2021)

Identifiers

Cite

Marie Deprez, Julien Moreira, Maxime Sermesant, Marco Lorenzi. Decoding genetic markers of multiple phenotypic layers through biologically constrained Genome-to-Phenome Bayesian Sparse Regression. Frontiers in Molecular Medicine, In press, ⟨10.3389/fmmed.2022.830956⟩. ⟨hal-03477486⟩
85 View
74 Download

Altmetric

Share

Gmail Facebook Twitter LinkedIn More