Group and sparse group partial least square approaches applied in genomics context

Motivation: The association between two blocks of ‘omics’ data brings challenging issues in computational biology due to their size and complexity. Here, we focus on a class of multivariate statistical methods called partial least square (PLS). Sparse version of PLS (sPLS) operates integration of two datasets while simultaneously selecting the contributing variables. However, these methods do not take into account the important structural or group effects due to the relationship between markers among biological pathways. Hence, considering the predefined groups of markers (e.g. genesets), this could improve the relevance and the efficacy of the PLS approach. Results: We propose two PLS extensions called group PLS (gPLS) and sparse gPLS (sgPLS). Our algorithm enables to study the relationship between two different types of omics data (e.g. SNP and gene expression) or between an omics dataset and multivariate phenotypes (e.g. cytokine secretion). We demonstrate the good performance of gPLS and sgPLS compared with the sPLS in the context of grouped data. Then, these methods are compared through an HIV therapeutic vaccine trial. Our approaches provide parsimonious models to reveal the relationship between gene abundance and the immunological response to the vaccine.

Domaines

Sciences du Vivant [q-bio] Santé publique et épidémiologie

Sandrine DARMIGNY : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-01288891

Soumis le : mardi 15 mars 2016-16:53:25

Dernière modification le : vendredi 3 mai 2024-14:04:57

Dates et versions

hal-01288891 , version 1 (15-03-2016)

Identifiants

HAL Id : hal-01288891 , version 1
DOI : 10.1093/bioinformatics/btv535
PUBMED : 26358727

Citer

Benoit Liquet, Pierre Lafaye de Micheaux, Boris P. Hejblum, Rodolphe Thiébaut. Group and sparse group partial least square approaches applied in genomics context. Bioinformatics, 2016, 32 (1), pp.35-42. ⟨10.1093/bioinformatics/btv535⟩. ⟨hal-01288891⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

GENES CNRS INRIA UNIV-PAU LMA-PAU INSMI ENSAI INRIA2 U1219

154 Consultations

0 Téléchargements