Skip to Main content Skip to Navigation
Preprints, Working Papers, ...

Cluster-Specific Predictions with Multi-Task Gaussian Processes

Arthur Leroy 1 Pierre Latouche 1 Benjamin Guedj 2, 3, 4, 5 Servane Gey 1
5 MODAL - MOdel for Data Analysis and Learning
LPP - Laboratoire Paul Painlevé - UMR 8524, Université de Lille, Sciences et Technologies, Inria Lille - Nord Europe, METRICS - Evaluation des technologies de santé et des pratiques médicales - ULR 2694, Polytech Lille - École polytechnique universitaire de Lille
Abstract : A model involving Gaussian processes (GPs) is introduced to simultaneously handle multi-task learning, clustering, and prediction for multiple functional data. This procedure acts as a model-based clustering method for functional data as well as a learning step for subsequent predictions for new tasks. The model is instantiated as a mixture of multi-task GPs with common mean processes. A variational EM algorithm is derived for dealing with the optimisation of the hyper-parameters along with the hyper-posteriors' estimation of latent variables and processes. We establish explicit formulas for integrating the mean processes and the latent clustering variables within a predictive distribution, accounting for uncertainty on both aspects. This distribution is defined as a mixture of cluster-specific GP predictions, which enhances the performances when dealing with group-structured data. The model handles irregular grid of observations and offers different hypotheses on the covariance structure for sharing additional information across tasks. The performances on both clustering and prediction tasks are assessed through various simulated scenarios and real datasets. The overall algorithm, called MagmaClust, is publicly available as an R package.
Complete list of metadatas

Cited literature [52 references]  Display  Hide  Download

https://hal.inria.fr/hal-03009276
Contributor : Benjamin Guedj <>
Submitted on : Tuesday, November 17, 2020 - 10:38:49 AM
Last modification on : Friday, November 27, 2020 - 2:18:03 PM

File

2011.07866.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-03009276, version 1
  • ARXIV : 2011.07866

Citation

Arthur Leroy, Pierre Latouche, Benjamin Guedj, Servane Gey. Cluster-Specific Predictions with Multi-Task Gaussian Processes. 2020. ⟨hal-03009276⟩

Share

Metrics

Record views

17

Files downloads

23