Parameter-Wise Co-Clustering for High-Dimensional Data

Michael P B Gallaugher; Christophe Biernacki; Paul D Mcnicholas

doi:10.1007/s00180-022-01289-2

Journal article, Computational Statistics, Year: 2022



Michael P B Gallaugher

Role: Author
PersonId : 1035911

Baylor University

Christophe Biernacki

Role: Author

MOdel for Data Analysis and Learning

Paul D Mcnicholas

Role: Author
PersonId : 1035912

McMaster University [Hamilton, Ontario]

Abstract

In recent years, data dimensionality has increasingly become a concern, leading to many parameter and dimension reduction techniques being proposed in the literature. A parameter-wise co-clustering model, for (possibly high-dimensional) data modelled via continuous random variables, is presented. The proposed model, although allowing more flexibility, still maintains the very high degree of parsimony and interpretability achieved by traditional co-clustering. More precisely, the keystone consists of dramatically increasing the number of column-clusters while expressing each as a combination of a limited number of mean-dependent and variance-dependent column-clusters. A stochastic expectation-maximization (SEM) algorithm along with a Gibbs sampler is used for parameter estimation and an integrated complete log-likelihood criterion is used for model selection. Simulated and real datasets are used for illustration and comparison with traditional co-clustering.
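The structure described in the abstract can be illustrated with a minimal sketch. It rests on an assumed reading of the abstract, not on the authors' implementation: each column carries one mean-dependent label and one variance-dependent label, and each cell is Gaussian with a mean indexed by (row-cluster, mean label) and a variance indexed by (row-cluster, variance label). The function names, the table layout, and the parameter count (block means and variances only, ignoring mixing proportions) are all hypothetical and chosen for illustration. Estimation by SEM with Gibbs sampling and ICL-based model selection are not shown.

```python
import random

def simulate_parameter_wise(n_rows, n_cols, means, variances, seed=0):
    """Simulate a matrix under an ASSUMED parameter-wise co-clustering layout.

    means:     G x L_mu table (row-cluster by mean-dependent column-cluster)
    variances: G x L_sigma table (row-cluster by variance-dependent column-cluster)
    Cell (i, j) is drawn from N(means[g_i][m_j], variances[g_i][v_j]).
    """
    rng = random.Random(seed)
    G, L_mu, L_sigma = len(means), len(means[0]), len(variances[0])
    # Latent labels: one per row, and two independent labels per column.
    row_labels = [rng.randrange(G) for _ in range(n_rows)]
    mean_labels = [rng.randrange(L_mu) for _ in range(n_cols)]
    var_labels = [rng.randrange(L_sigma) for _ in range(n_cols)]
    data = [
        [rng.gauss(means[g][mean_labels[j]], variances[g][var_labels[j]] ** 0.5)
         for j in range(n_cols)]
        for g in row_labels
    ]
    return data, row_labels, mean_labels, var_labels

def block_parameter_counts(G, L_mu, L_sigma):
    """Compare block-parameter counts (mixing proportions ignored).

    Returns (effective column-clusters, parameter-wise count,
    count for a traditional model with that many column-clusters).
    """
    effective = L_mu * L_sigma            # distinct (mean, variance) label pairs
    parameter_wise = G * (L_mu + L_sigma) # G*L_mu means plus G*L_sigma variances
    traditional = 2 * G * effective       # one mean and one variance per block
    return effective, parameter_wise, traditional
```

Under these assumptions, 2 row-clusters with 3 mean-dependent and 4 variance-dependent column-clusters give up to 12 combined column-clusters from 14 block parameters, where a traditional model with 12 column-clusters would need 48. This is the sense in which the number of column-clusters can grow sharply while parsimony is preserved.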

Domains

Statistics [stat], Methodology [stat.ME]

Main file

PWCoClust_Revised2.pdf (2.93 MB)

Origin: Files produced by the author(s)


https://hal.science/hal-01862824

Submitted on: Monday, November 21, 2022, 09:01:48

Last modified on: Friday, April 19, 2024, 14:04:05

Dates and versions

hal-01862824 , version 1 (27-08-2018)

hal-01862824 , version 2 (08-12-2019)

hal-01862824 , version 3 (30-09-2020)

hal-01862824 , version 4 (21-11-2022)

Identifiers

HAL Id : hal-01862824 , version 4
DOI : 10.1007/s00180-022-01289-2

Cite

Michael P B Gallaugher, Christophe Biernacki, Paul D Mcnicholas. Parameter-Wise Co-Clustering for High-Dimensional Data. Computational Statistics, 2022, ⟨10.1007/s00180-022-01289-2⟩. ⟨hal-01862824v4⟩


Collections

CNRS INRIA INRIA2 UNIV-LILLE LPP-MATH

