Upper and Lower Bounds on the Performance of Kernel PCA

Principal Component Analysis (PCA) is a popular method for dimension reduction and has attracted an unfailing interest for decades. Recently, kernel PCA has emerged as an extension of PCA but, despite its use in practice, a sound theoretical understanding of kernel PCA is missing. In this paper, we contribute lower and upper bounds on the efficiency of kernel PCA, involving the empirical eigenvalues of the kernel Gram matrix. Two bounds are for fixed estimators, and two are for randomized estimators through the PAC-Bayes theory. We control how much information is captured by kernel PCA on average, and we dissect the bounds to highlight strengths and limitations of the kernel PCA algorithm. Therefore, we contribute to the better understanding of kernel PCA. Our bounds are briefly illustrated on a toy numerical example.

Mots clés

Statistical learning theory kernel PCA PAC-Bayes dimension reduction

Domaines

Apprentissage [cs.LG] Machine Learning [stat.ML] Théorie [stat.TH]

Fichier principal

2012.10369.pdf (498.52 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Benjamin Guedj : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-03084598

Soumis le : lundi 21 décembre 2020-11:29:19

Dernière modification le : samedi 27 avril 2024-03:11:03

Dates et versions

hal-03084598 , version 1 (21-12-2020)

Identifiants

HAL Id : hal-03084598 , version 1
ARXIV : 2012.10369

Citer

Maxime Haddouche, Benjamin Guedj, Omar Rivasplata, John Shawe-Taylor. Upper and Lower Bounds on the Performance of Kernel PCA. 2020. ⟨hal-03084598⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA ENS-CACHAN INSMI INRIA2 INRIA-CWI UNIV-LILLE INRIA-LONDON ENS-PARIS-SACLAY LPP-MATH

54 Consultations

143 Téléchargements