Distributed adaptive sampling for kernel matrix approximation

Daniele Calandriello; Alessandro Lazaric; Michal Valko

Conference paper. Year: 2017


Daniele Calandriello

Role: Author
PersonId: 960706

Sequential Learning

Alessandro Lazaric

Role: Author
PersonId: 851
IdHAL: alessandro-lazaric
ORCID: 0000-0002-8970-413X
IdRef: 188701486

Sequential Learning

Michal Valko

Role: Author
PersonId: 284
IdHAL: michal
IdRef: 22360934X

Sequential Learning

Abstract

Most kernel-based methods, such as kernel or Gaussian process regression, kernel PCA, ICA, or $k$-means clustering, do not scale to large datasets, because constructing and storing the kernel matrix $\mathbf{K}_n$ requires at least $\mathcal{O}(n^2)$ time and space for $n$ samples. Recent works show that sampling points with replacement according to their ridge leverage scores (RLS) generates small dictionaries of relevant points with strong spectral approximation guarantees for $\mathbf{K}_n$. The drawback of RLS-based methods is that computing exact RLS requires constructing and storing the whole kernel matrix. In this paper, we introduce SQUEAK, a new algorithm for kernel approximation based on RLS sampling that processes the dataset sequentially and stores a dictionary that yields accurate kernel matrix approximations, with a number of points that depends only on the effective dimension $d_{\text{eff}}(\gamma)$ of the dataset. Moreover, since all RLS estimates are computed efficiently using only the small dictionary, SQUEAK is the first RLS sampling algorithm that never constructs the whole matrix $\mathbf{K}_n$, runs in time $\widetilde{\mathcal{O}}(n\,d_{\text{eff}}(\gamma)^3)$, which is linear in $n$, and requires only a single pass over the dataset. We also propose a parallel and distributed version of SQUEAK that scales linearly across multiple machines, achieving similar accuracy in as little as $\widetilde{\mathcal{O}}(\log(n)\,d_{\text{eff}}(\gamma)^3)$ time.
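For readers unfamiliar with the quantities named in the abstract, the sketch below illustrates the exact (non-scalable) baseline that SQUEAK is designed to avoid. It is not the SQUEAK algorithm. It assumes the standard definitions $\tau_i(\gamma) = [\mathbf{K}_n(\mathbf{K}_n + \gamma\mathbf{I})^{-1}]_{ii}$ for the ridge leverage scores and $d_{\text{eff}}(\gamma) = \sum_i \tau_i(\gamma)$ for the effective dimension, builds the full kernel matrix, samples a dictionary with replacement in proportion to the exact RLS, and forms a plain, unweighted Nyström approximation. The Gaussian kernel, the function names, the dictionary size, and the value of `gamma` are all illustrative assumptions, not taken from the paper.

```python
import numpy as np


def rbf_kernel(X, Y, bandwidth=1.0):
    """Gaussian (RBF) kernel matrix between the rows of X and the rows of Y."""
    sq_dists = (
        np.sum(X**2, axis=1)[:, None]
        + np.sum(Y**2, axis=1)[None, :]
        - 2.0 * X @ Y.T
    )
    # Clip tiny negative values caused by floating-point cancellation.
    return np.exp(-np.maximum(sq_dists, 0.0) / (2.0 * bandwidth**2))


def exact_ridge_leverage_scores(K, gamma):
    """Exact RLS: the diagonal of K (K + gamma I)^{-1}.

    This needs the full n x n matrix K, which is the O(n^2) cost
    that the abstract identifies as the drawback of exact RLS.
    """
    n = K.shape[0]
    return np.diag(np.linalg.solve(K + gamma * np.eye(n), K))


def sample_dictionary(tau, budget, rng):
    """Draw `budget` indices with replacement, proportionally to the RLS."""
    probs = tau / tau.sum()
    return rng.choice(len(tau), size=budget, replace=True, p=probs)


def nystrom_approximation(K, idx):
    """Unweighted Nystrom approximation C W^+ C^T built from columns idx."""
    C = K[:, idx]
    W = K[np.ix_(idx, idx)]
    # An explicit cutoff keeps the pseudo-inverse stable when W is ill-conditioned.
    return C @ np.linalg.pinv(W, rcond=1e-10) @ C.T


rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))        # toy dataset (assumption)
K = rbf_kernel(X, X)
gamma = 1.0                          # illustrative regularization value

tau = exact_ridge_leverage_scores(K, gamma)
d_eff = tau.sum()                    # effective dimension d_eff(gamma)

idx = sample_dictionary(tau, budget=40, rng=rng)
K_approx = nystrom_approximation(K, idx)
rel_err = np.linalg.norm(K - K_approx, 2) / np.linalg.norm(K, 2)

print(f"d_eff(gamma) = {d_eff:.2f}, relative spectral error = {rel_err:.2e}")
```

In this baseline both the memory and the RLS computation scale at least quadratically in $n$. According to the abstract, SQUEAK instead estimates the RLS from the small dictionary alone, so the full matrix is never formed.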

Subject areas

Machine Learning [stat.ML]

Main file

calandriello2017distributed.pdf (755.01 KB)

Origin: Files produced by the author(s)


https://inria.hal.science/hal-01482760

Submitted on: Friday, March 3, 2017, 18:35:50

Last modified on: Wednesday, January 24, 2024, 09:54:23

Long-term archiving on: Tuesday, June 6, 2017, 13:15:28

Dates and versions

hal-01482760, version 1 (03-03-2017)

Identifiers

HAL Id: hal-01482760, version 1
arXiv: 1803.10172

Cite

Daniele Calandriello, Alessandro Lazaric, Michal Valko. Distributed adaptive sampling for kernel matrix approximation. International Conference on Artificial Intelligence and Statistics, 2017, Fort Lauderdale, United States. ⟨hal-01482760⟩


Collections

CNRS INRIA CRISTAL INRIA2 CRISTAL-SEQUEL UNIV-LILLE ANR

279 views

201 downloads
