Distributed Adaptive Sampling for Kernel Matrix Approximation

Alessandro Lazaric; Daniele Calandriello; Michal Valko

arxiv: 1803.10172 · v1 · pith:FIA2GQGZnew · submitted 2018-03-27 · 📊 stat.ML · cs.DS· cs.LG

Distributed Adaptive Sampling for Kernel Matrix Approximation

Daniele Calandriello , Alessandro Lazaric , Michal Valko This is my paper

classification 📊 stat.ML cs.DScs.LG

keywords kernelmatrixsamplingapproximationdatasetgammamathbfmathcal

0 comments

read the original abstract

Most kernel-based methods, such as kernel or Gaussian process regression, kernel PCA, ICA, or $k$-means clustering, do not scale to large datasets, because constructing and storing the kernel matrix $\mathbf{K}_n$ requires at least $\mathcal{O}(n^2)$ time and space for $n$ samples. Recent works show that sampling points with replacement according to their ridge leverage scores (RLS) generates small dictionaries of relevant points with strong spectral approximation guarantees for $\mathbf{K}_n$. The drawback of RLS-based methods is that computing exact RLS requires constructing and storing the whole kernel matrix. In this paper, we introduce SQUEAK, a new algorithm for kernel approximation based on RLS sampling that sequentially processes the dataset, storing a dictionary which creates accurate kernel matrix approximations with a number of points that only depends on the effective dimension $d_{eff}(\gamma)$ of the dataset. Moreover since all the RLS estimations are efficiently performed using only the small dictionary, SQUEAK is the first RLS sampling algorithm that never constructs the whole matrix $\mathbf{K}_n$, runs in linear time $\widetilde{\mathcal{O}}(nd_{eff}(\gamma)^3)$ w.r.t. $n$, and requires only a single pass over the dataset. We also propose a parallel and distributed version of SQUEAK that linearly scales across multiple machines, achieving similar accuracy in as little as $\widetilde{\mathcal{O}}(\log(n)d_{eff}(\gamma)^3)$ time.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Beyond Averaging in John Ellipsoid Approximation: High-Accuracy Algorithms in the Leverage-Score Model
math.OC 2026-06 unverdicted novelty 7.0

John ellipsoid approximation in the leverage-score model achieves doubly logarithmic accuracy cost after setup by using last-iterate acceleration and Newton steps instead of averaging.