
arxiv: 1908.10292 · v2 · pith:RMTWZXCOnew · submitted 2019-08-27 · 🧮 math.ST · cs.LG · stat.ML · stat.TH

On the Multiple Descent of Minimum-Norm Interpolants and Restricted Lower Isometry of Kernels

classification 🧮 math.ST · cs.LG · stat.ML · stat.TH
keywords: minimum-norm, interpolants, kernel, alpha, analysis, restricted, risk, sample

We study the risk of minimum-norm interpolants of data in Reproducing Kernel Hilbert Spaces. Our upper bounds on the risk are of a multiple-descent shape for the various scalings of $d = n^{\alpha}$, $\alpha\in(0,1)$, for the input dimension $d$ and sample size $n$. Empirical evidence supports our finding that minimum-norm interpolants in RKHS can exhibit this unusual non-monotonicity in sample size; furthermore, locations of the peaks in our experiments match our theoretical predictions. Since gradient flow on appropriately initialized wide neural networks converges to a minimum-norm interpolant with respect to a certain kernel, our analysis also yields novel estimation and generalization guarantees for these over-parametrized models. At the heart of our analysis is a study of spectral properties of the random kernel matrix restricted to a filtration of eigen-spaces of the population covariance operator, which may be of independent interest.
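
As a rough illustration of the object the abstract studies, the sketch below fits a minimum-norm kernel interpolant $\hat f(x) = k(x, X) K^{-1} y$ with input dimension $d \approx n^{\alpha}$. The Gaussian kernel, the bandwidth $\sqrt{d}$, the standard normal inputs, the noisy linear target, and the helper names (`gaussian_kernel`, `fit_min_norm_interpolant`, `dimension_for`) are all assumptions made for this example; they are not taken from the paper, whose kernel and data model may differ.

```python
import numpy as np

def gaussian_kernel(A, B, bandwidth):
    """Gram matrix of k(x, z) = exp(-||x - z||^2 / (2 * bandwidth^2)) (assumed kernel)."""
    sq_dists = (A**2).sum(axis=1)[:, None] + (B**2).sum(axis=1)[None, :] - 2.0 * A @ B.T
    # Clip tiny negative values caused by floating-point cancellation.
    return np.exp(-np.maximum(sq_dists, 0.0) / (2.0 * bandwidth**2))

def fit_min_norm_interpolant(X, y, bandwidth):
    """Return the predictor f(x) = k(x, X) K^{-1} y, assuming K is invertible."""
    K = gaussian_kernel(X, X, bandwidth)
    coef = np.linalg.solve(K, y)  # no ridge term: the fit interpolates the data

    def predict(X_new):
        return gaussian_kernel(X_new, X, bandwidth) @ coef

    return predict

def dimension_for(n, alpha):
    """Input dimension under the scaling d = n^alpha, rounded to an integer."""
    return max(1, int(round(n**alpha)))

# Hypothetical experiment: one sample size and one scaling exponent.
rng = np.random.default_rng(0)
n, alpha = 100, 0.5
d = dimension_for(n, alpha)

X_train = rng.standard_normal((n, d))
y_train = X_train[:, 0] + 0.1 * rng.standard_normal(n)  # assumed noisy linear target

predict = fit_min_norm_interpolant(X_train, y_train, bandwidth=np.sqrt(d))

# The interpolant should reproduce the training labels up to numerical error.
train_residual = np.max(np.abs(predict(X_train) - y_train))

# Risk estimate against the noiseless target on fresh inputs.
X_test = rng.standard_normal((1000, d))
test_mse = np.mean((predict(X_test) - X_test[:, 0]) ** 2)

print(f"d={d}, max train residual={train_residual:.2e}, test MSE={test_mse:.3f}")
```

Repeating the last part over a grid of `n` with `alpha` held fixed, and averaging `test_mse` over several seeds, is one way to look for non-monotone behaviour in sample size; whether peaks appear will depend on the kernel and data choices assumed here.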



Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Solving Inverse Parametrized Problems via Finite Elements and Extreme Learning Networks

    math.NA 2026-02 unverdicted novelty 6.0

    A hybrid FEM and ELM framework for parameter-dependent PDEs derives existence, uniqueness, regularity, and error estimates for inverse problems in photoacoustic tomography.