Recognition: unknown
Coin Betting and Parameter-Free Online Learning
read the original abstract
In the recent years, a number of parameter-free algorithms have been developed for online linear optimization over Hilbert spaces and for learning with expert advice. These algorithms achieve optimal regret bounds that depend on the unknown competitors, without having to tune the learning rates with oracle choices. We present a new intuitive framework to design parameter-free algorithms for \emph{both} online linear optimization over Hilbert spaces and for learning with expert advice, based on reductions to betting on outcomes of adversarial coins. We instantiate it using a betting algorithm based on the Krichevsky-Trofimov estimator. The resulting algorithms are simple, with no parameters to be tuned, and they improve or match previous results in terms of regret guarantee and per-round complexity.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
A Note on How to Remove the $\ln\ln T$ Term from the Squint Bound
Shifted KT potentials equal a prior change in KT, and this removes the ln ln T factor from Squint's data-independent bound.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.