On the Suboptimality of GP-UCB under Polynomial Effective Optimism
Pith reviewed 2026-05-24 05:26 UTC · model grok-4.3
The pith
Polynomial growth of effective optimism prevents GP-UCB from reaching minimax-optimal regret rates with Matérn kernels.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Under a uniform confidence assumption, we prove a new regret lower bound for GP-UCB with Matérn kernels. The bound shows that polynomial growth of the effective optimism level, up to logarithmic factors, rules out the minimax-optimal regret rate. Since this is the regime covered by most existing analyses, our result identifies a concrete obstacle to proving minimax optimality for standard GP-UCB.
What carries the argument
effective optimism level (product of the exploration coefficient and the regularization parameter in kernel ridge regression), which controls the growth rate that triggers the lower bound
If this is right
- Most published upper bounds on GP-UCB regret fall into the polynomial-growth regime and therefore cannot be minimax-tight.
- The persistent gap between upper and lower bounds on GP-UCB regret may reflect an inherent limitation of the algorithm rather than analysis shortcomings.
- Standard GP-UCB cannot be shown to achieve the minimax rate without changing its optimism-parameter schedule.
- Any proof of minimax optimality for GP-UCB must operate outside the polynomial effective-optimism regime.
Where Pith is reading between the lines
- Keeping the effective optimism level bounded or growing only logarithmically could be necessary for any GP-UCB variant to match minimax rates.
- The same polynomial-growth obstacle may appear in other kernel-based upper-confidence-bound algorithms that share the same effective-optimism structure.
- Empirical tuning of the exploration coefficient to slower growth schedules offers a testable way to check whether the lower-bound regime is avoidable in practice.
Load-bearing premise
The uniform confidence assumption is required to derive the regret lower bound.
What would settle it
A direct calculation or simulation on a Matérn kernel instance showing that GP-UCB regret stays at the minimax rate despite polynomially growing effective optimism would falsify the lower bound.
read the original abstract
Gaussian process upper confidence bound (GP-UCB) is widely used for sequential optimization of expensive black-box functions. Although many upper bounds on its cumulative regret have been established in the literature, whether GP-UCB is minimax optimal remains open. We study this question through the effective optimism level, defined as the product of the exploration coefficient and the regularization parameter in kernel ridge regression. Under a uniform confidence assumption, we prove a new regret lower bound for GP-UCB with Mat\'ern kernels. The bound shows that polynomial growth of the effective optimism level, up to logarithmic factors, rules out the minimax-optimal regret rate. Since this is the regime covered by most existing analyses, our result identifies a concrete obstacle to proving minimax optimality for standard GP-UCB. More broadly, it suggests that the gap between current upper bounds and minimax lower bounds may reflect a real limitation of the algorithm, not only of the analysis.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims that under a uniform confidence assumption, GP-UCB with Matérn kernels admits a new regret lower bound showing that polynomial growth (up to logs) of the effective optimism level—the product of the exploration coefficient and the regularization parameter in kernel ridge regression—precludes the minimax-optimal regret rate. The authors conclude that this regime, which covers most existing upper-bound analyses, identifies a concrete obstacle to proving minimax optimality for standard GP-UCB and suggests the upper/lower-bound gap may reflect an algorithmic limitation rather than solely an analysis gap.
Significance. If the uniform confidence assumption holds for standard GP-UCB confidence sets, the result supplies a conditional explanation for why GP-UCB may fail to achieve minimax rates and supplies an independent lower-bound construction that avoids circular dependence on fitted parameters. This is a useful diagnostic for the field even if the assumption requires further verification.
major comments (1)
- [Abstract] Abstract: the lower bound is derived under the uniform confidence assumption, yet the manuscript supplies no argument, reference, or verification establishing that this assumption is satisfied by the standard GP-UCB confidence sets for Matérn kernels. Because the claim that the result identifies an obstacle for standard GP-UCB rests on the assumption applying to the algorithm as implemented, this link is load-bearing for the central conclusion.
Simulated Author's Rebuttal
We thank the referee for their careful reading and constructive feedback. We address the major comment below.
read point-by-point responses
-
Referee: [Abstract] Abstract: the lower bound is derived under the uniform confidence assumption, yet the manuscript supplies no argument, reference, or verification establishing that this assumption is satisfied by the standard GP-UCB confidence sets for Matérn kernels. Because the claim that the result identifies an obstacle for standard GP-UCB rests on the assumption applying to the algorithm as implemented, this link is load-bearing for the central conclusion.
Authors: We agree that the manuscript does not supply an argument, reference, or verification that the uniform confidence assumption holds for the standard GP-UCB confidence sets on Matérn kernels. The lower bound is derived under this assumption, and the central claim is therefore conditional on it. In the revision we will update the abstract and add a short discussion clarifying that the result identifies an obstacle only in the regime where the assumption holds, and that verifying the assumption for standard GP-UCB remains an open question. This change will be made without altering the technical contribution. revision: yes
Circularity Check
No significant circularity detected
full rationale
The paper defines the effective optimism level as the product of the exploration coefficient and regularization parameter, then derives a regret lower bound under an external uniform confidence assumption for GP-UCB with Matérn kernels. This shows that polynomial growth (up to logs) of that level precludes minimax-optimal rates. No step reduces the claimed lower bound to a quantity defined by the same fitted optimism level or by self-citation chains; the construction is presented as independent. The derivation remains self-contained against external benchmarks with no self-definitional, fitted-input, or ansatz-smuggling patterns exhibited in the abstract or described claims.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption uniform confidence assumption
Forward citations
Cited by 1 Pith paper
-
Bayesian Optimization by Kernel Regression and Density-based Exploration
BOKE uses kernel regression and density-based exploration in a confidence-bound acquisition function to achieve quadratic-complexity Bayesian optimization with a claimed global convergence guarantee under noise.
Reference graph
Works this paper leans on
-
[1]
, " * write output.state after.block = add.period write newline
ENTRY address author booktitle chapter doi edition editor eid howpublished institution isbn issn journal key month note number organization pages publisher school series title type url volume year label extra.label sort.label short.list INTEGERS output.state before.all mid.sentence after.sentence after.block FUNCTION init.state.consts #0 'before.all := #1...
-
[2]
" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION word.in "" FUNCTION format.date year ...
-
[3]
Machine Learning 47(2--3):235--256
Auer P, Cesa-Bianchi N, Fischer P (2002) Finite-time analysis of the multiarmed bandit problem. Machine Learning 47(2--3):235--256
work page 2002
-
[4]
Advances in Neural Information Processing Systems 33, 21524--21538
Balandat M, Karrer B, Jiang DR, Daulton S, Letham B, Wilson AG, Bakshy E (2020) BoTorch : A framework for efficient Monte-Carlo Bayesian optimization. Advances in Neural Information Processing Systems 33, 21524--21538
work page 2020
-
[5]
Proceedings of the 31st Conference on Learning Theory, 1348--1361
Belkin M (2018) Approximation beats concentration? An approximation view on inference with smooth radial kernels. Proceedings of the 31st Conference on Learning Theory, 1348--1361
work page 2018
-
[6]
Annals of Statistics 24(6):2524--2535
Brown LD, Low MG (1996) A constrained risk inequality with applications to nonparametric functional estimation. Annals of Statistics 24(6):2524--2535
work page 1996
-
[7]
Theoretical Computer Science 412(19):1832--1852
Bubeck S, Munos R, Stoltz G (2011) Pure exploration in finitely-armed and continuous-armed bandits. Theoretical Computer Science 412(19):1832--1852
work page 2011
-
[8]
Proceedings of the 34th International Conference on Machine Learning, 844--853
Chowdhury SR, Gopalan A (2017) On kernelized multi-armed bandits. Proceedings of the 34th International Conference on Machine Learning, 844--853
work page 2017
-
[9]
Forrester A, Sobester A, Keane A (2008) Engineering Design via Surrogate Modelling: A Practical Guide (John Wiley & Sons)
work page 2008
-
[10]
Frazier PI (2018) Bayesian optimization. Gel E, Ntaimo L, eds., Recent Advances in Optimization and Modeling of Contemporary Problems, 255--278, INFORMS TutORials in Operations Research (INFORMS)
work page 2018
-
[11]
Frazier PI, Wang J (2016) Bayesian optimization for materials design. Lookman T, Alexander FJ, Rajan K, eds., Information Science for Materials Discovery and Design, 45--75 (Springer)
work page 2016
-
[12]
Fu MC, ed., Handbook of Simulation Optimization, 105--147 (Springer)
Fu MC (2015) Stochastic gradient estimation. Fu MC, ed., Handbook of Simulation Optimization, 105--147 (Springer)
work page 2015
-
[13]
Hong LJ, Zhang X (2021) Surrogate-based simulation optimization. Carlsson JG, ed., Emerging Optimization Methods and Modeling Techniques with Applications, 287--311, INFORMS TutORials in Operations Research (INFORMS)
work page 2021
-
[14]
Horn RA, Johnson CR (2012) Matrix Analysis (Cambridge University Press), 2nd edition
work page 2012
-
[15]
Janz D, Burt D, Gonzalez J (2020) Bandit optimisation of functions in the Mat \'e rn kernel RKHS . Proceedings of the 23rd International Conference on Artificial Intelligence and Statistics, 2486--2495
work page 2020
-
[16]
Journal of Global Optimization 13(4):455--492
Jones DR, Schonlau M, Welch WJ (1998) Efficient global optimization of expensive black-box functions. Journal of Global Optimization 13(4):455--492
work page 1998
-
[17]
Gaussian Processes and Kernel Methods: A Review on Connections and Equivalences
Kanagawa M, Hennig P, Sejdinovic D, Sriperumbudur BK (2018) Gaussian processes and kernel methods: A review on connections and equivalences. Preprint available at https://arxiv.org/abs/1807.02582
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[18]
Journal of Machine Learning Research 20(145):1--30
Letham B, Bakshy E (2019) Bayesian optimization for policy search via online-offline experimentation. Journal of Machine Learning Research 20(145):1--30
work page 2019
-
[19]
Journal of Machine Learning Research 23(54):1--9
Lindauer M, Eggensperger K, Feurer M, Biedenkapp A, Deng D, Benjamins C, Ruhkopf T, Sass R, Hutter F (2022) SMAC3 : A versatile Bayesian optimization package for hyperparameter optimization. Journal of Machine Learning Research 23(54):1--9
work page 2022
-
[20]
Pi Mu Epsilon Journal 14(4):269--273
Muleshkov A, Nguyen T (2016) Easy proof of the Jacobian for the n -dimensional polar coordinates. Pi Mu Epsilon Journal 14(4):269--273
work page 2016
-
[21]
Nemirovski A (2000) Topics in non-parametric statistics. Lectures on Probability Theory and Statistics: Ecole d'Eté de Probabilités de Saint-Flour XXVIII - 1998, volume 1738 of Lecture Notes in Mathematics, 85--277 (Springer)
work page 2000
-
[22]
Pollard D (1990) Empirical Processes: Theory and Applications, volume 2 of NSF-CBMS Regional Conference Series in Probability and Statistics (Institute of Mathematical Statistics)
work page 1990
-
[23]
Rasmussen CE, Williams CKI (2006) Gaussian Processes for Machine Learning (MIT Press)
work page 2006
-
[24]
Advances in Computational Mathematics 42(4):973--993
Santin G, Schaback R (2016) Approximation of eigenfunctions in kernel-based spaces. Advances in Computational Mathematics 42(4):973--993
work page 2016
-
[25]
Proceedings of the 30th Annual Conference on Learning Theory, 1723--1742
Scarlett J, Bogunovic I, Cevher V (2017) Lower bounds on regret for noisy G aussian process bandit optimization. Proceedings of the 30th Annual Conference on Learning Theory, 1723--1742
work page 2017
-
[26]
Proceedings of the 27th International Conference on Machine Learning, 1015--1022
Srinivas N, Krause A, Kakade S, Seeger M (2010) Gaussian process optimization in the bandit setting: No regret and experimental design. Proceedings of the 27th International Conference on Machine Learning, 1015--1022
work page 2010
-
[27]
IEEE Transactions on Information Theory 58(5):3250--3265
Srinivas N, Krause A, Kakade SM, Seeger MW (2012) Information-theoretic regret bounds for Gaussian process optimization in the bandit setting. IEEE Transactions on Information Theory 58(5):3250--3265
work page 2012
-
[28]
Proceedings of the NeurIPS 2020 Competition and Demonstration Track, 3--26
Turner R, Eriksson D, McCourt M, Kiili J, Laaksonen E, Xu Z, Guyon I (2021) Bayesian optimization is superior to random search for machine learning hyperparameter tuning: Analysis of the black-box optimization challenge 2020. Proceedings of the NeurIPS 2020 Competition and Demonstration Track, 3--26
work page 2021
-
[29]
Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 82--90
Vakili S, Khezeli K, Picheny V (2021) On information gain and regret bounds in Gaussian process bandits. Proceedings of the 24th International Conference on Artificial Intelligence and Statistics, 82--90
work page 2021
-
[30]
van de Geer SA (2000) Empirical Processes in M-estimation (Cambridge University Press)
work page 2000
-
[31]
Journal of the American Statistical Association 115(530):920--930
Wang W, Tuo R, Wu CFJ (2020) On prediction properties of kriging: Uniform error bounds and robustness. Journal of the American Statistical Association 115(530):920--930
work page 2020
-
[32]
ACM Computing Surveys 55(13s):Article 287, 36 pages
Wang X, Jin Y, Schmitt S, Olhofer M (2023) Recent advances in Bayesian optimization. ACM Computing Surveys 55(13s):Article 287, 36 pages
work page 2023
-
[33]
Wendland H (2004) Scattered Data Approximation (Cambridge University Press)
work page 2004
-
[34]
Preprint available at https://arxiv.org/abs/2307.07539
Whitehouse J, Wu ZS, Ramdas A (2023) On the sublinear regret of GP-UCB . Preprint available at https://arxiv.org/abs/2307.07539
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.