pith. machine review for the scientific record. sign in

arxiv: 1012.2599 · v1 · submitted 2010-12-12 · 💻 cs.LG

Recognition: unknown

A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning

Eric Brochu, Nando de Freitas, Vlad M. Cora

classification 💻 cs.LG
keywords bayesianoptimizationfunctionareascostexpensivefunctionshierarchical
0
0 comments X
read the original abstract

We present a tutorial on Bayesian optimization, a method of finding the maximum of expensive cost functions. Bayesian optimization employs the Bayesian technique of setting a prior over the objective function and combining it with evidence to get a posterior function. This permits a utility-based selection of the next observation to make on the objective function, which must take into account both exploration (sampling from areas of high uncertainty) and exploitation (sampling areas likely to offer improvement over the current best observation). We also present two detailed extensions of Bayesian optimization, with experiments---active user modelling with preferences, and hierarchical reinforcement learning---and a discussion of the pros and cons of Bayesian optimization based on our experiences.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 10 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Autonomous operation of the DIAG0 diagnostic line for 6D phase-space monitoring at LCLS-II

    physics.acc-ph 2026-04 unverdicted novelty 7.0

    First autonomous 6D phase-space tomography system at LCLS-II achieves real-time beam reconstructions every 5-10 minutes via ML control and generative analysis.

  2. An Efficient Spatial Branch-and-Bound Algorithm for Global Optimization of Gaussian Process Posterior Mean Functions

    math.OC 2026-04 conditional novelty 7.0

    PALM-Mean combines sign-aware piecewise-linear relaxations of locally important kernel terms with closed-form analytic bounds on the rest inside a reduced-space branch-and-bound framework, yielding valid lower bounds ...

  3. ADKO: Agentic Decentralized Knowledge Optimization

    cs.LG 2026-05 unverdicted novelty 6.0

    ADKO is a decentralized framework where agents share compact GP-derived tokens and LM insights to achieve collaborative Bayesian optimization with a decomposed regret bound that includes compression and approximation losses.

  4. Estimating Decision Uncertainty from Preference Uncertainty: Application to Ground Vehicle Design

    stat.AP 2026-04 unverdicted novelty 6.0

    Preference uncertainty is modeled as random variables that induce a distribution over Pareto-optimal designs, analyzed via Sobol' indices, Shapley values, and Fréchet variance to assess decision stability in ground ve...

  5. Vibrotactile Preference Learning: Uncertainty-Aware Preference Learning for Personalized Vibration Feedback

    cs.HC 2026-04 unverdicted novelty 6.0

    VPL learns individualized vibrotactile preferences efficiently via uncertainty-aware Gaussian process models and active query selection in a 13-participant user study on an Xbox controller.

  6. Stein Variational Black-Box Combinatorial Optimization

    cs.AI 2026-04 unverdicted novelty 6.0

    Integrating Stein variational gradient descent into EDAs introduces repulsion among particles to jointly explore multiple optima in discrete black-box optimization, with competitive or superior results on large-scale ...

  7. Neural Global Optimization via Iterative Refinement from Noisy Samples

    cs.LG 2026-04 unverdicted novelty 6.0

    A neural model learns iterative refinement from noisy samples and spline inputs to find global minima, reporting 8.05% mean error on multi-modal tests versus 36.24% for spline initialization alone.

  8. Bayesian Optimization of Crossbar-Based Compute-In-Memory System Design for Efficient DNN Inference

    cs.ET 2026-05 unverdicted novelty 5.0

    A multi-objective Bayesian optimization framework co-optimizes CIM crossbar hardware and DNN parameters for VGG8/CIFAR-10 and VGG16/Tiny-ImageNet, achieving comparable accuracy with up to 65% smaller area and 52% lowe...

  9. Generative Augmentation of Imbalanced Flight Records for Flight Diversion Prediction: A Multi-objective Optimisation Framework

    cs.LG 2026-04 unverdicted novelty 5.0

    Hyperparameter-optimized generative models augment scarce flight diversion records and substantially improve prediction accuracy over real data alone.

  10. A Tutorial on Bayesian Optimization

    stat.ML 2018-07 unverdicted novelty 4.0

    Bayesian optimization uses Gaussian process regression to build a surrogate model and acquisition functions to guide sampling for optimizing costly objective functions, including a new formal generalization of expecte...