Practical Bayesian Optimization of Machine Learning Algorithms

[Online] · 2012 · stat.ML · arXiv 1206.2944

7 Pith papers cite this work. Polarity classification is still indexing.

7 Pith papers citing it

open full Pith review browse 7 citing papers arXiv PDF

abstract

Machine learning algorithms frequently require careful tuning of model hyperparameters, regularization terms, and optimization parameters. Unfortunately, this tuning is often a "black art" that requires expert experience, unwritten rules of thumb, or sometimes brute-force search. Much more appealing is the idea of developing automatic approaches which can optimize the performance of a given learning algorithm to the task at hand. In this work, we consider the automatic tuning problem within the framework of Bayesian optimization, in which a learning algorithm's generalization performance is modeled as a sample from a Gaussian process (GP). The tractable posterior distribution induced by the GP leads to efficient use of the information gathered by previous experiments, enabling optimal choices about what parameters to try next. Here we show how the effects of the Gaussian process prior and the associated inference procedure can have a large impact on the success or failure of Bayesian optimization. We show that thoughtful choices can lead to results that exceed expert-level performance in tuning machine learning algorithms. We also describe new algorithms that take into account the variable cost (duration) of learning experiments and that can leverage the presence of multiple cores for parallel experimentation. We show that these proposed algorithms improve on previous automatic procedures and can reach or surpass human expert-level optimization on a diverse set of contemporary algorithms including latent Dirichlet allocation, structured SVMs and convolutional neural networks.

representative citing papers

Kernel-based guarantees for nonlinear parametric models in Bayesian optimization

stat.ML · 2026-05-13 · unverdicted · novelty 7.0

A kernel framework over parameter space yields confidence bounds for regularized nonlinear models on adaptive data, supporting convergence analysis in Bayesian optimization.

Heterogeneous Sheaf Neural Networks

cs.LG · 2024-09-12 · unverdicted · novelty 7.0

HetSheaf applies cellular sheaves and type-conditioned restriction maps to heterogeneous graphs, plus SheafPool for basis-invariant graph-level representations, delivering competitive accuracy with substantially reduced parameter counts.

Neural Chain-of-Thought Search: Searching the Optimal Reasoning Path to Enhance Large Language Models

cs.CL · 2026-01-16 · unverdicted · novelty 6.0

NCoTS treats chain-of-thought reasoning as a search problem and uses a dual-factor heuristic to find paths that are over 3.5% more accurate and 22% shorter on benchmarks.

Chain of Uncertain Rewards with Large Language Models for Reinforcement Learning

cs.LG · 2026-04-15 · unverdicted · novelty 6.0

CoUR uses LLMs for efficient RL reward design through uncertainty quantification and similarity selection, achieving better performance and lower evaluation costs on IsaacGym and Bidexterous Manipulation benchmarks.

Caliper-in-the-Loop: Black-Box Optimization for Hyperledger Fabric Performance Tuning

cs.DC · 2026-05-04 · unverdicted · novelty 5.0

Bayesian optimization with dimensionality reduction improves Hyperledger Fabric throughput by up to 12% in a 317-dimensional configuration space via an automated Caliper benchmarking loop.

Dual-Stream EEG Decoding for 3D Visual Perception

cs.CV · 2026-06-20 · unverdicted · novelty 4.0

Dual-stream EEG decoder separates identity and orientation to support 3D reconstruction from neural signals via circular regression and conditioned diffusion.

Multi-Variable Batch Bayesian Optimization in Materials Research: Synthetic Data Analysis of Noise Sensitivity and Problem Landscape Effects

stat.ML · 2025-04-04 · unverdicted · novelty 3.0

Synthetic simulations show noise hurts needle-in-haystack optimization far more than smooth landscapes with local optima, and prior domain knowledge of noise and structure is needed for effective BO in materials research.

citing papers explorer

Showing 7 of 7 citing papers.

Kernel-based guarantees for nonlinear parametric models in Bayesian optimization stat.ML · 2026-05-13 · unverdicted · none · ref 19 · internal anchor
A kernel framework over parameter space yields confidence bounds for regularized nonlinear models on adaptive data, supporting convergence analysis in Bayesian optimization.
Heterogeneous Sheaf Neural Networks cs.LG · 2024-09-12 · unverdicted · none · ref 37 · internal anchor
HetSheaf applies cellular sheaves and type-conditioned restriction maps to heterogeneous graphs, plus SheafPool for basis-invariant graph-level representations, delivering competitive accuracy with substantially reduced parameter counts.
Neural Chain-of-Thought Search: Searching the Optimal Reasoning Path to Enhance Large Language Models cs.CL · 2026-01-16 · unverdicted · none · ref 10 · internal anchor
NCoTS treats chain-of-thought reasoning as a search problem and uses a dual-factor heuristic to find paths that are over 3.5% more accurate and 22% shorter on benchmarks.
Chain of Uncertain Rewards with Large Language Models for Reinforcement Learning cs.LG · 2026-04-15 · unverdicted · none · ref 17
CoUR uses LLMs for efficient RL reward design through uncertainty quantification and similarity selection, achieving better performance and lower evaluation costs on IsaacGym and Bidexterous Manipulation benchmarks.
Caliper-in-the-Loop: Black-Box Optimization for Hyperledger Fabric Performance Tuning cs.DC · 2026-05-04 · unverdicted · none · ref 14
Bayesian optimization with dimensionality reduction improves Hyperledger Fabric throughput by up to 12% in a 317-dimensional configuration space via an automated Caliper benchmarking loop.
Dual-Stream EEG Decoding for 3D Visual Perception cs.CV · 2026-06-20 · unverdicted · none · ref 43 · internal anchor
Dual-stream EEG decoder separates identity and orientation to support 3D reconstruction from neural signals via circular regression and conditioned diffusion.
Multi-Variable Batch Bayesian Optimization in Materials Research: Synthetic Data Analysis of Noise Sensitivity and Problem Landscape Effects stat.ML · 2025-04-04 · unverdicted · none · ref 3 · internal anchor
Synthetic simulations show noise hurts needle-in-haystack optimization far more than smooth landscapes with local optima, and prior domain knowledge of noise and structure is needed for effective BO in materials research.

Practical Bayesian Optimization of Machine Learning Algorithms

fields

years

verdicts

representative citing papers

citing papers explorer