Practical Bayesian optimization of machine learning algorithms. Advances in Neural Information Processing Systems, 25

James Vuckovic · 2012 · Advances in Neural Information Processing Systems 35 · DOI 10.52202/068431-2059

6 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

SemaTune: Semantic-Aware Online OS Tuning with Large Language Models

cs.OS · 2026-05-14 · unverdicted · novelty 7.0

SemaTune uses LLM guidance with semantic context to tune up to 41 Linux OS parameters, delivering 72.5% performance gains over defaults and 153.3% over non-LLM baselines on 13 workloads while avoiding degraded states.

From Instance Selection to Fixed-Pool Data Recipe Search for Supervised Fine-Tuning

cs.LG · 2026-05-13 · conditional · novelty 7.0

AutoSelection discovers data recipes from a 90K instruction pool that outperform full-data training and other selectors on reasoning tasks for SFT across multiple models.

Back to the Beginning of Heuristic Design: Bridging Code and Knowledge with LLMs

cs.AI · 2026-05-07 · unverdicted · novelty 7.0

A knowledge-first approach to LLM-driven automatic heuristic design in combinatorial optimization yields better discovery efficiency, transfer, and generalization than code-centric baselines by formalizing a distortion-compression trade-off.

FLUID: Continuous-Time Hyperconnected Sparse Transformer for Sink-Free Learning

cs.LG · 2026-05-06 · unverdicted · novelty 7.0

FLUID is a continuous-time transformer using Liquid Attention Networks to model attention as stable ODE solutions that interpolate between discrete SDPA and CT-RNNs, with an explicit sink gate and liquid hyper-connections for better information flow.

When Losses Align: Gradient-Based Composite Loss Weighting for Efficient Pretraining

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

A bilevel method learns composite pretraining loss weights online via gradient alignment with a downstream objective, matching tuned baselines at roughly 30% extra cost over one training run.

Regret-Based $(\epsilon,\delta)$-optimal Stopping Criteria for Bayesian Optimization

cs.LG · 2026-05-21 · unverdicted · novelty 5.0

The paper derives provably tighter instantaneous regret bounds for GP-UCB and proposes (ε,δ)-optimal stopping criteria for Bayesian optimization based on those bounds.

citing papers explorer

Showing 6 of 6 citing papers.

SemaTune: Semantic-Aware Online OS Tuning with Large Language Models · cs.OS · 2026-05-14 · unverdicted · none · ref 73
From Instance Selection to Fixed-Pool Data Recipe Search for Supervised Fine-Tuning · cs.LG · 2026-05-13 · conditional · none · ref 16
Back to the Beginning of Heuristic Design: Bridging Code and Knowledge with LLMs · cs.AI · 2026-05-07 · unverdicted · none · ref 47
FLUID: Continuous-Time Hyperconnected Sparse Transformer for Sink-Free Learning · cs.LG · 2026-05-06 · unverdicted · none · ref 38
When Losses Align: Gradient-Based Composite Loss Weighting for Efficient Pretraining · cs.LG · 2026-05-08 · unverdicted · none · ref 7
Regret-Based $(\epsilon,\delta)$-optimal Stopping Criteria for Bayesian Optimization · cs.LG · 2026-05-21 · unverdicted · none · ref 19
