hub

Watanabe, Tree-structured parzen estimator: Understanding its al- gorithm components and their roles for better empirical performance (2023)

· 2023 · cs.LG · arXiv 2304.11127

19 Pith papers cite this work. Polarity classification is still indexing.

19 Pith papers citing it

open full Pith review browse 19 citing papers arXiv PDF

abstract

Recent scientific advances require complex experiment design, necessitating the meticulous tuning of many experiment parameters. Tree-structured Parzen estimator (TPE) is a widely used Bayesian optimization method in recent parameter tuning frameworks such as Hyperopt and Optuna. Despite its popularity, the roles of each control parameter in TPE and the algorithm intuition have not been discussed so far. The goal of this paper is to identify the roles of each control parameter and their impacts on parameter tuning based on the ablation studies using diverse benchmark datasets. The recommended setting concluded from the ablation studies is demonstrated to improve the performance of TPE. Our TPE implementation used in this paper is available at https://github.com/nabenabe0928/tpe/tree/single-opt. OptunaHub now provides our standalone TPE implementation at https://hub.optuna.org/samplers/tpe_tutorial/.

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 3 method 1

citation-polarity summary

background 3 use method 1

representative citing papers

ArgBench: Benchmarking LLMs on Computational Argumentation Tasks

cs.CL · 2026-04-19 · unverdicted · novelty 8.0

ArgBench unifies 33 existing datasets into a standardized benchmark for testing LLMs across 46 argumentation tasks and analyzes the impact of prompting techniques and model factors on performance.

Optuna Constrained Tree-Structured Parzen Estimator Is a Joint Density Generalization of c-TPE

cs.LG · 2026-06-03 · unverdicted · novelty 6.0

Optuna's constrained TPE is joint c-TPE, the same expected constrained improvement acquisition function computed from a joint likelihood instead of an independence assumption.

Concise and Logically Consistent Conformal Sets for Neuro-Symbolic Concept-Based Models

cs.LG · 2026-05-18 · unverdicted · novelty 6.0

COCOCO is a conformal framework for NeSy-CBMs that jointly conformalizes concepts and labels, reconciles them via deduction-abduction revision, and satisfies consistency, coverage, and conciseness while retaining distribution-free guarantees.

PEML: Parameter-efficient Multi-Task Learning with Optimized Continuous Prompts

cs.CL · 2026-05-13 · unverdicted · novelty 6.0

PEML co-optimizes continuous prompts and low-rank adaptations to deliver up to 6.67% average accuracy gains over existing multi-task PEFT methods on GLUE, SuperGLUE, and other benchmarks.

From Clever Hans to Scientific Discovery: Interpreting EEG Foundational Transformers with LRP

cs.AI · 2026-05-12 · unverdicted · novelty 6.0

LRP on EEG transformers reveals Clever Hans artifacts in motor imagery tasks and a recurring central electrode cluster as a candidate sensorimotor signature of arousal.

Generative Flow Networks for Model Adaptation in Digital Twins of Natural Systems

cs.LG · 2026-04-22 · unverdicted · novelty 6.0

GFlowNets sample multiple valid mechanistic simulator configurations for digital twin adaptation, recovering main parameter regions and preserving uncertainty in a tomato model case study.

FluidFlow: a flow-matching generative model for fluid dynamics surrogates on unstructured meshes

cs.LG · 2026-03-30 · unverdicted · novelty 6.0

FluidFlow uses conditional flow-matching with U-Net and DiT architectures to predict pressure and friction coefficients on airfoils and 3D aircraft meshes, outperforming MLP baselines with better generalization.

PENEX: AdaBoost-Inspired Neural Network Regularization

cs.LG · 2025-10-02 · unverdicted · novelty 6.0

PENEX is a new formulation of the multi-class exponential loss for neural networks that supports first-order optimization and improves generalization in low-data regimes.

A Leaf-Level Dataset for Soybean-Cotton Detection and Segmentation

cs.CV · 2025-03-03 · unverdicted · novelty 6.0

A new leaf-instance dataset for soybean-cotton detection and segmentation collected across growth stages and conditions from commercial farms is presented and validated with YOLOv11.

Towards Autonomous Commissioning of Industrial Drives via Multi-Objective Bayesian Optimization

eess.SY · 2026-05-27 · unverdicted · novelty 5.0

Multi-objective Bayesian optimization with TPE tunes industrial drive current controllers to expert-level performance in minutes on real hardware without a model or firmware changes.

Toto 2.0: Time Series Forecasting Enters the Scaling Era

cs.LG · 2026-05-19 · unverdicted · novelty 5.0 · 2 refs

Time series foundation models scale under a single training recipe, with forecast quality improving from 4M to 2.5B parameters and new SOTA results on BOOM, GIFT-Eval, and TIME benchmarks.

Inferring identified hadron production in $pp$ collisions with physics-informed machine learning at the LHC

hep-ph · 2026-05-09 · unverdicted · novelty 5.0

A physics-informed neural network infers pT spectra of pi, K, p, Lambda, and Ks in unmeasured rapidity regions from PYTHIA8 pp collisions at 13.6 TeV, achieving 1.5-5.83% yield uncertainties while reproducing yield ratios and freeze-out parameters.

ORTHOBO: Orthogonal Bayesian Hyperparameter Optimization

cs.LG · 2026-05-07 · unverdicted · novelty 5.0

OrthoBO introduces an orthogonal acquisition estimator subtracting an optimally weighted score-function control variate to reduce Monte Carlo variance, preserve the acquisition target, and improve ranking stability in Bayesian hyperparameter optimization.

Combating the Memory Walls: Optimization Pathways for Long-Context Agentic LLM Inference

cs.AR · 2025-09-11 · unverdicted · novelty 5.0

PLENA introduces a co-designed system with three optimization pathways for long-context agentic LLM inference, claiming up to 2.23x throughput over A100 and 4.04x energy efficiency.

Survival of the Cheapest: Cost-Aware Hardware Adaptation for Adversarial Robustness

cs.CR · 2024-09-11 · unverdicted · novelty 5.0

A decision-support framework applies AFT models to show Nvidia L4 GPUs yield 20% longer adversarial survival time at 75% lower cost than V100, with inference latency as the strongest robustness predictor.

Optimizing Memory Allocation in Distributed Clusters with Predictive Modeling

cs.DC · 2026-04-20 · conditional · novelty 4.0

A quantile-regression ensemble with safety factor reduces under-allocated jobs from 4.17% to 2.89% and average overallocation from 148% to 44.51% on SAP build data.

Data-Driven Reduction of Fault Location Errors in Onshore Wind Farm Collectors

eess.SY · 2025-11-26 · unverdicted · novelty 4.0

A Gated Residual Network correction model reduces fault location error by 76% in simulated onshore wind farm collector networks compared to state-of-the-art methods.

Minimal Data, Maximum Clarity: A Heuristic for Explaining Optimization

cs.SE · 2025-09-10 · unverdicted · novelty 4.0

EZR combines active Naive Bayes sampling and decision-tree distillation to reach over 90% of best-known multi-objective optimization performance on 60 datasets while producing clearer explanations than LIME, SHAP or BreakDown.

Transformer-Based Active Learning for Data-Efficient Vaccine Epitope Selection in PRRS

q-bio.BM · 2026-06-27 · unverdicted · novelty 3.0

Transformer models under active learning classify high-binding epitopes from a small docking dataset more accurately than random sampling or other architectures in low-data regimes for PRRS.

citing papers explorer

Showing 1 of 1 citing paper after filters.

ArgBench: Benchmarking LLMs on Computational Argumentation Tasks cs.CL · 2026-04-19 · unverdicted · none · ref 71 · internal anchor
ArgBench unifies 33 existing datasets into a standardized benchmark for testing LLMs across 46 argumentation tasks and analyzes the impact of prompting techniques and model factors on performance.

Watanabe, Tree-structured parzen estimator: Understanding its al- gorithm components and their roles for better empirical performance (2023)

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer