Optuna: A Next-generation Hyperparameter Optimization Framework

Takuya Akiba , Shotaro Sano , Toshihiko Yanase , Takeru Ohta , Masanori Koyama

Authors on Pith no claims yet

classification 💻 cs.LG stat.ML

keywords softwareoptimizationoptunacriteriadefine-by-rundevelopmenthyperparameterintroduce

read the original abstract

The purpose of this study is to introduce new design-criteria for next-generation hyperparameter optimization software. The criteria we propose include (1) define-by-run API that allows users to construct the parameter search space dynamically, (2) efficient implementation of both searching and pruning strategies, and (3) easy-to-setup, versatile architecture that can be deployed for various purposes, ranging from scalable distributed computing to light-weight experiment conducted via interactive interface. In order to prove our point, we will introduce Optuna, an optimization software which is a culmination of our effort in the development of a next generation optimization software. As an optimization software designed with define-by-run principle, Optuna is particularly the first of its kind. We will present the design-techniques that became necessary in the development of the software that meets the above criteria, and demonstrate the power of our new design through experimental results and real world applications. Our software is available under the MIT license (https://github.com/pfnet/optuna/).

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 13 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Multivariate quantum reservoir computing with discrete and continuous variable systems
quant-ph 2026-04 unverdicted novelty 7.0

Quantum reservoirs handle multivariate time series best with task-specific encodings that leverage non-classical effects.
PEML: Parameter-efficient Multi-Task Learning with Optimized Continuous Prompts
cs.CL 2026-05 unverdicted novelty 6.0

PEML co-optimizes continuous prompts and low-rank adaptations to deliver up to 6.67% average accuracy gains over existing multi-task PEFT methods on GLUE, SuperGLUE, and other benchmarks.
Self-Supervised Laplace Approximation for Bayesian Uncertainty Quantification
stat.ML 2026-05 unverdicted novelty 6.0

SSLA approximates the posterior predictive distribution by refitting Bayesian models on self-predicted data, providing a sampling-free method that improves predictive calibration over classical Laplace approximations ...
On Privacy Leakage in Tabular Diffusion Models: Influential Factors, Attacker Knowledge, and Metrics
cs.LG 2026-05 unverdicted novelty 6.0

Tabular diffusion models leak membership information via attacks even with partial attacker knowledge, and common heuristic privacy metrics like distance-to-closest-record are unreliable.
Euclid preparation. CosmoPostProcess: A simulation calibrated framework for weak lensing selection bias in richness-selected galaxy clusters
astro-ph.CO 2026-05 unverdicted novelty 6.0

CosmoPostProcess delivers simulation-calibrated radial corrections for projection-induced selection bias (20-40% amplitude near 1 h^{-1} Mpc) and baryonic effects in Euclid richness-selected cluster weak lensing profiles.
Efficiently emulating distribution functions in gigaparsec volumes for varying cosmological parameters
astro-ph.CO 2026-04 conditional novelty 6.0

A new overdensity-conditioned emulator trained on small subvolumes from Quijote recovers the global halo mass function via integration over the overdensity distribution at 0.026% of the simulation cost.
Natural Language Embeddings of Synthesis and Testing conditions Enhance Glass Dissolution Prediction
cond-mat.mtrl-sci 2026-04 unverdicted novelty 6.0

Natural language embeddings of synthesis and testing conditions improve ML predictions of glass dissolution rates and enable generalization to out-of-distribution compositions with new elements.
Search for the lepton-flavour violating decays $B^+ \to \pi^+ \mu^\pm e^\mp$
hep-ex 2026-04 accept novelty 6.0

No signal observed for B+ → π+ μ± e∓; branching fraction upper limit set at 1.8 × 10^{-9} at 90% CL.
Inferring identified hadron production in $pp$ collisions with physics-informed machine learning at the LHC
hep-ph 2026-05 unverdicted novelty 5.0

A physics-informed neural network infers pT spectra of pi, K, p, Lambda, and Ks in unmeasured rapidity regions from PYTHIA8 pp collisions at 13.6 TeV, achieving 1.5-5.83% yield uncertainties while reproducing yield ra...
Improved Chase-Pyndiah Decoding for Product Codes with Scaled Messages
cs.IT 2026-04 unverdicted novelty 4.0

Scaling extrinsic messages by decoder confidence in Chase-Pyndiah decoding for product codes delivers a 0.1 dB gain over the baseline decoder.
VIGILant: an automatic classification pipeline for glitches in the Virgo detector
gr-qc 2026-04 unverdicted novelty 4.0

VIGILant applies tree-based models and a ResNet CNN to classify Virgo O3b glitches with 98% accuracy and has been deployed for daily use with an interactive dashboard.
PR3DICTR: A modular AI framework for medical 3D image-based detection and outcome prediction
cs.CV 2026-04 unverdicted novelty 4.0

PR3DICTR is a new open-access modular framework for 3D medical image classification and outcome prediction that works with as little as two lines of code.
An Automatic Ground Collision Avoidance System with Reinforcement Learning
cs.LG 2026-04 unverdicted novelty 3.0

The paper designs a reinforcement learning-based automatic ground collision avoidance system for jet trainers that uses limited observations and line-of-sight terrain queries to prevent collisions.