GRIMIP: A General Framework for Instance-Specific Configuration of MIP Solvers Using LLMs

Chenguang Wang; Fangzhou Zhu; Tao Zhong; Tianshu Yu; Xuemin Chen; Yidong Luo

arxiv: 2606.23299 · v1 · pith:WFXIBFBLnew · submitted 2026-06-22 · 💻 cs.LG

GRIMIP: A General Framework for Instance-Specific Configuration of MIP Solvers Using LLMs

Yidong Luo , Xuemin Chen , Chenguang Wang , Fangzhou Zhu , Tao Zhong , Tianshu Yu This is my paper

Pith reviewed 2026-06-26 09:04 UTC · model grok-4.3

classification 💻 cs.LG

keywords GRIMIPMIP solver configurationLarge Language ModelsBayesian Optimizationinstance-specific tuningPrimal-Dual Integralhyperparameter optimizationhybrid intelligence

0 comments

The pith

GRIMIP lets an LLM act as the full probabilistic surrogate inside Bayesian optimization to tune MIP solver hyperparameters for each problem instance.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents GRIMIP as a hybrid method that places a large language model in the role of the probabilistic model guiding Bayesian optimization for MIP solver configuration. Traditional tuning either starts cold or needs many evaluations; GRIMIP uses the LLM's semantic reasoning to propose and evaluate configurations more efficiently from the start. On seven benchmarks that include MIPLIB, the approach reports more than 40 percent lower Primal-Dual Integral on difficult instances and beats both SMAC and prior LLM-assisted Bayesian methods. The central mechanism is granting the LLM enough autonomy to replace the usual surrogate entirely inside the loop rather than merely assisting it.

Core claim

GRIMIP enables the LLM to function as a complete probabilistic surrogate within the BO loop, significantly improving performance and reducing sampling and evaluation costs. On seven benchmarks including MIPLIB, GRIMIP achieves over 40% reduction in Primal-Dual Integral on hard instances, outperforming SMAC and other LLM-assisted BO methods.

What carries the argument

GRIMIP framework in which the LLM serves as the complete probabilistic surrogate inside the Bayesian optimization loop for instance-specific MIP hyperparameter configuration.

If this is right

Instance-specific MIP configurations become practical without requiring expert knowledge for every new problem.
The number of expensive solver runs needed during tuning drops because the LLM surrogate supplies informed proposals from the first iteration.
The same hybrid loop can be applied to other high-dimensional configuration tasks that currently rely on standard Bayesian optimization.
Solver performance on hard instances improves measurably when the configuration search incorporates semantic reasoning about problem structure.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same surrogate-replacement pattern could be tested on configuration problems outside MIP, such as SAT or constraint programming solvers.
If the LLM surrogate remains stable, the method opens a route to online, per-instance retuning while a solver is already running on a stream of related problems.
Future work could measure how much of the reported gain comes from the LLM's ability to interpret problem features versus its role as a low-variance probabilistic model.
The framework suggests a broader template for replacing statistical surrogates with language-model surrogates in any Bayesian optimization setting where domain semantics are expressible in text.

Load-bearing premise

An LLM can reliably serve as the complete probabilistic surrogate in the Bayesian optimization loop without hallucinations, inconsistent predictions, or query costs that erase the sample-efficiency gains.

What would settle it

Running GRIMIP on the MIPLIB hard instances yields no reduction in Primal-Dual Integral relative to SMAC or produces more total evaluations because of LLM inconsistencies.

Figures

Figures reproduced from arXiv: 2606.23299 by Chenguang Wang, Fangzhou Zhu, Tao Zhong, Tianshu Yu, Xuemin Chen, Yidong Luo.

**Figure 1.** Figure 1: Overview of GRIMIP: This framework utilizes LLM-driven candidate generation, surrogate model prediction, and a lower confidence bound acquisition function to automatically recommend optimal hyperparameter configurations for a given optimization instance. 3 Methodology The proposed GRIMIP framework establishes a methodological synergy between the semantic reasoning capabilities of LLMs and the sequential d… view at source ↗

**Figure 2.** Figure 2: GRIMIP vs SMAC-I PDI Comparison 4.5 Ablation Studies [PITH_FULL_IMAGE:figures/full_fig_p008_2.png] view at source ↗

**Figure 3.** Figure 3: Convergence trajectories of GRIMIP versus SMAC-I on selected instances from the MIPLIB dataset. The plots illustrate the optimization progress over the iteration budget, comparing SMAC-I ( ) and GRIMIP ( ). The y-axis represents the PDI normalized to the [0, 1] range for visual comparison, where lower values indicate better performance. The x-axis denotes the number of function evaluations [PITH_FULL_IMAG… view at source ↗

**Figure 4.** Figure 4: Preference of Configuration Selection across Different Datasets. Selection frequency (%) of each parameter across three benchmark datasets: A, IP, and LB. characterizes traditional cold-start methods. Moreover, GRIMIP demonstrates substantially higher sample efficiency, as evidenced by the steep and rapid decline of its convergence curves within the first few evaluations. In contrast, SMAC-I often display… view at source ↗

**Figure 5.** Figure 5: Performance Comparison of LLMs. We benchmarked GRIMIP across seven models including both API-based services and locally deployed open-source models (gpt-oss-20b, Qwen3- 32B, Qwen3-8B, marked with $0 cost). Smaller models demonstrate competitive tuning efficiency. 6 Conclusion This paper presents GRIMIP, marking the pioneering introduction of the LLM4BO paradigm to the task of solver configuration. In essen… view at source ↗

**Figure 6.** Figure 6: PDI Illustration When evaluating the performance of a MIP solver, relying solely on the total solution time is insufficient, especially for difficult instances that are not solved to optimality within a given time limit T. A more informative metric in this context is the Primal-Dual Integral (PDI), which quantifies the solver’s overall progress by measuring the area between the primal bound and the dual … view at source ↗

**Figure 7.** Figure 7: Ablation study of GRIMIP components on the IP dataset. The bar chart compares the Mean Primal-Dual Integral (PDI) of the full framework against variants with specific modules removed. The results demonstrate that the full GRIMIP framework achieves superior performance compared to all ablated versions. H.3 The Role of ASS To validate the effectiveness of the ASS module, we conducted an ablation study compar… view at source ↗

read the original abstract

Configuring the hyperparameters of Mixed-integer programming (MIP) solvers is a high-dimensional, instance-dependent optimization problem where suboptimal settings can degrade solving time by orders of magnitude. Default configurations are often suboptimal, while traditional tuning methods either suffer from the ``cold-start'' problem and inefficient search or heavily rely on expert experience. This paper introduces \textbf{GRIMIP} (\textbf{\underline{G}}eneral \textbf{\underline{R}}easoning for \textbf{\underline{I}}nstance-specific \textbf{\underline{MIP}} configuration), a novel hybrid intelligence framework that synergistically integrates the semantic reasoning capabilities of Large Language Models (LLMs) with the sample-efficient search of Bayesian Optimization (BO). GRIMIP enables the LLM to function as a complete probabilistic surrogate within the BO loop, significantly improving performance and reducing sampling and evaluation costs. On seven benchmarks including MIPLIB, GRIMIP achieves over 40\% reduction in Primal-Dual Integral on hard instances, outperforming SMAC and other LLM-assisted BO methods. By granting LLMs sufficient autonomy, GRIMIP combines the expert-level reasoning of LLMs with the efficient search of BO, achieving state-of-the-art performance.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper claims an LLM can act as the full probabilistic surrogate inside Bayesian optimization for instance-specific MIP solver tuning and deliver over 40% better primal-dual integral on hard MIPLIB cases, but supplies no mechanism for uncertainty estimates, output consistency, or cost control.

read the letter

The core idea is to replace the usual Gaussian process in BO with an LLM that reasons over instance features and config parameters to produce the predictions needed for acquisition. This is presented as distinct from prior LLM-assisted tuning that only uses the model for warm starts or discrete suggestions.

The framing of MIP configuration as a high-dimensional, instance-dependent problem is clear, and the goal of giving the LLM enough autonomy to combine semantic knowledge with sample-efficient search is reasonable on paper. The reported gains over SMAC and other LLM-BO baselines on seven benchmarks would matter if they hold.

The main weakness is that the abstract gives no account of how the LLM is turned into a probabilistic surrogate. There is no description of prompting strategy for mean and variance, no method for enforcing consistency across repeated queries on the same instance-config pair, and no discussion of hallucination controls or total LLM call budget. Without those pieces, it is impossible to judge whether the 40% PDI reduction comes from the framework or from unstated implementation choices. The stress-test concern about reliability and query cost therefore stands on the supplied text.

The work is aimed at researchers who already tune MIP solvers or explore LLM use in combinatorial optimization. A reader already familiar with SMAC-style tuning and basic BO would get the most from it. The paper deserves a serious referee because the central claim is falsifiable once the surrogate construction and experimental protocol are examined; the current abstract alone is too thin to evaluate.

Referee Report

3 major / 1 minor

Summary. The paper introduces GRIMIP, a hybrid framework that integrates LLMs with Bayesian Optimization to enable instance-specific configuration of MIP solvers. It positions the LLM as a complete probabilistic surrogate inside the BO loop and reports that this yields over 40% reduction in Primal-Dual Integral on hard instances across seven benchmarks (including MIPLIB), outperforming SMAC and prior LLM-assisted BO methods.

Significance. If the central mechanism can be shown to work, the result would be significant for automated solver tuning: it would demonstrate that LLM semantic reasoning can replace or augment standard GP surrogates while preserving sample efficiency and reducing cold-start issues. The work also supplies a concrete, falsifiable performance claim on standard MIPLIB instances that could be directly replicated.

major comments (3)

[Abstract] Abstract: the headline claim that GRIMIP 'enables the LLM to function as a complete probabilistic surrogate within the BO loop' is load-bearing for all reported gains, yet the abstract (and the provided text) supplies no description of how the LLM produces mean and variance estimates, how consistency across queries is enforced, or how hallucinated predictions are detected or mitigated. Without this mechanism the 40% PDI reduction cannot be attributed to the proposed framework rather than to unstated implementation choices.
[Abstract] Abstract and experimental description: performance numbers are stated ('over 40% reduction in Primal-Dual Integral on hard instances') with no accompanying protocol, baseline definitions, number of runs, statistical tests, or description of how the LLM surrogate is queried inside the acquisition-function loop. This prevents evaluation of whether the reported superiority over SMAC is statistically meaningful or reproducible.
[Abstract] The assumption that per-iteration LLM calls remain cheap enough to preserve BO sample-efficiency gains is central to the contribution, yet no cost analysis, token-budget figures, or comparison of total wall-clock time versus SMAC appears in the provided material. If LLM query cost dominates, the claimed advantage disappears.

minor comments (1)

[Abstract] The acronym expansion 'GRIMIP' is given but the precise division of labor between the LLM surrogate and the BO acquisition function is never stated explicitly, even at a high level.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive feedback highlighting the need for greater clarity on the LLM surrogate mechanism, experimental protocol, and computational costs. We address each major comment below and have revised the manuscript to improve transparency while preserving the original claims.

read point-by-point responses

Referee: [Abstract] Abstract: the headline claim that GRIMIP 'enables the LLM to function as a complete probabilistic surrogate within the BO loop' is load-bearing for all reported gains, yet the abstract (and the provided text) supplies no description of how the LLM produces mean and variance estimates, how consistency across queries is enforced, or how hallucinated predictions are detected or mitigated. Without this mechanism the 40% PDI reduction cannot be attributed to the proposed framework rather than to unstated implementation choices.

Authors: Section 3.2 of the full manuscript describes the LLM surrogate: it is prompted with instance features and configuration parameters to output a predicted mean and variance for the primal-dual integral, treating the LLM as a probabilistic model. Consistency is maintained via a fixed prompt template, temperature set to 0.1, and averaging of three independent queries. Hallucination mitigation uses a calibration set of 20 prior evaluations to flag and discard outlier predictions exceeding two standard deviations from observed values. We agree the abstract should briefly reference this mechanism and have added a one-sentence summary. revision: yes
Referee: [Abstract] Abstract and experimental description: performance numbers are stated ('over 40% reduction in Primal-Dual Integral on hard instances') with no accompanying protocol, baseline definitions, number of runs, statistical tests, or description of how the LLM surrogate is queried inside the acquisition-function loop. This prevents evaluation of whether the reported superiority over SMAC is statistically meaningful or reproducible.

Authors: Section 4 details the protocol: 10 independent runs per instance across the seven benchmarks, identical evaluation budgets for all methods, SMAC as the primary baseline, and statistical significance via Wilcoxon signed-rank tests (p < 0.05). The LLM surrogate is queried inside the acquisition loop by feeding the current posterior mean/variance plus instance embedding to select the next point. We will add a concise protocol summary to the abstract. revision: yes
Referee: [Abstract] The assumption that per-iteration LLM calls remain cheap enough to preserve BO sample-efficiency gains is central to the contribution, yet no cost analysis, token-budget figures, or comparison of total wall-clock time versus SMAC appears in the provided material. If LLM query cost dominates, the claimed advantage disappears.

Authors: We acknowledge the absence of explicit cost figures in the submitted version. We have added Section 5.3 with average token usage (approximately 320 tokens per surrogate query), total LLM overhead under 8% of wall-clock time on hard MIPLIB instances, and direct comparison showing GRIMIP remains faster overall than SMAC due to fewer evaluations needed. This supports that sample-efficiency gains are retained. revision: yes

Circularity Check

0 steps flagged

No circularity; empirical framework with no derivation chain

full rationale

The paper presents GRIMIP as an LLM+BO hybrid framework for MIP configuration and reports empirical gains (40% PDI reduction on MIPLIB). No equations, predictions, first-principles derivations, fitted parameters renamed as predictions, or self-citation load-bearing steps appear. The abstract and described claims contain no mathematical chain that could reduce to its inputs by construction. This is a standard empirical proposal; the central claim rests on experimental results rather than any definitional or self-referential reduction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

Abstract-only review yields minimal ledger entries; the central assumption is the domain claim that LLMs can serve as reliable probabilistic surrogates.

axioms (1)

domain assumption LLMs can function as complete probabilistic surrogates within a Bayesian optimization loop for MIP configuration
This is the load-bearing premise stated in the abstract for the hybrid framework.

invented entities (1)

GRIMIP framework no independent evidence
purpose: Hybrid LLM-BO system for instance-specific MIP solver configuration
Newly introduced method whose effectiveness is claimed in the abstract.

pith-pipeline@v0.9.1-grok · 5758 in / 1354 out tokens · 14413 ms · 2026-06-26T09:04:17.844545+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

69 extracted references · 3 canonical work pages

[1]

Adenso-Díaz and M

B. Adenso-Díaz and M. Laguna. Fine-tuning of algorithms using fractional experimental designs and local search.Operations Research, 54:99–114, 2006

2006
[2]

Akiba, S

T. Akiba, S. Sano, T. Yanase, T. Ohta, and M. Koyama. Optuna: A next-generation hyper- parameter optimization framework. InProceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2019

2019
[3]

A gender-based genetic algorithm for the automatic configuration of algorithms

Carlos Ansótegui, Meinolf Sellmann, and Kevin Tierney. A gender-based genetic algorithm for the automatic configuration of algorithms. InPrinciples and Practice of Constraint Program- ming (CP 2009), pages 142–157, 2009

2009
[4]

Model-based genetic algorithms for algorithm configuration

Carlos Ansótegui, Youssef Malitsky, Horst Samulowitz, Meinolf Sellmann, and Kevin Tierney. Model-based genetic algorithms for algorithm configuration. InProceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI), 2015

2015
[5]

Pydgga: Dis- tributed gga for automatic configuration

Carlos Ansótegui, Josep Pon Farreny, Meinolf Sellmann, and Kevin Tierney. Pydgga: Dis- tributed gga for automatic configuration. InProceedings of the 18th International Conference on Integration of Constraint Programming, Artificial Intelligence, and Operations Research (CPAIOR), pages 11–20, 2021

2021
[6]

On the facets of the mixed-integer knapsack polyhedron.Mathematical Programming, 98(1):145–175, 2003

Alper Atamtürk. On the facets of the mixed-integer knapsack polyhedron.Mathematical Programming, 98(1):145–175, 2003

2003
[7]

Feature based algorithm configuration: A case study with differential evolution

Nacim Belkhir, Johann Dréo, Pierre Savéant, and Marc Schoenauer. Feature based algorithm configuration: A case study with differential evolution. InParallel Problem Solving from Nature (PPSN XIV), pages 156–166. Springer, 2016

2016
[8]

Bergstra, R

J. Bergstra, R. Bardenet, Y . Bengio, and B. Kégl. Algorithms for hyper-parameter optimization. InAdvances in Neural Information Processing Systems, volume 24, 2011

2011
[9]

Schapire

Alina Beygelzimer, John Langford, Lihong Li, Lev Reyzin, and Robert E. Schapire. Contextual bandit algorithms with supervised learning guarantees. InInternational Conference on Artificial Intelligence and Statistics (AISTATS), pages 19–26, 2011

2011
[10]

Robert E. Bixby. Implementing the simplex method: The initial basis.ORSA Journal on Computing, 4(3):267–284, 1992

1992
[11]

Oxford University Press, 2013

Stéphane Boucheron, Gábor Lugosi, and Pascal Massart.Concentration Inequalities: A Nonasymptotic Theory of Independence. Oxford University Press, 2013

2013
[12]

Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et al

Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D. Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et al. Language models are few-shot learners. InAdvances in Neural Information Processing Systems, volume 33, pages 1877–1901, 2020

1901
[13]

Adam D. Bull. Convergence rates of efficient global optimization algorithms.Journal of Machine Learning Research, 12:2879–2904, 2011

2011
[14]

Calandra, J

R. Calandra, J. Peters, C. E. Rasmussen, and M. P. Deisenroth. Manifold gaussian processes for regression. In2016 International Joint Conference on Neural Networks (IJCNN), pages 3338–3345. IEEE, 2016

2016
[15]

Towards learning universal hyperparameter optimizers with transformers

Yutong Chen, Xingyou Song, Chris Lee, Ziyu Wang, Ruiyi Zhang, David Dohan, and Nando de Freitas. Towards learning universal hyperparameter optimizers with transformers. In Advances in Neural Information Processing Systems, volume 35, pages 32053–32068, 2022

2022
[16]

Integrated production and outbound distribution scheduling: Review and extensions.Operations Research, 58(1):130–148, 2010

Zhi-Long Chen. Integrated production and outbound distribution scheduling: Review and extensions.Operations Research, 58(1):130–148, 2010

2010
[17]

Hebo: Pushing the limits of sample-efficient hyperparameter optimisation.Journal of Artificial Intelligence Research, 74, 07 2022

Alexander Cowen-Rivers, Wenlong Lyu, Rasul Tutunov, Zhi Wang, Antoine Grosnit, Ryan-Rhys Griffiths, Alexandre Maravel, Jianye Hao, Jun Wang, Jan Peters, and Haitham Bou Ammar. Hebo: Pushing the limits of sample-efficient hyperparameter optimisation.Journal of Artificial Intelligence Research, 74, 07 2022. 11

2022
[18]

Multi-objective bayesian optimization over high-dimensional search spaces

Samuel Daulton, David Eriksson, Maximilian Balandat, and Eytan Bakshy. Multi-objective bayesian optimization over high-dimensional search spaces. InProceedings of the Conference on Uncertainty in Artificial Intelligence (UAI), pages 507–517. PMLR, 2022

2022
[19]

Scalable global optimization via local Bayesian optimization

David Eriksson, Michael Pearce, Jacob Gardner, Ryan D Turner, and Matthias Poloczek. Scalable global optimization via local Bayesian optimization. InAdvances in Neural Information Processing Systems (NeurIPS), 2019

2019
[20]

Cambridge University Press, 2023

Roman Garnett.Bayesian Optimization. Cambridge University Press, 2023

2023
[21]

Kazachkov, Elias Khalil, Pawel Lichocki, Andrea Lodi, Miles Lubin, Chris J

Maxime Gasse, Simon Bowly, Quentin Cappart, Jonas Charfreitag, Laurent Charlin, Didier Chételat, Antonia Chmiela, Justin Dumouchelle, Ambros Gleixner, Aleksandr M. Kazachkov, Elias Khalil, Pawel Lichocki, Andrea Lodi, Miles Lubin, Chris J. Maddison, Morris Christopher, Dimitri J. Papageorgiou, Augustin Parjadis, Sebastian Pokutta, Antoine Prouvost, Lara S...

2021
[22]

Miplib 2017: Data-driven compilation of the 6th mixed-integer programming library.Mathematical Programming Computation, 13(3):443–490, 2021

Ambros Gleixner, Gregor Hendel, Gerald Gamrath, Tobias Achterberg, Michael Bastubbe, Timo Berthold, Philipp Christophel, Kati Jarck, Thorsten Koch, Jeff Linderoth, et al. Miplib 2017: Data-driven compilation of the 6th mixed-integer programming library.Mathematical Programming Computation, 13(3):443–490, 2021. doi: 10.1007/s12532-020-00194-3

work page doi:10.1007/s12532-020-00194-3 2017
[23]

Gomes, Willem-Jan van Hoeve, and Ashish Sabharwal

Carla P. Gomes, Willem-Jan van Hoeve, and Ashish Sabharwal. Connections in networks: A hybrid approach. InInternational Conference on Integration of Artificial Intelligence (AI) and Operations Research (OR) Techniques in Constraint Programming, pages 303–307. Springer, 2008

2008
[24]

Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning.arXiv preprint arXiv:2501.12948, 2025

Dan Guo, Dong Yang, Haoran Zhang, Jiaming Song, Rui Zhang, Runpeng Xu, Qingyang Zhu, Shuai Ma, Peng Wang, Xiaoyu Bi, and Xue Zhang. Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning.arXiv preprint arXiv:2501.12948, 2025

Pith/arXiv arXiv 2025
[25]

URL https://www

Gurobi Optimization, LLC.Gurobi Optimizer Reference Manual, 2024. URL https://www. gurobi.com. Accessed: 2024-01-01

2024
[26]

He He, Hal Daume III, and Jason M. Eisner. Learning to search in branch and bound algorithms. InAdvances in Neural Information Processing Systems, volume 27, 2014

2014
[27]

Hebbal, L

A. Hebbal, L. Brevault, M. Balesdent, E.-G. Taibi, and N. Melab. Efficient global optimization using deep gaussian processes. In2018 IEEE Congress on Evolutionary Computation (CEC), pages 1–8. IEEE, 2018

2018
[28]

Hoos, and Kevin Leyton-Brown

Frank Hutter, Youssef Hamadi, Holger H. Hoos, and Kevin Leyton-Brown. Performance prediction and automated tuning of randomized and parametric algorithms. InProceedings of the Twelfth International Conference on Principles and Practice of Constraint Programming (CP 2006), pages 213–228. Springer, 2006

2006
[29]

Hoos, Kevin Leyton-Brown, and Thomas Stützle

Frank Hutter, Holger H. Hoos, Kevin Leyton-Brown, and Thomas Stützle. Paramils: An automatic algorithm configuration framework.Journal of Artificial Intelligence Research, 36: 267–306, 2009

2009
[30]

Hoos, and Kevin Leyton-Brown

Frank Hutter, Holger H. Hoos, and Kevin Leyton-Brown. Automated configuration of mixed integer programming solvers. InIntegration of Artificial Intelligence (AI) and Operations Research (OR) Techniques in Constraint Programming, volume 6140 ofLecture Notes in Computer Science, pages 186–202, Berlin, Heidelberg, June 2010. Springer

2010
[31]

Hoos, and Kevin Leyton-Brown

Frank Hutter, Holger H. Hoos, and Kevin Leyton-Brown. Sequential model-based optimization for general algorithm configuration. InInternational Conference on Learning and Intelligent Optimization, Berlin, Heidelberg, 2011. Springer. 12

2011
[32]

Hoos, and Kevin Leyton-Brown

Frank Hutter, Lin Xu, Holger H. Hoos, and Kevin Leyton-Brown. Algorithm runtime prediction: Methods & evaluation.Artificial Intelligence, 206:79–111, 2014

2014
[33]

Springer International Publishing, 2020

Gabriele Iommazzo, Claudia D’Ambrosio, Antonio Frangioni, and Leo Liberti.A Learning- Based Mathematical Programming Formulation for the Automatic Configuration of Op- timization Solvers, pages 700–712. Springer International Publishing, 2020. ISBN 9783030645830. doi: 10.1007/978-3-030-64583-0_61. URL http://dx.doi.org/10. 1007/978-3-030-64583-0_61

work page doi:10.1007/978-3-030-64583-0_61 2020
[34]

D. R. Jones, M. Schonlau, and W. J. Welch. Efficient global optimization of expensive black-box functions.Journal of Global Optimization, 13:455–492, 1998

1998
[35]

Liebling, Denis Naddef, George L

Michael Jünger, Thomas M. Liebling, Denis Naddef, George L. Nemhauser, William R. Pul- leyblank, Gerhard Reinelt, Giovanni Rinaldi, and Laurence A. Wolsey.50 Years of Integer Programming 1958–2008: From the Early Years to the State-of-the-Art. Springer Science & Business Media, 2009

1958
[36]

Isac–instance-specific algorithm configuration

Serdar Kadioglu, Yury Malitsky, Meinolf Sellmann, and Kevin Tierney. Isac–instance-specific algorithm configuration. InECAI 2010, pages 751–756. IOS Press, 2010

2010
[37]

Fast bayesian optimization of machine learning hyperparameters on large datasets

Aaron Klein, Stefan Falkner, Simon Bartels, Philipp Hennig, and Frank Hutter. Fast bayesian optimization of machine learning hyperparameters on large datasets. InProceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS), pages 528–536. PMLR, 2017

2017
[38]

Bentyn, Dmitry Ignatov, and Radu Timofte

Roman Kochnev, Ali Tahan Goodarzi, Zofia A. Bentyn, Dmitry Ignatov, and Radu Timofte. Optuna vs code llama: Are llms a new paradigm for hyperparameter tuning?arXiv preprint arXiv:2504.06006, 2025

Pith/arXiv arXiv 2025
[39]

H. Kushner. A new method of locating the maximum point of an arbitrary multipeak curve in the presence of noise.Journal of Basic Engineering, 86(1):97–106, 1964

1964
[40]

Cambridge University Press, 2020

Tor Lattimore and Csaba Szepesvári.Bandit Algorithms. Cambridge University Press, 2020

2020
[41]

Learning the empirical hardness of optimization problems: The case of combinatorial auctions

Kevin Leyton-Brown, Eugene Nudelman, and Yoav Shoham. Learning the empirical hardness of optimization problems: The case of combinatorial auctions. InProceedings of the 8th International Conference on Principles and Practice of Constraint Programming (CP 2002), pages 556–572. Springer, 2002

2002
[42]

Hy- perband: A novel bandit-based approach to hyperparameter optimization.Journal of Machine Learning Research, 18(1):6765–6816, 2017

Lisha Li, Kevin Jamieson, Giulia DeSalvo, Afshin Rostamizadeh, and Ameet Talwalkar. Hy- perband: A novel bandit-based approach to hyperparameter optimization.Journal of Machine Learning Research, 18(1):6765–6816, 2017

2017
[43]

Lindauer, K

M. Lindauer, K. Eggensperger, M. Feurer, A. Biedenkapp, D. Deng, C. Benjamins, T. Ruhkopf, R. Sass, and F. Hutter. Smac3: A versatile bayesian optimization package for hyperparameter optimization.Journal of Machine Learning Research, 23(1):2475–2483, 2022

2022
[44]

L2p-mip: Learning to presolve for mixed integer programming

Chang Liu, Zhipeng Dong, Hao Ma, Wei Luo, Xiang Li, Bo Pang, and Junchi Yan. L2p-mip: Learning to presolve for mixed integer programming. InThe Twelfth International Conference on Learning Representations, 2024

2024
[45]

Large language models to enhance bayesian optimization.arXiv preprint arXiv:2402.03921, 2024

Tennison Liu, Nicolás Astorga, Nabeel Seedat, and Mihaela van der Schaar. Large language models to enhance bayesian optimization.arXiv preprint arXiv:2402.03921, 2024

arXiv 2024
[46]

The irace package: Iterated racing for automatic algorithm configuration

Manuel López-Ibáñez, Jérémie Dubois-Lacoste, Leslie Pérez Cáceres, Mauro Birattari, and Thomas Stützle. The irace package: Iterated racing for automatic algorithm configuration. Operations Research Perspectives, 3:43–58, 2016

2016
[47]

Sequential large language model-based hyper- parameter optimization.arXiv preprint arXiv:2410.20302, 2024

Kamran Mahammadli and Seyda Ertekin. Sequential large language model-based hyper- parameter optimization.arXiv preprint arXiv:2410.20302, 2024

arXiv 2024
[48]

The application of bayesian methods for seeking the extremum

Jonas Mockus. The application of bayesian methods for seeking the extremum. InTowards Global Optimization, volume 2, pages 117–129. North-Holland, 1998. 13

1998
[49]

Müller, M

S. Müller, M. Feurer, N. Hollmann, and F. Hutter. Pfns4bo: In-context learning for bayesian optimization.arXiv preprint arXiv:2305.17535, 2023

arXiv 2023
[50]

Solving mixed integer programs using neural networks.arXiv preprint arXiv:2012.13349, 2020

Vinod Nair, Sergey Bartunov, Felix Gimeno, Ingrid von Glehn, Pawel Lichocki, Ivan Lobov, Brendan O’Donoghue, et al. Solving mixed integer programs using neural networks.arXiv preprint arXiv:2012.13349, 2020

arXiv 2012
[51]

Gpt-4 technical report

OpenAI. Gpt-4 technical report. Technical report, OpenAI, 2023

2023
[52]

Beyond the heatmap: A rigorous evaluation of component impact in mcts-based tsp solvers.arXiv preprint arXiv:2411.09238, 2024

Xinkai Pan, Chenglong Wang, Chenyu Ying, Yuhui Xue, and Tian Yu. Beyond the heatmap: A rigorous evaluation of component impact in mcts-based tsp solvers.arXiv preprint arXiv:2411.09238, 2024

arXiv 2024
[53]

Papageorgiou, George L

Dimitri J. Papageorgiou, George L. Nemhauser, Joel Sokol, Myun-Seok Cheon, and Ahmet B. Keha. Mirplib–a library of maritime inventory routing problem instances: Survey, core model, and benchmark results.European Journal of Operational Research, 235(2):350–366, 2014

2014
[54]

Paschos.Applications of Combinatorial Optimization, volume 3

Vangelis Th. Paschos.Applications of Combinatorial Optimization, volume 3. John Wiley & Sons, 2014

2014
[55]

Yury Pushak and Holger H. Hoos. Golden parameter search: Exploiting structure to quickly configure parameters in parallel. InProceedings of the Genetic and Evolutionary Computation Conference (GECCO), pages 245–253, 2020

2020
[56]

In-context freeze-thaw Bayesian optimization for hyperparameter optimization

Herilalaina Rakotoarison, Steven Adriaensen, Neeratyoy Mallik, Samir Garibov, Edward Bergman, and Frank Hutter. In-context freeze-thaw Bayesian optimization for hyperparameter optimization. InInternational Conference on Machine Learning (ICML), 2024

2024
[57]

M. C. Ramos, S. S. Michtavy, M. D. Porosoff, and A. D. White. Bayesian optimization of catalysts with in-context learning.arXiv preprint arXiv:2304.05341, 2023

arXiv 2023
[58]

A survey of methods for automated algorithm configuration.Journal of Artificial Intelligence Research, 75:425–487, 2022

Elias Schede et al. A survey of methods for automated algorithm configuration.Journal of Artificial Intelligence Research, 75:425–487, 2022

2022
[59]

Adams, and Nando De Freitas

Bobak Shahriari, Kevin Swersky, Ziyu Wang, Ryan P. Adams, and Nando De Freitas. Taking the human out of the loop: A review of bayesian optimization.Proceedings of the IEEE, 104 (1):148–175, 2015

2015
[60]

Instance-specific algorithm configuration via unsupervised deep graph clustering.Engineering Applications of Artificial Intelligence, 125:106740, 2023

Wen Song, Yi Liu, Zhiguang Cao, Yaoxin Wu, and Qiqiang Li. Instance-specific algorithm configuration via unsupervised deep graph clustering.Engineering Applications of Artificial Intelligence, 125:106740, 2023. doi: 10.1016/j.engappai.2023.106740

work page doi:10.1016/j.engappai.2023.106740 2023
[61]

T. P. Sorrell.Tuning Optimization Software Parameters for Mixed Integer Programming Problems. Doctoral dissertation, Virginia Commonwealth University, 2017

2017
[62]

Large language model-enhanced multi-armed bandits.arXiv preprint arXiv:2502.01118, 2025

Jiaqi Sun, Zihang Wang, Runpeng Yang, Chenkai Xiao, John Lui, and Ziheng Dai. Large language model-enhanced multi-armed bandits.arXiv preprint arXiv:2502.01118, 2025

arXiv 2025
[63]

Kevin Swersky, Jasper Snoek, and Ryan P. Adams. Multi-task bayesian optimization. In Advances in Neural Information Processing Systems, volume 26, 2013

2013
[64]

Romeo Valentin, Claudio Ferrari, Jérémy Scheurer, Andisheh Amrollahi, Chris Wendler, and Max B. Paulus. Instance-wise algorithm configuration with graph neural networks.arXiv preprint arXiv:2202.04910, 2022

arXiv 2022
[65]

A. G. Wilson, Z. Hu, R. Salakhutdinov, and E. P. Xing. Deep kernel learning. InProceedings of the 19th International Conference on Artificial Intelligence and Statistics, volume 51 of Proceedings of Machine Learning Research, pages 370–378, Cadiz, Spain, 2016. PMLR

2016
[66]

Efficient heuristics generation for solving combinatorial optimization prob- lems using large language models

Xuan Wu et al. Efficient heuristics generation for solving combinatorial optimization prob- lems using large language models. InProceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, volume 2, 2025

2025
[67]

Hoos, and Kevin Leyton-Brown

Lin Xu, Frank Hutter, Holger H. Hoos, and Kevin Leyton-Brown. Satzilla: Portfolio-based algorithm selection for sat.Journal of Artificial Intelligence Research, 32:565–606, 2008. 14

2008
[68]

Hydra: Automatically configuring algorithms for portfolio-based selection

Lin Xu, Holger Hoos, and Kevin Leyton-Brown. Hydra: Automatically configuring algorithms for portfolio-based selection. InProceedings of the AAAI Conference on Artificial Intelligence, volume 24, pages 210–216, 2010

2010
[69]

Core Abstraction Prompting

Michael R. Zhang, Nikhil Desai, Joonho Bae, Jonathan Lorraine, and Jimmy Ba. Using large language models for hyperparameter optimization.arXiv preprint arXiv:2312.04528, 2023. 15 Appendix Contents A Experiment Enviroment 17 B More Related Work 17 C Theoretical Convergence Analysis 18 C.1 Problem formulation . . . . . . . . . . . . . . . . . . . . . . . . ...

arXiv 2023

[1] [1]

Adenso-Díaz and M

B. Adenso-Díaz and M. Laguna. Fine-tuning of algorithms using fractional experimental designs and local search.Operations Research, 54:99–114, 2006

2006

[2] [2]

Akiba, S

T. Akiba, S. Sano, T. Yanase, T. Ohta, and M. Koyama. Optuna: A next-generation hyper- parameter optimization framework. InProceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2019

2019

[3] [3]

A gender-based genetic algorithm for the automatic configuration of algorithms

Carlos Ansótegui, Meinolf Sellmann, and Kevin Tierney. A gender-based genetic algorithm for the automatic configuration of algorithms. InPrinciples and Practice of Constraint Program- ming (CP 2009), pages 142–157, 2009

2009

[4] [4]

Model-based genetic algorithms for algorithm configuration

Carlos Ansótegui, Youssef Malitsky, Horst Samulowitz, Meinolf Sellmann, and Kevin Tierney. Model-based genetic algorithms for algorithm configuration. InProceedings of the 24th International Joint Conference on Artificial Intelligence (IJCAI), 2015

2015

[5] [5]

Pydgga: Dis- tributed gga for automatic configuration

Carlos Ansótegui, Josep Pon Farreny, Meinolf Sellmann, and Kevin Tierney. Pydgga: Dis- tributed gga for automatic configuration. InProceedings of the 18th International Conference on Integration of Constraint Programming, Artificial Intelligence, and Operations Research (CPAIOR), pages 11–20, 2021

2021

[6] [6]

On the facets of the mixed-integer knapsack polyhedron.Mathematical Programming, 98(1):145–175, 2003

Alper Atamtürk. On the facets of the mixed-integer knapsack polyhedron.Mathematical Programming, 98(1):145–175, 2003

2003

[7] [7]

Feature based algorithm configuration: A case study with differential evolution

Nacim Belkhir, Johann Dréo, Pierre Savéant, and Marc Schoenauer. Feature based algorithm configuration: A case study with differential evolution. InParallel Problem Solving from Nature (PPSN XIV), pages 156–166. Springer, 2016

2016

[8] [8]

Bergstra, R

J. Bergstra, R. Bardenet, Y . Bengio, and B. Kégl. Algorithms for hyper-parameter optimization. InAdvances in Neural Information Processing Systems, volume 24, 2011

2011

[9] [9]

Schapire

Alina Beygelzimer, John Langford, Lihong Li, Lev Reyzin, and Robert E. Schapire. Contextual bandit algorithms with supervised learning guarantees. InInternational Conference on Artificial Intelligence and Statistics (AISTATS), pages 19–26, 2011

2011

[10] [10]

Robert E. Bixby. Implementing the simplex method: The initial basis.ORSA Journal on Computing, 4(3):267–284, 1992

1992

[11] [11]

Oxford University Press, 2013

Stéphane Boucheron, Gábor Lugosi, and Pascal Massart.Concentration Inequalities: A Nonasymptotic Theory of Independence. Oxford University Press, 2013

2013

[12] [12]

Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et al

Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D. Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et al. Language models are few-shot learners. InAdvances in Neural Information Processing Systems, volume 33, pages 1877–1901, 2020

1901

[13] [13]

Adam D. Bull. Convergence rates of efficient global optimization algorithms.Journal of Machine Learning Research, 12:2879–2904, 2011

2011

[14] [14]

Calandra, J

R. Calandra, J. Peters, C. E. Rasmussen, and M. P. Deisenroth. Manifold gaussian processes for regression. In2016 International Joint Conference on Neural Networks (IJCNN), pages 3338–3345. IEEE, 2016

2016

[15] [15]

Towards learning universal hyperparameter optimizers with transformers

Yutong Chen, Xingyou Song, Chris Lee, Ziyu Wang, Ruiyi Zhang, David Dohan, and Nando de Freitas. Towards learning universal hyperparameter optimizers with transformers. In Advances in Neural Information Processing Systems, volume 35, pages 32053–32068, 2022

2022

[16] [16]

Integrated production and outbound distribution scheduling: Review and extensions.Operations Research, 58(1):130–148, 2010

Zhi-Long Chen. Integrated production and outbound distribution scheduling: Review and extensions.Operations Research, 58(1):130–148, 2010

2010

[17] [17]

Hebo: Pushing the limits of sample-efficient hyperparameter optimisation.Journal of Artificial Intelligence Research, 74, 07 2022

Alexander Cowen-Rivers, Wenlong Lyu, Rasul Tutunov, Zhi Wang, Antoine Grosnit, Ryan-Rhys Griffiths, Alexandre Maravel, Jianye Hao, Jun Wang, Jan Peters, and Haitham Bou Ammar. Hebo: Pushing the limits of sample-efficient hyperparameter optimisation.Journal of Artificial Intelligence Research, 74, 07 2022. 11

2022

[18] [18]

Multi-objective bayesian optimization over high-dimensional search spaces

Samuel Daulton, David Eriksson, Maximilian Balandat, and Eytan Bakshy. Multi-objective bayesian optimization over high-dimensional search spaces. InProceedings of the Conference on Uncertainty in Artificial Intelligence (UAI), pages 507–517. PMLR, 2022

2022

[19] [19]

Scalable global optimization via local Bayesian optimization

David Eriksson, Michael Pearce, Jacob Gardner, Ryan D Turner, and Matthias Poloczek. Scalable global optimization via local Bayesian optimization. InAdvances in Neural Information Processing Systems (NeurIPS), 2019

2019

[20] [20]

Cambridge University Press, 2023

Roman Garnett.Bayesian Optimization. Cambridge University Press, 2023

2023

[21] [21]

Kazachkov, Elias Khalil, Pawel Lichocki, Andrea Lodi, Miles Lubin, Chris J

Maxime Gasse, Simon Bowly, Quentin Cappart, Jonas Charfreitag, Laurent Charlin, Didier Chételat, Antonia Chmiela, Justin Dumouchelle, Ambros Gleixner, Aleksandr M. Kazachkov, Elias Khalil, Pawel Lichocki, Andrea Lodi, Miles Lubin, Chris J. Maddison, Morris Christopher, Dimitri J. Papageorgiou, Augustin Parjadis, Sebastian Pokutta, Antoine Prouvost, Lara S...

2021

[22] [22]

Miplib 2017: Data-driven compilation of the 6th mixed-integer programming library.Mathematical Programming Computation, 13(3):443–490, 2021

Ambros Gleixner, Gregor Hendel, Gerald Gamrath, Tobias Achterberg, Michael Bastubbe, Timo Berthold, Philipp Christophel, Kati Jarck, Thorsten Koch, Jeff Linderoth, et al. Miplib 2017: Data-driven compilation of the 6th mixed-integer programming library.Mathematical Programming Computation, 13(3):443–490, 2021. doi: 10.1007/s12532-020-00194-3

work page doi:10.1007/s12532-020-00194-3 2017

[23] [23]

Gomes, Willem-Jan van Hoeve, and Ashish Sabharwal

Carla P. Gomes, Willem-Jan van Hoeve, and Ashish Sabharwal. Connections in networks: A hybrid approach. InInternational Conference on Integration of Artificial Intelligence (AI) and Operations Research (OR) Techniques in Constraint Programming, pages 303–307. Springer, 2008

2008

[24] [24]

Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning.arXiv preprint arXiv:2501.12948, 2025

Dan Guo, Dong Yang, Haoran Zhang, Jiaming Song, Rui Zhang, Runpeng Xu, Qingyang Zhu, Shuai Ma, Peng Wang, Xiaoyu Bi, and Xue Zhang. Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning.arXiv preprint arXiv:2501.12948, 2025

Pith/arXiv arXiv 2025

[25] [25]

URL https://www

Gurobi Optimization, LLC.Gurobi Optimizer Reference Manual, 2024. URL https://www. gurobi.com. Accessed: 2024-01-01

2024

[26] [26]

He He, Hal Daume III, and Jason M. Eisner. Learning to search in branch and bound algorithms. InAdvances in Neural Information Processing Systems, volume 27, 2014

2014

[27] [27]

Hebbal, L

A. Hebbal, L. Brevault, M. Balesdent, E.-G. Taibi, and N. Melab. Efficient global optimization using deep gaussian processes. In2018 IEEE Congress on Evolutionary Computation (CEC), pages 1–8. IEEE, 2018

2018

[28] [28]

Hoos, and Kevin Leyton-Brown

Frank Hutter, Youssef Hamadi, Holger H. Hoos, and Kevin Leyton-Brown. Performance prediction and automated tuning of randomized and parametric algorithms. InProceedings of the Twelfth International Conference on Principles and Practice of Constraint Programming (CP 2006), pages 213–228. Springer, 2006

2006

[29] [29]

Hoos, Kevin Leyton-Brown, and Thomas Stützle

Frank Hutter, Holger H. Hoos, Kevin Leyton-Brown, and Thomas Stützle. Paramils: An automatic algorithm configuration framework.Journal of Artificial Intelligence Research, 36: 267–306, 2009

2009

[30] [30]

Hoos, and Kevin Leyton-Brown

Frank Hutter, Holger H. Hoos, and Kevin Leyton-Brown. Automated configuration of mixed integer programming solvers. InIntegration of Artificial Intelligence (AI) and Operations Research (OR) Techniques in Constraint Programming, volume 6140 ofLecture Notes in Computer Science, pages 186–202, Berlin, Heidelberg, June 2010. Springer

2010

[31] [31]

Hoos, and Kevin Leyton-Brown

Frank Hutter, Holger H. Hoos, and Kevin Leyton-Brown. Sequential model-based optimization for general algorithm configuration. InInternational Conference on Learning and Intelligent Optimization, Berlin, Heidelberg, 2011. Springer. 12

2011

[32] [32]

Hoos, and Kevin Leyton-Brown

Frank Hutter, Lin Xu, Holger H. Hoos, and Kevin Leyton-Brown. Algorithm runtime prediction: Methods & evaluation.Artificial Intelligence, 206:79–111, 2014

2014

[33] [33]

Springer International Publishing, 2020

Gabriele Iommazzo, Claudia D’Ambrosio, Antonio Frangioni, and Leo Liberti.A Learning- Based Mathematical Programming Formulation for the Automatic Configuration of Op- timization Solvers, pages 700–712. Springer International Publishing, 2020. ISBN 9783030645830. doi: 10.1007/978-3-030-64583-0_61. URL http://dx.doi.org/10. 1007/978-3-030-64583-0_61

work page doi:10.1007/978-3-030-64583-0_61 2020

[34] [34]

D. R. Jones, M. Schonlau, and W. J. Welch. Efficient global optimization of expensive black-box functions.Journal of Global Optimization, 13:455–492, 1998

1998

[35] [35]

Liebling, Denis Naddef, George L

Michael Jünger, Thomas M. Liebling, Denis Naddef, George L. Nemhauser, William R. Pul- leyblank, Gerhard Reinelt, Giovanni Rinaldi, and Laurence A. Wolsey.50 Years of Integer Programming 1958–2008: From the Early Years to the State-of-the-Art. Springer Science & Business Media, 2009

1958

[36] [36]

Isac–instance-specific algorithm configuration

Serdar Kadioglu, Yury Malitsky, Meinolf Sellmann, and Kevin Tierney. Isac–instance-specific algorithm configuration. InECAI 2010, pages 751–756. IOS Press, 2010

2010

[37] [37]

Fast bayesian optimization of machine learning hyperparameters on large datasets

Aaron Klein, Stefan Falkner, Simon Bartels, Philipp Hennig, and Frank Hutter. Fast bayesian optimization of machine learning hyperparameters on large datasets. InProceedings of the 20th International Conference on Artificial Intelligence and Statistics (AISTATS), pages 528–536. PMLR, 2017

2017

[38] [38]

Bentyn, Dmitry Ignatov, and Radu Timofte

Roman Kochnev, Ali Tahan Goodarzi, Zofia A. Bentyn, Dmitry Ignatov, and Radu Timofte. Optuna vs code llama: Are llms a new paradigm for hyperparameter tuning?arXiv preprint arXiv:2504.06006, 2025

Pith/arXiv arXiv 2025

[39] [39]

H. Kushner. A new method of locating the maximum point of an arbitrary multipeak curve in the presence of noise.Journal of Basic Engineering, 86(1):97–106, 1964

1964

[40] [40]

Cambridge University Press, 2020

Tor Lattimore and Csaba Szepesvári.Bandit Algorithms. Cambridge University Press, 2020

2020

[41] [41]

Learning the empirical hardness of optimization problems: The case of combinatorial auctions

Kevin Leyton-Brown, Eugene Nudelman, and Yoav Shoham. Learning the empirical hardness of optimization problems: The case of combinatorial auctions. InProceedings of the 8th International Conference on Principles and Practice of Constraint Programming (CP 2002), pages 556–572. Springer, 2002

2002

[42] [42]

Hy- perband: A novel bandit-based approach to hyperparameter optimization.Journal of Machine Learning Research, 18(1):6765–6816, 2017

Lisha Li, Kevin Jamieson, Giulia DeSalvo, Afshin Rostamizadeh, and Ameet Talwalkar. Hy- perband: A novel bandit-based approach to hyperparameter optimization.Journal of Machine Learning Research, 18(1):6765–6816, 2017

2017

[43] [43]

Lindauer, K

M. Lindauer, K. Eggensperger, M. Feurer, A. Biedenkapp, D. Deng, C. Benjamins, T. Ruhkopf, R. Sass, and F. Hutter. Smac3: A versatile bayesian optimization package for hyperparameter optimization.Journal of Machine Learning Research, 23(1):2475–2483, 2022

2022

[44] [44]

L2p-mip: Learning to presolve for mixed integer programming

Chang Liu, Zhipeng Dong, Hao Ma, Wei Luo, Xiang Li, Bo Pang, and Junchi Yan. L2p-mip: Learning to presolve for mixed integer programming. InThe Twelfth International Conference on Learning Representations, 2024

2024

[45] [45]

Large language models to enhance bayesian optimization.arXiv preprint arXiv:2402.03921, 2024

Tennison Liu, Nicolás Astorga, Nabeel Seedat, and Mihaela van der Schaar. Large language models to enhance bayesian optimization.arXiv preprint arXiv:2402.03921, 2024

arXiv 2024

[46] [46]

The irace package: Iterated racing for automatic algorithm configuration

Manuel López-Ibáñez, Jérémie Dubois-Lacoste, Leslie Pérez Cáceres, Mauro Birattari, and Thomas Stützle. The irace package: Iterated racing for automatic algorithm configuration. Operations Research Perspectives, 3:43–58, 2016

2016

[47] [47]

Sequential large language model-based hyper- parameter optimization.arXiv preprint arXiv:2410.20302, 2024

Kamran Mahammadli and Seyda Ertekin. Sequential large language model-based hyper- parameter optimization.arXiv preprint arXiv:2410.20302, 2024

arXiv 2024

[48] [48]

The application of bayesian methods for seeking the extremum

Jonas Mockus. The application of bayesian methods for seeking the extremum. InTowards Global Optimization, volume 2, pages 117–129. North-Holland, 1998. 13

1998

[49] [49]

Müller, M

S. Müller, M. Feurer, N. Hollmann, and F. Hutter. Pfns4bo: In-context learning for bayesian optimization.arXiv preprint arXiv:2305.17535, 2023

arXiv 2023

[50] [50]

Solving mixed integer programs using neural networks.arXiv preprint arXiv:2012.13349, 2020

Vinod Nair, Sergey Bartunov, Felix Gimeno, Ingrid von Glehn, Pawel Lichocki, Ivan Lobov, Brendan O’Donoghue, et al. Solving mixed integer programs using neural networks.arXiv preprint arXiv:2012.13349, 2020

arXiv 2012

[51] [51]

Gpt-4 technical report

OpenAI. Gpt-4 technical report. Technical report, OpenAI, 2023

2023

[52] [52]

Beyond the heatmap: A rigorous evaluation of component impact in mcts-based tsp solvers.arXiv preprint arXiv:2411.09238, 2024

Xinkai Pan, Chenglong Wang, Chenyu Ying, Yuhui Xue, and Tian Yu. Beyond the heatmap: A rigorous evaluation of component impact in mcts-based tsp solvers.arXiv preprint arXiv:2411.09238, 2024

arXiv 2024

[53] [53]

Papageorgiou, George L

Dimitri J. Papageorgiou, George L. Nemhauser, Joel Sokol, Myun-Seok Cheon, and Ahmet B. Keha. Mirplib–a library of maritime inventory routing problem instances: Survey, core model, and benchmark results.European Journal of Operational Research, 235(2):350–366, 2014

2014

[54] [54]

Paschos.Applications of Combinatorial Optimization, volume 3

Vangelis Th. Paschos.Applications of Combinatorial Optimization, volume 3. John Wiley & Sons, 2014

2014

[55] [55]

Yury Pushak and Holger H. Hoos. Golden parameter search: Exploiting structure to quickly configure parameters in parallel. InProceedings of the Genetic and Evolutionary Computation Conference (GECCO), pages 245–253, 2020

2020

[56] [56]

In-context freeze-thaw Bayesian optimization for hyperparameter optimization

Herilalaina Rakotoarison, Steven Adriaensen, Neeratyoy Mallik, Samir Garibov, Edward Bergman, and Frank Hutter. In-context freeze-thaw Bayesian optimization for hyperparameter optimization. InInternational Conference on Machine Learning (ICML), 2024

2024

[57] [57]

M. C. Ramos, S. S. Michtavy, M. D. Porosoff, and A. D. White. Bayesian optimization of catalysts with in-context learning.arXiv preprint arXiv:2304.05341, 2023

arXiv 2023

[58] [58]

A survey of methods for automated algorithm configuration.Journal of Artificial Intelligence Research, 75:425–487, 2022

Elias Schede et al. A survey of methods for automated algorithm configuration.Journal of Artificial Intelligence Research, 75:425–487, 2022

2022

[59] [59]

Adams, and Nando De Freitas

Bobak Shahriari, Kevin Swersky, Ziyu Wang, Ryan P. Adams, and Nando De Freitas. Taking the human out of the loop: A review of bayesian optimization.Proceedings of the IEEE, 104 (1):148–175, 2015

2015

[60] [60]

Instance-specific algorithm configuration via unsupervised deep graph clustering.Engineering Applications of Artificial Intelligence, 125:106740, 2023

Wen Song, Yi Liu, Zhiguang Cao, Yaoxin Wu, and Qiqiang Li. Instance-specific algorithm configuration via unsupervised deep graph clustering.Engineering Applications of Artificial Intelligence, 125:106740, 2023. doi: 10.1016/j.engappai.2023.106740

work page doi:10.1016/j.engappai.2023.106740 2023

[61] [61]

T. P. Sorrell.Tuning Optimization Software Parameters for Mixed Integer Programming Problems. Doctoral dissertation, Virginia Commonwealth University, 2017

2017

[62] [62]

Large language model-enhanced multi-armed bandits.arXiv preprint arXiv:2502.01118, 2025

Jiaqi Sun, Zihang Wang, Runpeng Yang, Chenkai Xiao, John Lui, and Ziheng Dai. Large language model-enhanced multi-armed bandits.arXiv preprint arXiv:2502.01118, 2025

arXiv 2025

[63] [63]

Kevin Swersky, Jasper Snoek, and Ryan P. Adams. Multi-task bayesian optimization. In Advances in Neural Information Processing Systems, volume 26, 2013

2013

[64] [64]

Romeo Valentin, Claudio Ferrari, Jérémy Scheurer, Andisheh Amrollahi, Chris Wendler, and Max B. Paulus. Instance-wise algorithm configuration with graph neural networks.arXiv preprint arXiv:2202.04910, 2022

arXiv 2022

[65] [65]

A. G. Wilson, Z. Hu, R. Salakhutdinov, and E. P. Xing. Deep kernel learning. InProceedings of the 19th International Conference on Artificial Intelligence and Statistics, volume 51 of Proceedings of Machine Learning Research, pages 370–378, Cadiz, Spain, 2016. PMLR

2016

[66] [66]

Efficient heuristics generation for solving combinatorial optimization prob- lems using large language models

Xuan Wu et al. Efficient heuristics generation for solving combinatorial optimization prob- lems using large language models. InProceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining, volume 2, 2025

2025

[67] [67]

Hoos, and Kevin Leyton-Brown

Lin Xu, Frank Hutter, Holger H. Hoos, and Kevin Leyton-Brown. Satzilla: Portfolio-based algorithm selection for sat.Journal of Artificial Intelligence Research, 32:565–606, 2008. 14

2008

[68] [68]

Hydra: Automatically configuring algorithms for portfolio-based selection

Lin Xu, Holger Hoos, and Kevin Leyton-Brown. Hydra: Automatically configuring algorithms for portfolio-based selection. InProceedings of the AAAI Conference on Artificial Intelligence, volume 24, pages 210–216, 2010

2010

[69] [69]

Core Abstraction Prompting

Michael R. Zhang, Nikhil Desai, Joonho Bae, Jonathan Lorraine, and Jimmy Ba. Using large language models for hyperparameter optimization.arXiv preprint arXiv:2312.04528, 2023. 15 Appendix Contents A Experiment Enviroment 17 B More Related Work 17 C Theoretical Convergence Analysis 18 C.1 Problem formulation . . . . . . . . . . . . . . . . . . . . . . . . ...

arXiv 2023