Large Language Models for Operations Research: A Comprehensive Survey

Jianhao Li; Jun Fan; Wanquan Liu; Xianchao Xiu

arxiv: 2605.20849 · v1 · pith:ABBJG6UWnew · submitted 2026-05-20 · 🧮 math.OC

Large Language Models for Operations Research: A Comprehensive Survey

Xianchao Xiu , Jianhao Li , Jun Fan , Wanquan Liu This is my paper

Pith reviewed 2026-05-21 03:53 UTC · model grok-4.3

classification 🧮 math.OC

keywords large language modelsoperations researchmodel formulationalgorithm designsolution verificationbenchmark datasetsoptimization problems

0 comments

The pith

Large language models support operations research by aiding in problem formulation, algorithm design, and solution verification.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper provides a systematic review of how large language models can be applied to operations research problems. It details their potential contributions to formulating mathematical models, designing solution algorithms, and verifying outcomes, areas where traditional methods often require significant human expertise. The survey includes discussions of practical uses in various scenarios, available benchmark datasets for evaluation, and outlines key challenges along with suggestions for future work. Readers interested in decision support systems would find value in understanding these emerging capabilities that could make complex optimization more accessible.

Core claim

The central discovery is a comprehensive mapping of large language model applications in operations research, showing how they can handle model formulation for optimization tasks, support algorithm development for solving these models, and perform verification of the computed solutions, while also cataloging real-world applications, datasets, and open research questions.

What carries the argument

The structured roles of large language models in operations research, encompassing model formulation, algorithm design, and solution verification as the primary ways they augment traditional approaches.

If this is right

If large language models reliably formulate models, then experts could focus on higher-level decisions rather than routine setup.
Algorithm design assistance from these models may enable faster prototyping of solutions for large-scale problems.
Solution verification by language models could catch errors that manual checks might miss in complex scenarios.
Identified benchmark datasets would facilitate comparative studies and accelerate progress in the field.
Future directions outlined could guide research toward more robust integration of these models into decision-making pipelines.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Extending this survey, one might explore how these models perform on stochastic or robust optimization problems not covered in detail.
Hybrid approaches combining language models with exact solvers could be tested to improve both creativity and precision in solutions.
Implications for education in operations research include using these models as interactive tutors for problem modeling.

Load-bearing premise

The collected studies and described applications of large language models accurately represent their current abilities to manage the expert knowledge demands of traditional operations research problems.

What would settle it

Finding that large language models produce invalid or suboptimal formulations for standard benchmark problems in operations research, such as the traveling salesman problem described in plain language, would indicate the survey overestimates their practical utility.

read the original abstract

Operations Research (OR) serves as a core decision-support methodology for complex systems, with significant applications across mathematics, management science, and computer science. Traditional approaches heavily rely on expert knowledge and often struggle to efficiently solve large-scale and multi-constraint problems. The rapid advancement of Large Language Models (LLMs) in recent years has offered a novel research paradigm to address these challenges. This paper presents a systematic survey of Large Language Models for Operations Research (LLM4OR). We begin by introducing the definition of OR problems and the fundamental principles of LLMs. We then focus on analyzing the roles of LLMs in OR, specifically covering such as model formulation, algorithm design, and solution verification. In addition, we discuss practical applications in representative scenarios and summarize benchmark datasets in this field. Finally, we outline the key challenges and provide perspectives on future research directions. A collection of related literature is available at https://github.com/xianchaoxiu/LLM4OR.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This is a standard literature survey organizing existing work on LLMs for OR but adds no new methods, data, or analysis.

read the letter

This paper is a survey that pulls together existing research on applying large language models to operations research. It doesn't develop any new methods or run fresh experiments; it's all about reviewing and categorizing what's already out there. What stands out is the structure. It starts with basics on OR and LLMs, then breaks down the roles LLMs play in formulating models, designing algorithms, and verifying solutions. It also covers practical uses in different scenarios, lists some benchmark datasets, and ends with challenges and future directions. Having a GitHub repo with the references is helpful for anyone wanting to dig deeper. The main limitation is that the paper doesn't describe its own methodology in much detail. From the abstract, it claims a systematic approach, but there's no information on the search strategy, inclusion criteria, or how many papers were screened. That makes it harder to know if key works were missed or if the summary is skewed toward certain types of applications. For a survey claiming to be comprehensive, this is a noticeable gap. Overall, this is the kind of paper that could help someone entering the LLM4OR space get oriented quickly. It might appeal to OR practitioners curious about AI tools or to ML researchers looking for optimization problems to tackle. It won't replace deep dives into specific papers, but it could serve as a starting point. I think it merits sending out for peer review. A referee could verify the accuracy of the summaries and suggest additions to make the coverage stronger.

Referee Report

1 major / 1 minor

Summary. The manuscript presents a systematic survey of Large Language Models for Operations Research (LLM4OR). It introduces the definition of OR problems and fundamental principles of LLMs, analyzes LLM roles in model formulation, algorithm design, and solution verification, discusses practical applications in representative scenarios, summarizes benchmark datasets, outlines key challenges, and provides perspectives on future research directions, with an accompanying GitHub repository for related literature.

Significance. If the survey delivers thorough and accurate coverage of this emerging intersection, it would provide a useful consolidation of how LLMs can supplement expert-knowledge-heavy traditional OR methods for large-scale problems. The outlined structure, including benchmarks and challenges, positions the work to guide future research if the literature collection proves comprehensive.

major comments (1)

Abstract: The manuscript describes its contribution as a 'systematic survey' but supplies no information on the literature search methodology, including databases queried, keywords or search strings employed, time period covered, inclusion/exclusion criteria, or any completeness assessment. This omission is load-bearing because the central claim rests on the survey's coverage and accuracy in capturing LLM applications to OR.

minor comments (1)

The GitHub link for the literature collection is a positive step toward reproducibility; consider adding a brief description of its update policy or curation process in the main text.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for this constructive comment. We agree that explicitly documenting the literature search methodology is necessary to substantiate the 'systematic survey' claim and will revise the manuscript to address this.

read point-by-point responses

Referee: Abstract: The manuscript describes its contribution as a 'systematic survey' but supplies no information on the literature search methodology, including databases queried, keywords or search strings employed, time period covered, inclusion/exclusion criteria, or any completeness assessment. This omission is load-bearing because the central claim rests on the survey's coverage and accuracy in capturing LLM applications to OR.

Authors: We acknowledge the validity of this observation. The current version of the manuscript does not include a dedicated description of the literature collection process. In the revised manuscript we will add a new subsection (e.g., Section 1.3 or an appendix) that details: (1) databases queried (arXiv, Google Scholar, IEEE Xplore, and ACM Digital Library); (2) search strings such as (LLM OR “large language model”) AND (“operations research” OR optimization OR scheduling OR “integer programming”); (3) time period (primarily 2022–2024, reflecting the emergence of capable LLMs); (4) inclusion criteria (peer-reviewed papers, preprints, and workshop papers that apply LLMs to at least one core OR task—formulation, algorithm design, or verification); and (5) exclusion criteria (pure LLM capability papers without OR linkage). We will also note that completeness was cross-checked against the continuously updated GitHub repository. The abstract will be updated with a brief clause referencing this methodology. These additions will be placed early in the paper so readers can immediately assess coverage. revision: yes

Circularity Check

0 steps flagged

No significant circularity in this literature survey

full rationale

This paper is a systematic survey compiling and summarizing external literature on LLM applications to Operations Research. It contains no original derivations, equations, fitted parameters, predictions, or mathematical claims that could reduce to inputs by construction. The structure covers definitions, roles of LLMs, applications, benchmarks, challenges, and future directions, all drawn from cited prior works. No self-citation chains, ansatzes, or uniqueness theorems are invoked in a load-bearing manner; the validity rests on coverage and accurate summarization of independent sources rather than any internal self-referential logic. This is a standard literature review with no circularity patterns present.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The survey rests on standard background definitions of operations research and large language models drawn from prior literature, with no free parameters, ad-hoc axioms, or new entities introduced.

axioms (1)

domain assumption Operations Research serves as a core decision-support methodology relying on expert knowledge for complex systems.
Invoked in the opening definition of OR problems and traditional approaches.

pith-pipeline@v0.9.0 · 5699 in / 1062 out tokens · 42549 ms · 2026-05-21T03:53:02.504461+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

This paper presents a systematic survey of Large Language Models for Operations Research (LLM4OR) covering model formulation, algorithm design, solution verification...
IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Table 1 Comparison with other representative surveys... This Work covers Model Formulation, Algorithm Design, Solution Validation, Scenario Application, Benchmark Datasets

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

182 extracted references · 182 canonical work pages · 3 internal anchors

[1]

(ed.): Introduction to Operations Research, 9th edn

Hillier, F.S. (ed.): Introduction to Operations Research, 9th edn. McGrawHill, New York (2005)

work page 2005
[2]

In: Advances in Neural Information Processing Systems (2020)

Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J.D., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A.,et al.: Language models are few-shot learners. In: Advances in Neural Information Processing Systems (2020)

work page 2020
[3]

In: International Conference on Learning Representations (2023)

Wang, X., Wei, J., Schuurmans, D., Le, Q., Chi, E., Narang, S., Chowdhery, A., Zhou, D.: Self-consistency improves chain of thought reasoning in language models. In: International Conference on Learning Representations (2023)

work page 2023
[4]

In: Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop, pp

Ahn, J., Verma, R., Lou, R., Liu, D., Zhang, R., Yin, W.: Large language models for mathematical reasoning: Progresses and challenges. In: Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop, pp. 225–237 (2024)

work page 2024
[5]

IEEE Transactions on Evolutionary Computation (2026)

Li, Y., Wang, H., Jin, Y.: EvoSR-LLM: Evolutionary symbolic regression guided by large language models. IEEE Transactions on Evolutionary Computation (2026)

work page 2026
[6]

ACM Transactions on Software Engineering and Methodology35(2), 1–72 (2026) 23

Jiang, J., Wang, F., Shen, J., Kim, S., Kim, S.: A survey on large language models for code generation. ACM Transactions on Software Engineering and Methodology35(2), 1–72 (2026) 23

work page 2026
[7]

ACM Computing Surveys58(8), 1–32 (2026)

Liu, F., Yao, Y., Guo, P., Yang, Z., Lin, X., Zhao, Z., Tong, X., Mao, K., Lu, Z., Wang, Z.,et al.: A systematic survey on large language models for algorithm design. ACM Computing Surveys58(8), 1–32 (2026)

work page 2026
[8]

Journal of the American Statistical Association 121(553), 1–13 (2026)

Sun, M., Han, R., Jiang, B., Qi, H., Sun, D., Yuan, Y., Huang, J.: Lambda: A large model based data agent. Journal of the American Statistical Association 121(553), 1–13 (2026)

work page 2026
[9]

In: 2025 China Automation Congress (CAC), pp

Li, J., Xiu, X.: LLMM4FS: Leveraging large language models for feature selection and how to improve it. In: 2025 China Automation Congress (CAC), pp. 7297– 7302 (2025). IEEE

work page 2025
[10]

European Journal of Operational Research 332(1), 1–30 (2026)

Fan, Z., Ghaddar, B., Wang, X., Xing, L., Zhang, Y., Zhou, Z.: Artificial intelli- gence for optimization: Unleashing the potential of parameter generation, model formulation, and solution methods. European Journal of Operational Research 332(1), 1–30 (2026)

work page 2026
[11]

Swarm and Evolutionary Computation90, 101663 (2024)

Huang, S., Yang, K., Qi, S., Wang, R.: When large language model meets optimization. Swarm and Evolutionary Computation90, 101663 (2024)

work page 2024
[12]

IEEE Transactions on Evolutionary Computation29(2), 534–554 (2025)

Wu, X., Wu, S.-H., Wu, J., Feng, L., Tan, K.C.: Evolutionary computation in the era of large language model: Survey and roadmap. IEEE Transactions on Evolutionary Computation29(2), 534–554 (2025)

work page 2025
[13]

SCIENTIA SINICA Mathematica55(2), 451 (2025)

Guo, T., Li, A., Han, C.: Machine learning method for combinatorial optimiza- tion problems. SCIENTIA SINICA Mathematica55(2), 451 (2025)

work page 2025
[14]

arXiv preprint arXiv:2503.17726 (2025)

Forootani, A.: A survey on mathematical reasoning and optimization with large language models. arXiv preprint arXiv:2503.17726 (2025)

work page arXiv 2025
[15]

In: International Joint Conference on Artificial Intelligence, pp

Xiao, Z., Xie, J., Xu, L., Guan, S., Zhu, J., Han, X., Fu, X., Yu, W., Wu, H., Shi, W.,et al.: A survey of optimization modeling meets LLMs: Progress and future directions. In: International Joint Conference on Artificial Intelligence, pp. 10742–10750 (2025)

work page 2025
[16]

arXiv preprint arXiv:2509.08269 (2025)

Zhang, Y., Cheng, R., Yi, G., Tan, K.C.: A systematic survey on large language models for evolutionary optimization: From modeling to solving. arXiv preprint arXiv:2509.08269 (2025)

work page arXiv 2025
[17]

arXiv preprint arXiv:2509.18180 (2025)

Wang, Y., Li, K.: Large language models in operations research: Methods, applications, and challenges. arXiv preprint arXiv:2509.18180 (2025)

work page arXiv 2025
[18]

Springer, New York (2006)

Nocedal, J., Wright, S.J.: Numerical Optimization. Springer, New York (2006)

work page 2006
[19]

In: Advances in Neural Information Processing Systems (2017) 24

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., Polosukhin, I.: Attention is all you need. In: Advances in Neural Information Processing Systems (2017) 24

work page 2017
[20]

ACM Computing Surveys56(2), 1–40 (2023)

Min, B., Ross, H., Sulem, E., Veyseh, A.P.B., Nguyen, T.H., Sainz, O., Agirre, E., Heintz, I., Roth, D.: Recent advances in natural language processing via large pre-trained language models: A survey. ACM Computing Surveys56(2), 1–40 (2023)

work page 2023
[21]

Nature Mchine Intelligence5(3), 220–235 (2023)

Ding, N., Qin, Y., Yang, G., Wei, F., Yang, Z., Su, Y., Hu, S., Chen, Y., Chan, C.-M., Chen, W.,et al.: Parameter-efficient fine-tuning of large-scale pre-trained language models. Nature Mchine Intelligence5(3), 220–235 (2023)

work page 2023
[22]

In: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, pp

Dong, G., Yuan, H., Lu, K., Li, C., Xue, M., Liu, D., Wang, W., Yuan, Z., Zhou, C., Zhou, J.: How abilities in large language models are affected by supervised fine-tuning data composition. In: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, pp. 177–198 (2024)

work page 2024
[23]

Transactions on Machine Learning Research (2025)

Kaufmann, T., Weng, P., Bengs, V., H¨ ullermeier, E.: A survey of reinforcement learning from human feedback. Transactions on Machine Learning Research (2025)

work page 2025
[24]

Transactions on Machine Learning Research (2024)

Han, Z., Gao, C., Liu, J., Zhang, J., Zhang, S.Q.: Parameter-efficient fine-tuning for large models: A comprehensive survey. Transactions on Machine Learning Research (2024)

work page 2024
[25]

In: International Conference on Learning Representations (2022)

Hu, E.J., Shen, Y., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., Wang, L., Chen, W.,et al.: LoRA: Low-rank adaptation of large language models. In: International Conference on Learning Representations (2022)

work page 2022
[26]

In: Proceedings of the 59th Annual Meeting of the Association for Compu- tational Linguistics and the 11th International Joint Conference on Natural Language Processing, pp

Li, X.L., Liang, P.: Prefix-tuning: Optimizing continuous prompts for generation. In: Proceedings of the 59th Annual Meeting of the Association for Compu- tational Linguistics and the 11th International Joint Conference on Natural Language Processing, pp. 4582–4597 (2021)

work page 2021
[27]

In: International Conference on Learning Representations (2024)

Azerbayev, Z., Schoelkopf, H., Paster, K., Santos, M.D., McAleer, S.M., Jiang, A.Q., Deng, J., Biderman, S., Welleck, S.: Llemma: An open language model for mathematics. In: International Conference on Learning Representations (2024)

work page 2024
[28]

In: International Conference on Machine Learning (2025)

Yang, K., Poesia, G., He, J., Li, W., Lauter, K., Chaudhuri, S., Song, D.: Formal mathematical reasoning: A new frontier in AI. In: International Conference on Machine Learning (2025)

work page 2025
[29]

In: Conference on Empirical Methods in Natural Language Processing: Industry Track, pp

Ramamonjison, R., Li, H., Yu, T., He, S., Rengan, V., Banitalebi-Dehkordi, A., Zhou, Z., Zhang, Y.: Augmenting operations research with auto-formulation of optimization models from problem descriptions. In: Conference on Empirical Methods in Natural Language Processing: Industry Track, pp. 29–62 (2022)

work page 2022
[30]

In: Advances in Neural Information Processing Systems (2023)

Ramamonjison, R., Yu, T., Li, R., Li, H., Carenini, G., Ghaddar, B., He, S., Mostajabdaveh, M., Banitalebi-Dehkordi, A., Zhou, Z.,et al.: NL4Opt com- petition: Formulating optimization problems based on their natural language 25 descriptions. In: Advances in Neural Information Processing Systems (2023)

work page 2023
[31]

INFOR: Information Systems and Operational Research62(4), 559–572 (2024)

Ahmed, T., Choudhury, S.: LM4OPT: Unveiling the potential of large lan- guage models in formulating mathematical optimization problems. INFOR: Information Systems and Operational Research62(4), 559–572 (2024)

work page 2024
[32]

arXiv preprint arXiv:2501.00568 (2024)

Bertsimas, D., Margaritis, G.: Robust and adaptive optimization under a large language model lens. arXiv preprint arXiv:2501.00568 (2024)

work page arXiv 2024
[33]

In: International Conference on Machine Learning (2025)

Zhai, H., Lawless, C., Vitercik, E., Leqi, L.: EquivaMap: Leveraging LLMs for automatic equivalence checking of optimization formulations. In: International Conference on Machine Learning (2025)

work page 2025
[34]

In: International Conference on Learning Representations (2023)

Yao, S., Zhao, J., Yu, D., Du, N., Shafran, I., Narasimhan, K., Cao, Y.: React: Synergizing reasoning and acting in language models. In: International Conference on Learning Representations (2023)

work page 2023
[35]

In: Advances in Neural Information Processing Systems (2022)

Wei, J., Wang, X., Schuurmans, D., Bosma, M., Xia, F., Chi, E., Le, Q.V., Zhou, D.,et al.: Chain-of-Thought prompting elicits reasoning in large language models. In: Advances in Neural Information Processing Systems (2022)

work page 2022
[36]

In: Advances in Neural Information Processing Systems (2023)

Yao, S., Yu, D., Zhao, J., Shafran, I., Griffiths, T., Cao, Y., Narasimhan, K.: Tree of Thoughts: Deliberate problem solving with large language models. In: Advances in Neural Information Processing Systems (2023)

work page 2023
[37]

In: Proceed- ings of the AAAI Conference on Artificial Intelligence, vol

Besta, M., Blach, N., Kubicek, A., Gerstenberger, R., Podstawski, M., Giani- nazzi, L., Gajda, J., Lehmann, T., Niewiadomski, H., Nyczyk, P.,et al.: Graph of Thoughts: Solving elaborate problems with large language models. In: Proceed- ings of the AAAI Conference on Artificial Intelligence, vol. 38, pp. 17682–17690 (2024)

work page 2024
[38]

In: International Conference on Learning Representations (2024)

Xiao, Z., Zhang, D., Wu, Y., Xu, L., Wang, Y.J., Han, X., Fu, X., Zhong, T., Zeng, J., Song, M.,et al.: Chain-of-Experts: When LLMs meet com- plex operations research problems. In: International Conference on Learning Representations (2024)

work page 2024
[39]

In: Progress Toward the Holy Grail Workshop at CP2023 (2023)

Tsouros, D., Verhaeghe, H., Kadioglu, S., Guns, T.: Holy Grail 2.0: From natural language to constraint models. In: Progress Toward the Holy Grail Workshop at CP2023 (2023)

work page 2023
[40]

arXiv preprint arXiv:2409.04464 (2024)

Wang, T., Yu, W.-Y., She, R., Yang, W., Chen, T., Zhang, J.: Leverag- ing large language models for solving rare MIP challenges. arXiv preprint arXiv:2409.04464 (2024)

work page arXiv 2024
[41]

In: Advances in Neural Information Processing Systems (2025) 26

Liu, H., Wang, J., Cai, Y., Han, X., Kuang, Y., HAO, J.: OptiTree: Hierarchical thoughts generation with tree search for llm optimization modeling. In: Advances in Neural Information Processing Systems (2025) 26

work page 2025
[42]

In: International Conference on Machine Learning (2024)

AhmadiTeshnizi, A., Gao, W., Udell, M.: OptiMUS: Scalable optimization modeling with (MI)LP solvers and large language models. In: International Conference on Machine Learning (2024)

work page 2024
[43]

In: International Conference on Learning Representations (2025)

Hao, Y., Zhang, Y., Fan, C.: Planning anything with rigor: General-purpose zero-shot planning with LLM-based formalized programming. In: International Conference on Learning Representations (2025)

work page 2025
[44]

INFOR: Information Systems and Operational Research62(4), 599–617 (2024)

Mostajabdaveh, M., Yu, T.T., Ramamonjison, R., Carenini, G., Zhou, Z., Zhang, Y.: Optimization modeling and verification from problem specifications using a multi-agent multi-stage LLM framework. INFOR: Information Systems and Operational Research62(4), 599–617 (2024)

work page 2024
[45]

arXiv preprint arXiv:2504.16918 (2025)

Thind, R., Sun, Y., Liang, L., Yang, H.: OptimAI: Optimization from natu- ral language using LLM-powered AI Agents. arXiv preprint arXiv:2504.16918 (2025)

work page arXiv 2025
[46]

In: International Conference on Machine Learning (2025)

Astorga, N., Liu, T., Xiao, Y., Schaar, M.: Autoformulation of mathemati- cal optimization models using LLMs. In: International Conference on Machine Learning (2025)

work page 2025
[47]

In: Advances in Neural Information Processing Systems (2025)

Berto, F., Hua, C., Luttmann, L., Son, J., Park, J., Ahn, K., Kwon, C., Xie, L., Park, J.: PARCO: parallel autoregressive models for multi-agent combinatorial optimization. In: Advances in Neural Information Processing Systems (2025)

work page 2025
[48]

Proceedings of the Design Society5, 2201–2210 (2025)

Jiang, S., Xie, M., Luo, J.: Large language models for combinatorial optimiza- tion of design structure matrix. Proceedings of the Design Society5, 2201–2210 (2025)

work page 2025
[49]

arXiv preprint arXiv:2407.19633 (2024)

AhmadiTeshnizi, A., Gao, W., Brunborg, H., Talaei, S., Lawless, C., Udell, M.: OptiMUS-0.3: Using large language models to model and solve optimization problems at scale. arXiv preprint arXiv:2407.19633 (2024)

work page arXiv 2024
[50]

In: International Conference on Learning Representations (2025)

Jiang, X., Wu, Y., Zhang, C., Zhang, Y.: DRoC: Elevating large language mod- els for complex vehicle routing via decomposed retrieval of constraints. In: International Conference on Learning Representations (2025)

work page 2025
[51]

In: 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp

Peng, M., Chen, Z., Yang, J., Huang, J., Shi, Z., Liu, Q., Li, X., Gao, L.: Auto- matic MILP model construction for multi-robot task allocation and scheduling based on large language models. In: 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 20291–20296 (2025). IEEE

work page 2025
[52]

arXiv preprint arXiv:2601.09635 (2026)

Liang, K., Lu, Y., Mao, J., Sun, S., Yang, C., Zeng, C., Jin, X., Qin, H., Zhu, R., Teo, C.-P.: LLM for large-scale optimization model auto-formulation: A lightweight few-shot learning approach. arXiv preprint arXiv:2601.09635 (2026)

work page arXiv 2026
[53]

arXiv preprint arXiv:2309.13218 (2023)

Amarasinghe, P.T., Nguyen, S., Sun, Y., Alahakoon, D.: AI-Copilot for business 27 optimisation: A framework and a case study in production scheduling. arXiv preprint arXiv:2309.13218 (2023)

work page arXiv 2023
[54]

arXiv preprint arXiv:2311.15271 (2023)

Li, Q., Zhang, L., Mak-Hau, V.: Synthesizing mixed-integer linear programming models from natural language descriptions. arXiv preprint arXiv:2311.15271 (2023)

work page arXiv 2023
[55]

arXiv preprint arXiv:2405.01997 (2024)

Masoud, M., Abdelhay, A., Elhenawy, M.: Exploring combinatorial problem solv- ing with large language models: A case study on the travelling salesman problem using GPT-3.5 turbo. arXiv preprint arXiv:2405.01997 (2024)

work page arXiv 2024
[56]

In: International Conference on Learning Representations (2025)

Yang, Z., Wang, Y., Huang, Y., Guo, Z., Shi, W., Han, X., Feng, L., Song, L., Liang, X., Tang, J.: OptiBench meets ReSocratic: Measure and improve LLMs for optimization modeling. In: International Conference on Learning Representations (2025)

work page 2025
[57]

Operations Research (2025)

Huang, C., Tang, Z., Hu, S., Jiang, R., Zheng, X., Ge, D., Wang, B., Wang, Z.: ORLM: A customizable framework in training large models for automated optimization modeling. Operations Research (2025)

work page 2025
[58]

IEEE Transactions on Evolutionary Computation (2026)

Ma, Z., Gong, Y.-J., Guo, H., Chen, J., Ma, Y., Cao, Z., Zhang, J.: LLaMoCo: Instruction tuning of large language models for optimization code generation. IEEE Transactions on Evolutionary Computation (2026)

work page 2026
[59]

Advances in Neural Information Processing Systems (2026)

Jiang, X., Wu, Y., Li, M., Cao, Z., Zhang, Y.: Large language models as end-to-end combinatorial optimization solvers. Advances in Neural Information Processing Systems (2026)

work page 2026
[60]

In: International Conference on Learning Representations (2025)

Jiang, C., Shu, X., Qian, H., Lu, X., ZHOU, J., Zhou, A., Yu, Y.: LLMOPT: Learning to define and solve general optimization problems from scratch. In: International Conference on Learning Representations (2025)

work page 2025
[61]

KTO: Model Alignment as Prospect Theoretic Optimization

Ethayarajh, K., Xu, W., Muennighoff, N., Jurafsky, D., Kiela, D.: KTO: Model alignment as prospect theoretic optimization. arXiv preprint arXiv:2402.01306 (2024)

work page internal anchor Pith review Pith/arXiv arXiv 2024
[62]

arXiv preprint arXiv:2507.11737 (2025)

Zhou, C., Yang, J., Xin, L., Chen, Y., He, Z., Ge, D.: Auto-formulating dynamic programming problems with large language models. arXiv preprint arXiv:2507.11737 (2025)

work page arXiv 2025
[63]

In: Advances in Neural Information Processing Systems (2023)

Rafailov, R., Sharma, A., Mitchell, E., Manning, C.D., Ermon, S., Finn, C.: Direct preference optimization: Your language model is secretly a reward model. In: Advances in Neural Information Processing Systems (2023)

work page 2023
[64]

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Shao, Z., Wang, P., Zhu, Q., Xu, R., Song, J., Bi, X., Zhang, H., Zhang, M., Li, Y., Wu, Y., et al.: DeepSeekMath: Pushing the limits of mathematical reasoning in open language models. arXiv preprint arXiv:2402.03300 (2024) 28

work page internal anchor Pith review Pith/arXiv arXiv 2024
[65]

In: Proceedings of the AAAI Conference on Artificial Intelligence, vol

Ding, Z., Tan, Z., Zhang, J., Chen, T.: OR-R1: Automating modeling and solving of operations research optimization problem via test-time reinforcement learn- ing. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, pp. 228–236 (2026)

work page 2026
[66]

In: International Conference on Machine Learning (2025)

Lu, H., Xie, Z., Wu, Y., Ren, C., Chen, Y., Wen, Z.: OptMATH: A scalable bidi- rectional data synthesis framework for optimization modeling. In: International Conference on Machine Learning (2025)

work page 2025
[67]

In: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp

Zhang, J., Wang, W., Guo, S., Wang, L., Lin, F., Yang, C., Yin, W.: Solving general natural-language-description optimization problems with large language models. In: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 483–490 (2024)

work page 2024
[68]

Wu, Y., Zhang, Y., Wu, Y., Wang, Y., Zhang, J., Cheng, J.: Evo-Step: Evolutionary generation and stepwise validation for optimizing LLMs in OR (2025)

work page 2025
[69]

In: International Conference on Parallel Problem Solving from Nature, pp

Yao, Y., Liu, F., Cheng, J., Zhang, Q.: Evolve cost-aware acquisition functions using large language models. In: International Conference on Parallel Problem Solving from Nature, pp. 374–390 (2024). Springer

work page 2024
[70]

IEEE Transactions on Smart Grid (2026)

Lou, C., Jin, Z., Tang, W., Geng, G., Yang, J., Zhang, L.: Llm-enhanced multi- agent reinforcement learning with expert workflow for real-time p2p energy trading. IEEE Transactions on Smart Grid (2026)

work page 2026
[71]

In: Handbook of Evolutionary Machine Learning, pp

Lehman, J., Gordon, J., Jain, S., Ndousse, K., Yeh, C., Stanley, K.O.: Evolution through large models. In: Handbook of Evolutionary Machine Learning, pp. 331–366. Springer, Cham, Switzerland (2023)

work page 2023
[72]

IFAC Journal of Systems and Control, 100420 (2026)

Xu, H., Gao, J., Wen, J., Wang, W., Du, J.: Synergistic optimization and llm-informed decision-making for aero-engine rotor assembly. IFAC Journal of Systems and Control, 100420 (2026)

work page 2026
[73]

arXiv preprint arXiv:2504.19636 (2025)

Liu, F., Zhang, Q., Shi, J., Tong, X., Mao, K., Yuan, M.: Fitness landscape of large language model-assisted automated algorithm search. arXiv preprint arXiv:2504.19636 (2025)

work page arXiv 2025
[74]

ACM Computing Surveys 58(11), 1–53 (2026)

Da Ros, F., Soprano, M., Di Gaspero, L., Roitero, K.: Large language models for combinatorial optimization: A systematic review. ACM Computing Surveys 58(11), 1–53 (2026)

work page 2026
[75]

In: Advances in Neural Information Processing Systems (2023)

Nie, A., Cheng, C.-A., Kolobov, A., Swaminathan, A.: Importance of direc- tional feedback for LLM-based optimizers. In: Advances in Neural Information Processing Systems (2023)

work page 2023
[76]

In: 3rd International Joint Conference on Artificial Intelligence, pp

Wu, X., Zhong, Y., Wu, J., Jiang, B., Tan, K.C.: Large language model-enhanced 29 algorithm selection: Towards comprehensive algorithm representation. In: 3rd International Joint Conference on Artificial Intelligence, pp. 5235–5244 (2024)

work page 2024
[77]

arXiv preprint arXiv:2506.11057 (2025)

Li, X., Yang, J., Wang, J., Peng, B., Yao, J., Guan, H.: STRCMP: Integrating graph structural priors with language models for combinatorial optimization. arXiv preprint arXiv:2506.11057 (2025)

work page arXiv 2025
[78]

In: ACM SIGKDD Conference on Knowledge Discovery and Data Mining V

Wu, X., Wang, D., Wu, C., Wen, L., Miao, C., Xiao, Y., Zhou, Y.: Efficient heuristics generation for solving combinatorial optimization problems using large language models. In: ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 2, pp. 3228–3239 (2025)

work page 2025
[79]

arXiv preprint arXiv:2403.03962 (2024)

Mao, J., Zou, D., Sheng, L., Liu, S., Gao, C., Wang, Y., Li, Y.: Identify crit- ical nodes in complex network with large language models. arXiv preprint arXiv:2403.03962 (2024)

work page arXiv 2024
[80]

arXiv preprint arXiv:2410.17656 (2024)

Yu, H., Liu, J.: AutoRNet: Automatically optimizing heuristics for robust network design via large language models. arXiv preprint arXiv:2410.17656 (2024)

work page arXiv 2024

Showing first 80 references.

[1] [1]

(ed.): Introduction to Operations Research, 9th edn

Hillier, F.S. (ed.): Introduction to Operations Research, 9th edn. McGrawHill, New York (2005)

work page 2005

[2] [2]

In: Advances in Neural Information Processing Systems (2020)

Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J.D., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A.,et al.: Language models are few-shot learners. In: Advances in Neural Information Processing Systems (2020)

work page 2020

[3] [3]

In: International Conference on Learning Representations (2023)

Wang, X., Wei, J., Schuurmans, D., Le, Q., Chi, E., Narang, S., Chowdhery, A., Zhou, D.: Self-consistency improves chain of thought reasoning in language models. In: International Conference on Learning Representations (2023)

work page 2023

[4] [4]

In: Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop, pp

Ahn, J., Verma, R., Lou, R., Liu, D., Zhang, R., Yin, W.: Large language models for mathematical reasoning: Progresses and challenges. In: Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop, pp. 225–237 (2024)

work page 2024

[5] [5]

IEEE Transactions on Evolutionary Computation (2026)

Li, Y., Wang, H., Jin, Y.: EvoSR-LLM: Evolutionary symbolic regression guided by large language models. IEEE Transactions on Evolutionary Computation (2026)

work page 2026

[6] [6]

ACM Transactions on Software Engineering and Methodology35(2), 1–72 (2026) 23

Jiang, J., Wang, F., Shen, J., Kim, S., Kim, S.: A survey on large language models for code generation. ACM Transactions on Software Engineering and Methodology35(2), 1–72 (2026) 23

work page 2026

[7] [7]

ACM Computing Surveys58(8), 1–32 (2026)

Liu, F., Yao, Y., Guo, P., Yang, Z., Lin, X., Zhao, Z., Tong, X., Mao, K., Lu, Z., Wang, Z.,et al.: A systematic survey on large language models for algorithm design. ACM Computing Surveys58(8), 1–32 (2026)

work page 2026

[8] [8]

Journal of the American Statistical Association 121(553), 1–13 (2026)

Sun, M., Han, R., Jiang, B., Qi, H., Sun, D., Yuan, Y., Huang, J.: Lambda: A large model based data agent. Journal of the American Statistical Association 121(553), 1–13 (2026)

work page 2026

[9] [9]

In: 2025 China Automation Congress (CAC), pp

Li, J., Xiu, X.: LLMM4FS: Leveraging large language models for feature selection and how to improve it. In: 2025 China Automation Congress (CAC), pp. 7297– 7302 (2025). IEEE

work page 2025

[10] [10]

European Journal of Operational Research 332(1), 1–30 (2026)

Fan, Z., Ghaddar, B., Wang, X., Xing, L., Zhang, Y., Zhou, Z.: Artificial intelli- gence for optimization: Unleashing the potential of parameter generation, model formulation, and solution methods. European Journal of Operational Research 332(1), 1–30 (2026)

work page 2026

[11] [11]

Swarm and Evolutionary Computation90, 101663 (2024)

Huang, S., Yang, K., Qi, S., Wang, R.: When large language model meets optimization. Swarm and Evolutionary Computation90, 101663 (2024)

work page 2024

[12] [12]

IEEE Transactions on Evolutionary Computation29(2), 534–554 (2025)

Wu, X., Wu, S.-H., Wu, J., Feng, L., Tan, K.C.: Evolutionary computation in the era of large language model: Survey and roadmap. IEEE Transactions on Evolutionary Computation29(2), 534–554 (2025)

work page 2025

[13] [13]

SCIENTIA SINICA Mathematica55(2), 451 (2025)

Guo, T., Li, A., Han, C.: Machine learning method for combinatorial optimiza- tion problems. SCIENTIA SINICA Mathematica55(2), 451 (2025)

work page 2025

[14] [14]

arXiv preprint arXiv:2503.17726 (2025)

Forootani, A.: A survey on mathematical reasoning and optimization with large language models. arXiv preprint arXiv:2503.17726 (2025)

work page arXiv 2025

[15] [15]

In: International Joint Conference on Artificial Intelligence, pp

Xiao, Z., Xie, J., Xu, L., Guan, S., Zhu, J., Han, X., Fu, X., Yu, W., Wu, H., Shi, W.,et al.: A survey of optimization modeling meets LLMs: Progress and future directions. In: International Joint Conference on Artificial Intelligence, pp. 10742–10750 (2025)

work page 2025

[16] [16]

arXiv preprint arXiv:2509.08269 (2025)

Zhang, Y., Cheng, R., Yi, G., Tan, K.C.: A systematic survey on large language models for evolutionary optimization: From modeling to solving. arXiv preprint arXiv:2509.08269 (2025)

work page arXiv 2025

[17] [17]

arXiv preprint arXiv:2509.18180 (2025)

Wang, Y., Li, K.: Large language models in operations research: Methods, applications, and challenges. arXiv preprint arXiv:2509.18180 (2025)

work page arXiv 2025

[18] [18]

Springer, New York (2006)

Nocedal, J., Wright, S.J.: Numerical Optimization. Springer, New York (2006)

work page 2006

[19] [19]

In: Advances in Neural Information Processing Systems (2017) 24

Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., Polosukhin, I.: Attention is all you need. In: Advances in Neural Information Processing Systems (2017) 24

work page 2017

[20] [20]

ACM Computing Surveys56(2), 1–40 (2023)

Min, B., Ross, H., Sulem, E., Veyseh, A.P.B., Nguyen, T.H., Sainz, O., Agirre, E., Heintz, I., Roth, D.: Recent advances in natural language processing via large pre-trained language models: A survey. ACM Computing Surveys56(2), 1–40 (2023)

work page 2023

[21] [21]

Nature Mchine Intelligence5(3), 220–235 (2023)

Ding, N., Qin, Y., Yang, G., Wei, F., Yang, Z., Su, Y., Hu, S., Chen, Y., Chan, C.-M., Chen, W.,et al.: Parameter-efficient fine-tuning of large-scale pre-trained language models. Nature Mchine Intelligence5(3), 220–235 (2023)

work page 2023

[22] [22]

In: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, pp

Dong, G., Yuan, H., Lu, K., Li, C., Xue, M., Liu, D., Wang, W., Yuan, Z., Zhou, C., Zhou, J.: How abilities in large language models are affected by supervised fine-tuning data composition. In: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, pp. 177–198 (2024)

work page 2024

[23] [23]

Transactions on Machine Learning Research (2025)

Kaufmann, T., Weng, P., Bengs, V., H¨ ullermeier, E.: A survey of reinforcement learning from human feedback. Transactions on Machine Learning Research (2025)

work page 2025

[24] [24]

Transactions on Machine Learning Research (2024)

Han, Z., Gao, C., Liu, J., Zhang, J., Zhang, S.Q.: Parameter-efficient fine-tuning for large models: A comprehensive survey. Transactions on Machine Learning Research (2024)

work page 2024

[25] [25]

In: International Conference on Learning Representations (2022)

Hu, E.J., Shen, Y., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., Wang, L., Chen, W.,et al.: LoRA: Low-rank adaptation of large language models. In: International Conference on Learning Representations (2022)

work page 2022

[26] [26]

In: Proceedings of the 59th Annual Meeting of the Association for Compu- tational Linguistics and the 11th International Joint Conference on Natural Language Processing, pp

Li, X.L., Liang, P.: Prefix-tuning: Optimizing continuous prompts for generation. In: Proceedings of the 59th Annual Meeting of the Association for Compu- tational Linguistics and the 11th International Joint Conference on Natural Language Processing, pp. 4582–4597 (2021)

work page 2021

[27] [27]

In: International Conference on Learning Representations (2024)

Azerbayev, Z., Schoelkopf, H., Paster, K., Santos, M.D., McAleer, S.M., Jiang, A.Q., Deng, J., Biderman, S., Welleck, S.: Llemma: An open language model for mathematics. In: International Conference on Learning Representations (2024)

work page 2024

[28] [28]

In: International Conference on Machine Learning (2025)

Yang, K., Poesia, G., He, J., Li, W., Lauter, K., Chaudhuri, S., Song, D.: Formal mathematical reasoning: A new frontier in AI. In: International Conference on Machine Learning (2025)

work page 2025

[29] [29]

In: Conference on Empirical Methods in Natural Language Processing: Industry Track, pp

Ramamonjison, R., Li, H., Yu, T., He, S., Rengan, V., Banitalebi-Dehkordi, A., Zhou, Z., Zhang, Y.: Augmenting operations research with auto-formulation of optimization models from problem descriptions. In: Conference on Empirical Methods in Natural Language Processing: Industry Track, pp. 29–62 (2022)

work page 2022

[30] [30]

In: Advances in Neural Information Processing Systems (2023)

Ramamonjison, R., Yu, T., Li, R., Li, H., Carenini, G., Ghaddar, B., He, S., Mostajabdaveh, M., Banitalebi-Dehkordi, A., Zhou, Z.,et al.: NL4Opt com- petition: Formulating optimization problems based on their natural language 25 descriptions. In: Advances in Neural Information Processing Systems (2023)

work page 2023

[31] [31]

INFOR: Information Systems and Operational Research62(4), 559–572 (2024)

Ahmed, T., Choudhury, S.: LM4OPT: Unveiling the potential of large lan- guage models in formulating mathematical optimization problems. INFOR: Information Systems and Operational Research62(4), 559–572 (2024)

work page 2024

[32] [32]

arXiv preprint arXiv:2501.00568 (2024)

Bertsimas, D., Margaritis, G.: Robust and adaptive optimization under a large language model lens. arXiv preprint arXiv:2501.00568 (2024)

work page arXiv 2024

[33] [33]

In: International Conference on Machine Learning (2025)

Zhai, H., Lawless, C., Vitercik, E., Leqi, L.: EquivaMap: Leveraging LLMs for automatic equivalence checking of optimization formulations. In: International Conference on Machine Learning (2025)

work page 2025

[34] [34]

In: International Conference on Learning Representations (2023)

Yao, S., Zhao, J., Yu, D., Du, N., Shafran, I., Narasimhan, K., Cao, Y.: React: Synergizing reasoning and acting in language models. In: International Conference on Learning Representations (2023)

work page 2023

[35] [35]

In: Advances in Neural Information Processing Systems (2022)

Wei, J., Wang, X., Schuurmans, D., Bosma, M., Xia, F., Chi, E., Le, Q.V., Zhou, D.,et al.: Chain-of-Thought prompting elicits reasoning in large language models. In: Advances in Neural Information Processing Systems (2022)

work page 2022

[36] [36]

In: Advances in Neural Information Processing Systems (2023)

Yao, S., Yu, D., Zhao, J., Shafran, I., Griffiths, T., Cao, Y., Narasimhan, K.: Tree of Thoughts: Deliberate problem solving with large language models. In: Advances in Neural Information Processing Systems (2023)

work page 2023

[37] [37]

In: Proceed- ings of the AAAI Conference on Artificial Intelligence, vol

Besta, M., Blach, N., Kubicek, A., Gerstenberger, R., Podstawski, M., Giani- nazzi, L., Gajda, J., Lehmann, T., Niewiadomski, H., Nyczyk, P.,et al.: Graph of Thoughts: Solving elaborate problems with large language models. In: Proceed- ings of the AAAI Conference on Artificial Intelligence, vol. 38, pp. 17682–17690 (2024)

work page 2024

[38] [38]

In: International Conference on Learning Representations (2024)

Xiao, Z., Zhang, D., Wu, Y., Xu, L., Wang, Y.J., Han, X., Fu, X., Zhong, T., Zeng, J., Song, M.,et al.: Chain-of-Experts: When LLMs meet com- plex operations research problems. In: International Conference on Learning Representations (2024)

work page 2024

[39] [39]

In: Progress Toward the Holy Grail Workshop at CP2023 (2023)

Tsouros, D., Verhaeghe, H., Kadioglu, S., Guns, T.: Holy Grail 2.0: From natural language to constraint models. In: Progress Toward the Holy Grail Workshop at CP2023 (2023)

work page 2023

[40] [40]

arXiv preprint arXiv:2409.04464 (2024)

Wang, T., Yu, W.-Y., She, R., Yang, W., Chen, T., Zhang, J.: Leverag- ing large language models for solving rare MIP challenges. arXiv preprint arXiv:2409.04464 (2024)

work page arXiv 2024

[41] [41]

In: Advances in Neural Information Processing Systems (2025) 26

Liu, H., Wang, J., Cai, Y., Han, X., Kuang, Y., HAO, J.: OptiTree: Hierarchical thoughts generation with tree search for llm optimization modeling. In: Advances in Neural Information Processing Systems (2025) 26

work page 2025

[42] [42]

In: International Conference on Machine Learning (2024)

AhmadiTeshnizi, A., Gao, W., Udell, M.: OptiMUS: Scalable optimization modeling with (MI)LP solvers and large language models. In: International Conference on Machine Learning (2024)

work page 2024

[43] [43]

In: International Conference on Learning Representations (2025)

Hao, Y., Zhang, Y., Fan, C.: Planning anything with rigor: General-purpose zero-shot planning with LLM-based formalized programming. In: International Conference on Learning Representations (2025)

work page 2025

[44] [44]

INFOR: Information Systems and Operational Research62(4), 599–617 (2024)

Mostajabdaveh, M., Yu, T.T., Ramamonjison, R., Carenini, G., Zhou, Z., Zhang, Y.: Optimization modeling and verification from problem specifications using a multi-agent multi-stage LLM framework. INFOR: Information Systems and Operational Research62(4), 599–617 (2024)

work page 2024

[45] [45]

arXiv preprint arXiv:2504.16918 (2025)

Thind, R., Sun, Y., Liang, L., Yang, H.: OptimAI: Optimization from natu- ral language using LLM-powered AI Agents. arXiv preprint arXiv:2504.16918 (2025)

work page arXiv 2025

[46] [46]

In: International Conference on Machine Learning (2025)

Astorga, N., Liu, T., Xiao, Y., Schaar, M.: Autoformulation of mathemati- cal optimization models using LLMs. In: International Conference on Machine Learning (2025)

work page 2025

[47] [47]

In: Advances in Neural Information Processing Systems (2025)

Berto, F., Hua, C., Luttmann, L., Son, J., Park, J., Ahn, K., Kwon, C., Xie, L., Park, J.: PARCO: parallel autoregressive models for multi-agent combinatorial optimization. In: Advances in Neural Information Processing Systems (2025)

work page 2025

[48] [48]

Proceedings of the Design Society5, 2201–2210 (2025)

Jiang, S., Xie, M., Luo, J.: Large language models for combinatorial optimiza- tion of design structure matrix. Proceedings of the Design Society5, 2201–2210 (2025)

work page 2025

[49] [49]

arXiv preprint arXiv:2407.19633 (2024)

AhmadiTeshnizi, A., Gao, W., Brunborg, H., Talaei, S., Lawless, C., Udell, M.: OptiMUS-0.3: Using large language models to model and solve optimization problems at scale. arXiv preprint arXiv:2407.19633 (2024)

work page arXiv 2024

[50] [50]

In: International Conference on Learning Representations (2025)

Jiang, X., Wu, Y., Zhang, C., Zhang, Y.: DRoC: Elevating large language mod- els for complex vehicle routing via decomposed retrieval of constraints. In: International Conference on Learning Representations (2025)

work page 2025

[51] [51]

In: 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp

Peng, M., Chen, Z., Yang, J., Huang, J., Shi, Z., Liu, Q., Li, X., Gao, L.: Auto- matic MILP model construction for multi-robot task allocation and scheduling based on large language models. In: 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 20291–20296 (2025). IEEE

work page 2025

[52] [52]

arXiv preprint arXiv:2601.09635 (2026)

Liang, K., Lu, Y., Mao, J., Sun, S., Yang, C., Zeng, C., Jin, X., Qin, H., Zhu, R., Teo, C.-P.: LLM for large-scale optimization model auto-formulation: A lightweight few-shot learning approach. arXiv preprint arXiv:2601.09635 (2026)

work page arXiv 2026

[53] [53]

arXiv preprint arXiv:2309.13218 (2023)

Amarasinghe, P.T., Nguyen, S., Sun, Y., Alahakoon, D.: AI-Copilot for business 27 optimisation: A framework and a case study in production scheduling. arXiv preprint arXiv:2309.13218 (2023)

work page arXiv 2023

[54] [54]

arXiv preprint arXiv:2311.15271 (2023)

Li, Q., Zhang, L., Mak-Hau, V.: Synthesizing mixed-integer linear programming models from natural language descriptions. arXiv preprint arXiv:2311.15271 (2023)

work page arXiv 2023

[55] [55]

arXiv preprint arXiv:2405.01997 (2024)

Masoud, M., Abdelhay, A., Elhenawy, M.: Exploring combinatorial problem solv- ing with large language models: A case study on the travelling salesman problem using GPT-3.5 turbo. arXiv preprint arXiv:2405.01997 (2024)

work page arXiv 2024

[56] [56]

In: International Conference on Learning Representations (2025)

Yang, Z., Wang, Y., Huang, Y., Guo, Z., Shi, W., Han, X., Feng, L., Song, L., Liang, X., Tang, J.: OptiBench meets ReSocratic: Measure and improve LLMs for optimization modeling. In: International Conference on Learning Representations (2025)

work page 2025

[57] [57]

Operations Research (2025)

Huang, C., Tang, Z., Hu, S., Jiang, R., Zheng, X., Ge, D., Wang, B., Wang, Z.: ORLM: A customizable framework in training large models for automated optimization modeling. Operations Research (2025)

work page 2025

[58] [58]

IEEE Transactions on Evolutionary Computation (2026)

Ma, Z., Gong, Y.-J., Guo, H., Chen, J., Ma, Y., Cao, Z., Zhang, J.: LLaMoCo: Instruction tuning of large language models for optimization code generation. IEEE Transactions on Evolutionary Computation (2026)

work page 2026

[59] [59]

Advances in Neural Information Processing Systems (2026)

Jiang, X., Wu, Y., Li, M., Cao, Z., Zhang, Y.: Large language models as end-to-end combinatorial optimization solvers. Advances in Neural Information Processing Systems (2026)

work page 2026

[60] [60]

In: International Conference on Learning Representations (2025)

Jiang, C., Shu, X., Qian, H., Lu, X., ZHOU, J., Zhou, A., Yu, Y.: LLMOPT: Learning to define and solve general optimization problems from scratch. In: International Conference on Learning Representations (2025)

work page 2025

[61] [61]

KTO: Model Alignment as Prospect Theoretic Optimization

Ethayarajh, K., Xu, W., Muennighoff, N., Jurafsky, D., Kiela, D.: KTO: Model alignment as prospect theoretic optimization. arXiv preprint arXiv:2402.01306 (2024)

work page internal anchor Pith review Pith/arXiv arXiv 2024

[62] [62]

arXiv preprint arXiv:2507.11737 (2025)

Zhou, C., Yang, J., Xin, L., Chen, Y., He, Z., Ge, D.: Auto-formulating dynamic programming problems with large language models. arXiv preprint arXiv:2507.11737 (2025)

work page arXiv 2025

[63] [63]

In: Advances in Neural Information Processing Systems (2023)

Rafailov, R., Sharma, A., Mitchell, E., Manning, C.D., Ermon, S., Finn, C.: Direct preference optimization: Your language model is secretly a reward model. In: Advances in Neural Information Processing Systems (2023)

work page 2023

[64] [64]

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Shao, Z., Wang, P., Zhu, Q., Xu, R., Song, J., Bi, X., Zhang, H., Zhang, M., Li, Y., Wu, Y., et al.: DeepSeekMath: Pushing the limits of mathematical reasoning in open language models. arXiv preprint arXiv:2402.03300 (2024) 28

work page internal anchor Pith review Pith/arXiv arXiv 2024

[65] [65]

In: Proceedings of the AAAI Conference on Artificial Intelligence, vol

Ding, Z., Tan, Z., Zhang, J., Chen, T.: OR-R1: Automating modeling and solving of operations research optimization problem via test-time reinforcement learn- ing. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, pp. 228–236 (2026)

work page 2026

[66] [66]

In: International Conference on Machine Learning (2025)

Lu, H., Xie, Z., Wu, Y., Ren, C., Chen, Y., Wen, Z.: OptMATH: A scalable bidi- rectional data synthesis framework for optimization modeling. In: International Conference on Machine Learning (2025)

work page 2025

[67] [67]

In: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp

Zhang, J., Wang, W., Guo, S., Wang, L., Lin, F., Yang, C., Yin, W.: Solving general natural-language-description optimization problems with large language models. In: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 483–490 (2024)

work page 2024

[68] [68]

Wu, Y., Zhang, Y., Wu, Y., Wang, Y., Zhang, J., Cheng, J.: Evo-Step: Evolutionary generation and stepwise validation for optimizing LLMs in OR (2025)

work page 2025

[69] [69]

In: International Conference on Parallel Problem Solving from Nature, pp

Yao, Y., Liu, F., Cheng, J., Zhang, Q.: Evolve cost-aware acquisition functions using large language models. In: International Conference on Parallel Problem Solving from Nature, pp. 374–390 (2024). Springer

work page 2024

[70] [70]

IEEE Transactions on Smart Grid (2026)

Lou, C., Jin, Z., Tang, W., Geng, G., Yang, J., Zhang, L.: Llm-enhanced multi- agent reinforcement learning with expert workflow for real-time p2p energy trading. IEEE Transactions on Smart Grid (2026)

work page 2026

[71] [71]

In: Handbook of Evolutionary Machine Learning, pp

Lehman, J., Gordon, J., Jain, S., Ndousse, K., Yeh, C., Stanley, K.O.: Evolution through large models. In: Handbook of Evolutionary Machine Learning, pp. 331–366. Springer, Cham, Switzerland (2023)

work page 2023

[72] [72]

IFAC Journal of Systems and Control, 100420 (2026)

Xu, H., Gao, J., Wen, J., Wang, W., Du, J.: Synergistic optimization and llm-informed decision-making for aero-engine rotor assembly. IFAC Journal of Systems and Control, 100420 (2026)

work page 2026

[73] [73]

arXiv preprint arXiv:2504.19636 (2025)

Liu, F., Zhang, Q., Shi, J., Tong, X., Mao, K., Yuan, M.: Fitness landscape of large language model-assisted automated algorithm search. arXiv preprint arXiv:2504.19636 (2025)

work page arXiv 2025

[74] [74]

ACM Computing Surveys 58(11), 1–53 (2026)

Da Ros, F., Soprano, M., Di Gaspero, L., Roitero, K.: Large language models for combinatorial optimization: A systematic review. ACM Computing Surveys 58(11), 1–53 (2026)

work page 2026

[75] [75]

In: Advances in Neural Information Processing Systems (2023)

Nie, A., Cheng, C.-A., Kolobov, A., Swaminathan, A.: Importance of direc- tional feedback for LLM-based optimizers. In: Advances in Neural Information Processing Systems (2023)

work page 2023

[76] [76]

In: 3rd International Joint Conference on Artificial Intelligence, pp

Wu, X., Zhong, Y., Wu, J., Jiang, B., Tan, K.C.: Large language model-enhanced 29 algorithm selection: Towards comprehensive algorithm representation. In: 3rd International Joint Conference on Artificial Intelligence, pp. 5235–5244 (2024)

work page 2024

[77] [77]

arXiv preprint arXiv:2506.11057 (2025)

Li, X., Yang, J., Wang, J., Peng, B., Yao, J., Guan, H.: STRCMP: Integrating graph structural priors with language models for combinatorial optimization. arXiv preprint arXiv:2506.11057 (2025)

work page arXiv 2025

[78] [78]

In: ACM SIGKDD Conference on Knowledge Discovery and Data Mining V

Wu, X., Wang, D., Wu, C., Wen, L., Miao, C., Xiao, Y., Zhou, Y.: Efficient heuristics generation for solving combinatorial optimization problems using large language models. In: ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 2, pp. 3228–3239 (2025)

work page 2025

[79] [79]

arXiv preprint arXiv:2403.03962 (2024)

Mao, J., Zou, D., Sheng, L., Liu, S., Gao, C., Wang, Y., Li, Y.: Identify crit- ical nodes in complex network with large language models. arXiv preprint arXiv:2403.03962 (2024)

work page arXiv 2024

[80] [80]

arXiv preprint arXiv:2410.17656 (2024)

Yu, H., Liu, J.: AutoRNet: Automatically optimizing heuristics for robust network design via large language models. arXiv preprint arXiv:2410.17656 (2024)

work page arXiv 2024