pith. sign in

arxiv: 2605.20849 · v1 · pith:ABBJG6UWnew · submitted 2026-05-20 · 🧮 math.OC

Large Language Models for Operations Research: A Comprehensive Survey

Pith reviewed 2026-05-21 03:53 UTC · model grok-4.3

classification 🧮 math.OC
keywords large language modelsoperations researchmodel formulationalgorithm designsolution verificationbenchmark datasetsoptimization problems
0
0 comments X

The pith

Large language models support operations research by aiding in problem formulation, algorithm design, and solution verification.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper provides a systematic review of how large language models can be applied to operations research problems. It details their potential contributions to formulating mathematical models, designing solution algorithms, and verifying outcomes, areas where traditional methods often require significant human expertise. The survey includes discussions of practical uses in various scenarios, available benchmark datasets for evaluation, and outlines key challenges along with suggestions for future work. Readers interested in decision support systems would find value in understanding these emerging capabilities that could make complex optimization more accessible.

Core claim

The central discovery is a comprehensive mapping of large language model applications in operations research, showing how they can handle model formulation for optimization tasks, support algorithm development for solving these models, and perform verification of the computed solutions, while also cataloging real-world applications, datasets, and open research questions.

What carries the argument

The structured roles of large language models in operations research, encompassing model formulation, algorithm design, and solution verification as the primary ways they augment traditional approaches.

If this is right

  • If large language models reliably formulate models, then experts could focus on higher-level decisions rather than routine setup.
  • Algorithm design assistance from these models may enable faster prototyping of solutions for large-scale problems.
  • Solution verification by language models could catch errors that manual checks might miss in complex scenarios.
  • Identified benchmark datasets would facilitate comparative studies and accelerate progress in the field.
  • Future directions outlined could guide research toward more robust integration of these models into decision-making pipelines.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Extending this survey, one might explore how these models perform on stochastic or robust optimization problems not covered in detail.
  • Hybrid approaches combining language models with exact solvers could be tested to improve both creativity and precision in solutions.
  • Implications for education in operations research include using these models as interactive tutors for problem modeling.

Load-bearing premise

The collected studies and described applications of large language models accurately represent their current abilities to manage the expert knowledge demands of traditional operations research problems.

What would settle it

Finding that large language models produce invalid or suboptimal formulations for standard benchmark problems in operations research, such as the traveling salesman problem described in plain language, would indicate the survey overestimates their practical utility.

read the original abstract

Operations Research (OR) serves as a core decision-support methodology for complex systems, with significant applications across mathematics, management science, and computer science. Traditional approaches heavily rely on expert knowledge and often struggle to efficiently solve large-scale and multi-constraint problems. The rapid advancement of Large Language Models (LLMs) in recent years has offered a novel research paradigm to address these challenges. This paper presents a systematic survey of Large Language Models for Operations Research (LLM4OR). We begin by introducing the definition of OR problems and the fundamental principles of LLMs. We then focus on analyzing the roles of LLMs in OR, specifically covering such as model formulation, algorithm design, and solution verification. In addition, we discuss practical applications in representative scenarios and summarize benchmark datasets in this field. Finally, we outline the key challenges and provide perspectives on future research directions. A collection of related literature is available at https://github.com/xianchaoxiu/LLM4OR.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 1 minor

Summary. The manuscript presents a systematic survey of Large Language Models for Operations Research (LLM4OR). It introduces the definition of OR problems and fundamental principles of LLMs, analyzes LLM roles in model formulation, algorithm design, and solution verification, discusses practical applications in representative scenarios, summarizes benchmark datasets, outlines key challenges, and provides perspectives on future research directions, with an accompanying GitHub repository for related literature.

Significance. If the survey delivers thorough and accurate coverage of this emerging intersection, it would provide a useful consolidation of how LLMs can supplement expert-knowledge-heavy traditional OR methods for large-scale problems. The outlined structure, including benchmarks and challenges, positions the work to guide future research if the literature collection proves comprehensive.

major comments (1)
  1. Abstract: The manuscript describes its contribution as a 'systematic survey' but supplies no information on the literature search methodology, including databases queried, keywords or search strings employed, time period covered, inclusion/exclusion criteria, or any completeness assessment. This omission is load-bearing because the central claim rests on the survey's coverage and accuracy in capturing LLM applications to OR.
minor comments (1)
  1. The GitHub link for the literature collection is a positive step toward reproducibility; consider adding a brief description of its update policy or curation process in the main text.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for this constructive comment. We agree that explicitly documenting the literature search methodology is necessary to substantiate the 'systematic survey' claim and will revise the manuscript to address this.

read point-by-point responses
  1. Referee: Abstract: The manuscript describes its contribution as a 'systematic survey' but supplies no information on the literature search methodology, including databases queried, keywords or search strings employed, time period covered, inclusion/exclusion criteria, or any completeness assessment. This omission is load-bearing because the central claim rests on the survey's coverage and accuracy in capturing LLM applications to OR.

    Authors: We acknowledge the validity of this observation. The current version of the manuscript does not include a dedicated description of the literature collection process. In the revised manuscript we will add a new subsection (e.g., Section 1.3 or an appendix) that details: (1) databases queried (arXiv, Google Scholar, IEEE Xplore, and ACM Digital Library); (2) search strings such as (LLM OR “large language model”) AND (“operations research” OR optimization OR scheduling OR “integer programming”); (3) time period (primarily 2022–2024, reflecting the emergence of capable LLMs); (4) inclusion criteria (peer-reviewed papers, preprints, and workshop papers that apply LLMs to at least one core OR task—formulation, algorithm design, or verification); and (5) exclusion criteria (pure LLM capability papers without OR linkage). We will also note that completeness was cross-checked against the continuously updated GitHub repository. The abstract will be updated with a brief clause referencing this methodology. These additions will be placed early in the paper so readers can immediately assess coverage. revision: yes

Circularity Check

0 steps flagged

No significant circularity in this literature survey

full rationale

This paper is a systematic survey compiling and summarizing external literature on LLM applications to Operations Research. It contains no original derivations, equations, fitted parameters, predictions, or mathematical claims that could reduce to inputs by construction. The structure covers definitions, roles of LLMs, applications, benchmarks, challenges, and future directions, all drawn from cited prior works. No self-citation chains, ansatzes, or uniqueness theorems are invoked in a load-bearing manner; the validity rests on coverage and accurate summarization of independent sources rather than any internal self-referential logic. This is a standard literature review with no circularity patterns present.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The survey rests on standard background definitions of operations research and large language models drawn from prior literature, with no free parameters, ad-hoc axioms, or new entities introduced.

axioms (1)
  • domain assumption Operations Research serves as a core decision-support methodology relying on expert knowledge for complex systems.
    Invoked in the opening definition of OR problems and traditional approaches.

pith-pipeline@v0.9.0 · 5699 in / 1062 out tokens · 42549 ms · 2026-05-21T03:53:02.504461+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

182 extracted references · 182 canonical work pages · 3 internal anchors

  1. [1]

    (ed.): Introduction to Operations Research, 9th edn

    Hillier, F.S. (ed.): Introduction to Operations Research, 9th edn. McGrawHill, New York (2005)

  2. [2]

    In: Advances in Neural Information Processing Systems (2020)

    Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J.D., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A.,et al.: Language models are few-shot learners. In: Advances in Neural Information Processing Systems (2020)

  3. [3]

    In: International Conference on Learning Representations (2023)

    Wang, X., Wei, J., Schuurmans, D., Le, Q., Chi, E., Narang, S., Chowdhery, A., Zhou, D.: Self-consistency improves chain of thought reasoning in language models. In: International Conference on Learning Representations (2023)

  4. [4]

    In: Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop, pp

    Ahn, J., Verma, R., Lou, R., Liu, D., Zhang, R., Yin, W.: Large language models for mathematical reasoning: Progresses and challenges. In: Conference of the European Chapter of the Association for Computational Linguistics: Student Research Workshop, pp. 225–237 (2024)

  5. [5]

    IEEE Transactions on Evolutionary Computation (2026)

    Li, Y., Wang, H., Jin, Y.: EvoSR-LLM: Evolutionary symbolic regression guided by large language models. IEEE Transactions on Evolutionary Computation (2026)

  6. [6]

    ACM Transactions on Software Engineering and Methodology35(2), 1–72 (2026) 23

    Jiang, J., Wang, F., Shen, J., Kim, S., Kim, S.: A survey on large language models for code generation. ACM Transactions on Software Engineering and Methodology35(2), 1–72 (2026) 23

  7. [7]

    ACM Computing Surveys58(8), 1–32 (2026)

    Liu, F., Yao, Y., Guo, P., Yang, Z., Lin, X., Zhao, Z., Tong, X., Mao, K., Lu, Z., Wang, Z.,et al.: A systematic survey on large language models for algorithm design. ACM Computing Surveys58(8), 1–32 (2026)

  8. [8]

    Journal of the American Statistical Association 121(553), 1–13 (2026)

    Sun, M., Han, R., Jiang, B., Qi, H., Sun, D., Yuan, Y., Huang, J.: Lambda: A large model based data agent. Journal of the American Statistical Association 121(553), 1–13 (2026)

  9. [9]

    In: 2025 China Automation Congress (CAC), pp

    Li, J., Xiu, X.: LLMM4FS: Leveraging large language models for feature selection and how to improve it. In: 2025 China Automation Congress (CAC), pp. 7297– 7302 (2025). IEEE

  10. [10]

    European Journal of Operational Research 332(1), 1–30 (2026)

    Fan, Z., Ghaddar, B., Wang, X., Xing, L., Zhang, Y., Zhou, Z.: Artificial intelli- gence for optimization: Unleashing the potential of parameter generation, model formulation, and solution methods. European Journal of Operational Research 332(1), 1–30 (2026)

  11. [11]

    Swarm and Evolutionary Computation90, 101663 (2024)

    Huang, S., Yang, K., Qi, S., Wang, R.: When large language model meets optimization. Swarm and Evolutionary Computation90, 101663 (2024)

  12. [12]

    IEEE Transactions on Evolutionary Computation29(2), 534–554 (2025)

    Wu, X., Wu, S.-H., Wu, J., Feng, L., Tan, K.C.: Evolutionary computation in the era of large language model: Survey and roadmap. IEEE Transactions on Evolutionary Computation29(2), 534–554 (2025)

  13. [13]

    SCIENTIA SINICA Mathematica55(2), 451 (2025)

    Guo, T., Li, A., Han, C.: Machine learning method for combinatorial optimiza- tion problems. SCIENTIA SINICA Mathematica55(2), 451 (2025)

  14. [14]

    arXiv preprint arXiv:2503.17726 (2025)

    Forootani, A.: A survey on mathematical reasoning and optimization with large language models. arXiv preprint arXiv:2503.17726 (2025)

  15. [15]

    In: International Joint Conference on Artificial Intelligence, pp

    Xiao, Z., Xie, J., Xu, L., Guan, S., Zhu, J., Han, X., Fu, X., Yu, W., Wu, H., Shi, W.,et al.: A survey of optimization modeling meets LLMs: Progress and future directions. In: International Joint Conference on Artificial Intelligence, pp. 10742–10750 (2025)

  16. [16]

    arXiv preprint arXiv:2509.08269 (2025)

    Zhang, Y., Cheng, R., Yi, G., Tan, K.C.: A systematic survey on large language models for evolutionary optimization: From modeling to solving. arXiv preprint arXiv:2509.08269 (2025)

  17. [17]

    arXiv preprint arXiv:2509.18180 (2025)

    Wang, Y., Li, K.: Large language models in operations research: Methods, applications, and challenges. arXiv preprint arXiv:2509.18180 (2025)

  18. [18]

    Springer, New York (2006)

    Nocedal, J., Wright, S.J.: Numerical Optimization. Springer, New York (2006)

  19. [19]

    In: Advances in Neural Information Processing Systems (2017) 24

    Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, L., Polosukhin, I.: Attention is all you need. In: Advances in Neural Information Processing Systems (2017) 24

  20. [20]

    ACM Computing Surveys56(2), 1–40 (2023)

    Min, B., Ross, H., Sulem, E., Veyseh, A.P.B., Nguyen, T.H., Sainz, O., Agirre, E., Heintz, I., Roth, D.: Recent advances in natural language processing via large pre-trained language models: A survey. ACM Computing Surveys56(2), 1–40 (2023)

  21. [21]

    Nature Mchine Intelligence5(3), 220–235 (2023)

    Ding, N., Qin, Y., Yang, G., Wei, F., Yang, Z., Su, Y., Hu, S., Chen, Y., Chan, C.-M., Chen, W.,et al.: Parameter-efficient fine-tuning of large-scale pre-trained language models. Nature Mchine Intelligence5(3), 220–235 (2023)

  22. [22]

    In: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, pp

    Dong, G., Yuan, H., Lu, K., Li, C., Xue, M., Liu, D., Wang, W., Yuan, Z., Zhou, C., Zhou, J.: How abilities in large language models are affected by supervised fine-tuning data composition. In: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics, pp. 177–198 (2024)

  23. [23]

    Transactions on Machine Learning Research (2025)

    Kaufmann, T., Weng, P., Bengs, V., H¨ ullermeier, E.: A survey of reinforcement learning from human feedback. Transactions on Machine Learning Research (2025)

  24. [24]

    Transactions on Machine Learning Research (2024)

    Han, Z., Gao, C., Liu, J., Zhang, J., Zhang, S.Q.: Parameter-efficient fine-tuning for large models: A comprehensive survey. Transactions on Machine Learning Research (2024)

  25. [25]

    In: International Conference on Learning Representations (2022)

    Hu, E.J., Shen, Y., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., Wang, L., Chen, W.,et al.: LoRA: Low-rank adaptation of large language models. In: International Conference on Learning Representations (2022)

  26. [26]

    In: Proceedings of the 59th Annual Meeting of the Association for Compu- tational Linguistics and the 11th International Joint Conference on Natural Language Processing, pp

    Li, X.L., Liang, P.: Prefix-tuning: Optimizing continuous prompts for generation. In: Proceedings of the 59th Annual Meeting of the Association for Compu- tational Linguistics and the 11th International Joint Conference on Natural Language Processing, pp. 4582–4597 (2021)

  27. [27]

    In: International Conference on Learning Representations (2024)

    Azerbayev, Z., Schoelkopf, H., Paster, K., Santos, M.D., McAleer, S.M., Jiang, A.Q., Deng, J., Biderman, S., Welleck, S.: Llemma: An open language model for mathematics. In: International Conference on Learning Representations (2024)

  28. [28]

    In: International Conference on Machine Learning (2025)

    Yang, K., Poesia, G., He, J., Li, W., Lauter, K., Chaudhuri, S., Song, D.: Formal mathematical reasoning: A new frontier in AI. In: International Conference on Machine Learning (2025)

  29. [29]

    In: Conference on Empirical Methods in Natural Language Processing: Industry Track, pp

    Ramamonjison, R., Li, H., Yu, T., He, S., Rengan, V., Banitalebi-Dehkordi, A., Zhou, Z., Zhang, Y.: Augmenting operations research with auto-formulation of optimization models from problem descriptions. In: Conference on Empirical Methods in Natural Language Processing: Industry Track, pp. 29–62 (2022)

  30. [30]

    In: Advances in Neural Information Processing Systems (2023)

    Ramamonjison, R., Yu, T., Li, R., Li, H., Carenini, G., Ghaddar, B., He, S., Mostajabdaveh, M., Banitalebi-Dehkordi, A., Zhou, Z.,et al.: NL4Opt com- petition: Formulating optimization problems based on their natural language 25 descriptions. In: Advances in Neural Information Processing Systems (2023)

  31. [31]

    INFOR: Information Systems and Operational Research62(4), 559–572 (2024)

    Ahmed, T., Choudhury, S.: LM4OPT: Unveiling the potential of large lan- guage models in formulating mathematical optimization problems. INFOR: Information Systems and Operational Research62(4), 559–572 (2024)

  32. [32]

    arXiv preprint arXiv:2501.00568 (2024)

    Bertsimas, D., Margaritis, G.: Robust and adaptive optimization under a large language model lens. arXiv preprint arXiv:2501.00568 (2024)

  33. [33]

    In: International Conference on Machine Learning (2025)

    Zhai, H., Lawless, C., Vitercik, E., Leqi, L.: EquivaMap: Leveraging LLMs for automatic equivalence checking of optimization formulations. In: International Conference on Machine Learning (2025)

  34. [34]

    In: International Conference on Learning Representations (2023)

    Yao, S., Zhao, J., Yu, D., Du, N., Shafran, I., Narasimhan, K., Cao, Y.: React: Synergizing reasoning and acting in language models. In: International Conference on Learning Representations (2023)

  35. [35]

    In: Advances in Neural Information Processing Systems (2022)

    Wei, J., Wang, X., Schuurmans, D., Bosma, M., Xia, F., Chi, E., Le, Q.V., Zhou, D.,et al.: Chain-of-Thought prompting elicits reasoning in large language models. In: Advances in Neural Information Processing Systems (2022)

  36. [36]

    In: Advances in Neural Information Processing Systems (2023)

    Yao, S., Yu, D., Zhao, J., Shafran, I., Griffiths, T., Cao, Y., Narasimhan, K.: Tree of Thoughts: Deliberate problem solving with large language models. In: Advances in Neural Information Processing Systems (2023)

  37. [37]

    In: Proceed- ings of the AAAI Conference on Artificial Intelligence, vol

    Besta, M., Blach, N., Kubicek, A., Gerstenberger, R., Podstawski, M., Giani- nazzi, L., Gajda, J., Lehmann, T., Niewiadomski, H., Nyczyk, P.,et al.: Graph of Thoughts: Solving elaborate problems with large language models. In: Proceed- ings of the AAAI Conference on Artificial Intelligence, vol. 38, pp. 17682–17690 (2024)

  38. [38]

    In: International Conference on Learning Representations (2024)

    Xiao, Z., Zhang, D., Wu, Y., Xu, L., Wang, Y.J., Han, X., Fu, X., Zhong, T., Zeng, J., Song, M.,et al.: Chain-of-Experts: When LLMs meet com- plex operations research problems. In: International Conference on Learning Representations (2024)

  39. [39]

    In: Progress Toward the Holy Grail Workshop at CP2023 (2023)

    Tsouros, D., Verhaeghe, H., Kadioglu, S., Guns, T.: Holy Grail 2.0: From natural language to constraint models. In: Progress Toward the Holy Grail Workshop at CP2023 (2023)

  40. [40]

    arXiv preprint arXiv:2409.04464 (2024)

    Wang, T., Yu, W.-Y., She, R., Yang, W., Chen, T., Zhang, J.: Leverag- ing large language models for solving rare MIP challenges. arXiv preprint arXiv:2409.04464 (2024)

  41. [41]

    In: Advances in Neural Information Processing Systems (2025) 26

    Liu, H., Wang, J., Cai, Y., Han, X., Kuang, Y., HAO, J.: OptiTree: Hierarchical thoughts generation with tree search for llm optimization modeling. In: Advances in Neural Information Processing Systems (2025) 26

  42. [42]

    In: International Conference on Machine Learning (2024)

    AhmadiTeshnizi, A., Gao, W., Udell, M.: OptiMUS: Scalable optimization modeling with (MI)LP solvers and large language models. In: International Conference on Machine Learning (2024)

  43. [43]

    In: International Conference on Learning Representations (2025)

    Hao, Y., Zhang, Y., Fan, C.: Planning anything with rigor: General-purpose zero-shot planning with LLM-based formalized programming. In: International Conference on Learning Representations (2025)

  44. [44]

    INFOR: Information Systems and Operational Research62(4), 599–617 (2024)

    Mostajabdaveh, M., Yu, T.T., Ramamonjison, R., Carenini, G., Zhou, Z., Zhang, Y.: Optimization modeling and verification from problem specifications using a multi-agent multi-stage LLM framework. INFOR: Information Systems and Operational Research62(4), 599–617 (2024)

  45. [45]

    arXiv preprint arXiv:2504.16918 (2025)

    Thind, R., Sun, Y., Liang, L., Yang, H.: OptimAI: Optimization from natu- ral language using LLM-powered AI Agents. arXiv preprint arXiv:2504.16918 (2025)

  46. [46]

    In: International Conference on Machine Learning (2025)

    Astorga, N., Liu, T., Xiao, Y., Schaar, M.: Autoformulation of mathemati- cal optimization models using LLMs. In: International Conference on Machine Learning (2025)

  47. [47]

    In: Advances in Neural Information Processing Systems (2025)

    Berto, F., Hua, C., Luttmann, L., Son, J., Park, J., Ahn, K., Kwon, C., Xie, L., Park, J.: PARCO: parallel autoregressive models for multi-agent combinatorial optimization. In: Advances in Neural Information Processing Systems (2025)

  48. [48]

    Proceedings of the Design Society5, 2201–2210 (2025)

    Jiang, S., Xie, M., Luo, J.: Large language models for combinatorial optimiza- tion of design structure matrix. Proceedings of the Design Society5, 2201–2210 (2025)

  49. [49]

    arXiv preprint arXiv:2407.19633 (2024)

    AhmadiTeshnizi, A., Gao, W., Brunborg, H., Talaei, S., Lawless, C., Udell, M.: OptiMUS-0.3: Using large language models to model and solve optimization problems at scale. arXiv preprint arXiv:2407.19633 (2024)

  50. [50]

    In: International Conference on Learning Representations (2025)

    Jiang, X., Wu, Y., Zhang, C., Zhang, Y.: DRoC: Elevating large language mod- els for complex vehicle routing via decomposed retrieval of constraints. In: International Conference on Learning Representations (2025)

  51. [51]

    In: 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp

    Peng, M., Chen, Z., Yang, J., Huang, J., Shi, Z., Liu, Q., Li, X., Gao, L.: Auto- matic MILP model construction for multi-robot task allocation and scheduling based on large language models. In: 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 20291–20296 (2025). IEEE

  52. [52]

    arXiv preprint arXiv:2601.09635 (2026)

    Liang, K., Lu, Y., Mao, J., Sun, S., Yang, C., Zeng, C., Jin, X., Qin, H., Zhu, R., Teo, C.-P.: LLM for large-scale optimization model auto-formulation: A lightweight few-shot learning approach. arXiv preprint arXiv:2601.09635 (2026)

  53. [53]

    arXiv preprint arXiv:2309.13218 (2023)

    Amarasinghe, P.T., Nguyen, S., Sun, Y., Alahakoon, D.: AI-Copilot for business 27 optimisation: A framework and a case study in production scheduling. arXiv preprint arXiv:2309.13218 (2023)

  54. [54]

    arXiv preprint arXiv:2311.15271 (2023)

    Li, Q., Zhang, L., Mak-Hau, V.: Synthesizing mixed-integer linear programming models from natural language descriptions. arXiv preprint arXiv:2311.15271 (2023)

  55. [55]

    arXiv preprint arXiv:2405.01997 (2024)

    Masoud, M., Abdelhay, A., Elhenawy, M.: Exploring combinatorial problem solv- ing with large language models: A case study on the travelling salesman problem using GPT-3.5 turbo. arXiv preprint arXiv:2405.01997 (2024)

  56. [56]

    In: International Conference on Learning Representations (2025)

    Yang, Z., Wang, Y., Huang, Y., Guo, Z., Shi, W., Han, X., Feng, L., Song, L., Liang, X., Tang, J.: OptiBench meets ReSocratic: Measure and improve LLMs for optimization modeling. In: International Conference on Learning Representations (2025)

  57. [57]

    Operations Research (2025)

    Huang, C., Tang, Z., Hu, S., Jiang, R., Zheng, X., Ge, D., Wang, B., Wang, Z.: ORLM: A customizable framework in training large models for automated optimization modeling. Operations Research (2025)

  58. [58]

    IEEE Transactions on Evolutionary Computation (2026)

    Ma, Z., Gong, Y.-J., Guo, H., Chen, J., Ma, Y., Cao, Z., Zhang, J.: LLaMoCo: Instruction tuning of large language models for optimization code generation. IEEE Transactions on Evolutionary Computation (2026)

  59. [59]

    Advances in Neural Information Processing Systems (2026)

    Jiang, X., Wu, Y., Li, M., Cao, Z., Zhang, Y.: Large language models as end-to-end combinatorial optimization solvers. Advances in Neural Information Processing Systems (2026)

  60. [60]

    In: International Conference on Learning Representations (2025)

    Jiang, C., Shu, X., Qian, H., Lu, X., ZHOU, J., Zhou, A., Yu, Y.: LLMOPT: Learning to define and solve general optimization problems from scratch. In: International Conference on Learning Representations (2025)

  61. [61]

    KTO: Model Alignment as Prospect Theoretic Optimization

    Ethayarajh, K., Xu, W., Muennighoff, N., Jurafsky, D., Kiela, D.: KTO: Model alignment as prospect theoretic optimization. arXiv preprint arXiv:2402.01306 (2024)

  62. [62]

    arXiv preprint arXiv:2507.11737 (2025)

    Zhou, C., Yang, J., Xin, L., Chen, Y., He, Z., Ge, D.: Auto-formulating dynamic programming problems with large language models. arXiv preprint arXiv:2507.11737 (2025)

  63. [63]

    In: Advances in Neural Information Processing Systems (2023)

    Rafailov, R., Sharma, A., Mitchell, E., Manning, C.D., Ermon, S., Finn, C.: Direct preference optimization: Your language model is secretly a reward model. In: Advances in Neural Information Processing Systems (2023)

  64. [64]

    DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

    Shao, Z., Wang, P., Zhu, Q., Xu, R., Song, J., Bi, X., Zhang, H., Zhang, M., Li, Y., Wu, Y., et al.: DeepSeekMath: Pushing the limits of mathematical reasoning in open language models. arXiv preprint arXiv:2402.03300 (2024) 28

  65. [65]

    In: Proceedings of the AAAI Conference on Artificial Intelligence, vol

    Ding, Z., Tan, Z., Zhang, J., Chen, T.: OR-R1: Automating modeling and solving of operations research optimization problem via test-time reinforcement learn- ing. In: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 40, pp. 228–236 (2026)

  66. [66]

    In: International Conference on Machine Learning (2025)

    Lu, H., Xie, Z., Wu, Y., Ren, C., Chen, Y., Wen, Z.: OptMATH: A scalable bidi- rectional data synthesis framework for optimization modeling. In: International Conference on Machine Learning (2025)

  67. [67]

    In: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp

    Zhang, J., Wang, W., Guo, S., Wang, L., Lin, F., Yang, C., Yin, W.: Solving general natural-language-description optimization problems with large language models. In: Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp. 483–490 (2024)

  68. [68]

    Wu, Y., Zhang, Y., Wu, Y., Wang, Y., Zhang, J., Cheng, J.: Evo-Step: Evolutionary generation and stepwise validation for optimizing LLMs in OR (2025)

  69. [69]

    In: International Conference on Parallel Problem Solving from Nature, pp

    Yao, Y., Liu, F., Cheng, J., Zhang, Q.: Evolve cost-aware acquisition functions using large language models. In: International Conference on Parallel Problem Solving from Nature, pp. 374–390 (2024). Springer

  70. [70]

    IEEE Transactions on Smart Grid (2026)

    Lou, C., Jin, Z., Tang, W., Geng, G., Yang, J., Zhang, L.: Llm-enhanced multi- agent reinforcement learning with expert workflow for real-time p2p energy trading. IEEE Transactions on Smart Grid (2026)

  71. [71]

    In: Handbook of Evolutionary Machine Learning, pp

    Lehman, J., Gordon, J., Jain, S., Ndousse, K., Yeh, C., Stanley, K.O.: Evolution through large models. In: Handbook of Evolutionary Machine Learning, pp. 331–366. Springer, Cham, Switzerland (2023)

  72. [72]

    IFAC Journal of Systems and Control, 100420 (2026)

    Xu, H., Gao, J., Wen, J., Wang, W., Du, J.: Synergistic optimization and llm-informed decision-making for aero-engine rotor assembly. IFAC Journal of Systems and Control, 100420 (2026)

  73. [73]

    arXiv preprint arXiv:2504.19636 (2025)

    Liu, F., Zhang, Q., Shi, J., Tong, X., Mao, K., Yuan, M.: Fitness landscape of large language model-assisted automated algorithm search. arXiv preprint arXiv:2504.19636 (2025)

  74. [74]

    ACM Computing Surveys 58(11), 1–53 (2026)

    Da Ros, F., Soprano, M., Di Gaspero, L., Roitero, K.: Large language models for combinatorial optimization: A systematic review. ACM Computing Surveys 58(11), 1–53 (2026)

  75. [75]

    In: Advances in Neural Information Processing Systems (2023)

    Nie, A., Cheng, C.-A., Kolobov, A., Swaminathan, A.: Importance of direc- tional feedback for LLM-based optimizers. In: Advances in Neural Information Processing Systems (2023)

  76. [76]

    In: 3rd International Joint Conference on Artificial Intelligence, pp

    Wu, X., Zhong, Y., Wu, J., Jiang, B., Tan, K.C.: Large language model-enhanced 29 algorithm selection: Towards comprehensive algorithm representation. In: 3rd International Joint Conference on Artificial Intelligence, pp. 5235–5244 (2024)

  77. [77]

    arXiv preprint arXiv:2506.11057 (2025)

    Li, X., Yang, J., Wang, J., Peng, B., Yao, J., Guan, H.: STRCMP: Integrating graph structural priors with language models for combinatorial optimization. arXiv preprint arXiv:2506.11057 (2025)

  78. [78]

    In: ACM SIGKDD Conference on Knowledge Discovery and Data Mining V

    Wu, X., Wang, D., Wu, C., Wen, L., Miao, C., Xiao, Y., Zhou, Y.: Efficient heuristics generation for solving combinatorial optimization problems using large language models. In: ACM SIGKDD Conference on Knowledge Discovery and Data Mining V. 2, pp. 3228–3239 (2025)

  79. [79]

    arXiv preprint arXiv:2403.03962 (2024)

    Mao, J., Zou, D., Sheng, L., Liu, S., Gao, C., Wang, Y., Li, Y.: Identify crit- ical nodes in complex network with large language models. arXiv preprint arXiv:2403.03962 (2024)

  80. [80]

    arXiv preprint arXiv:2410.17656 (2024)

    Yu, H., Liu, J.: AutoRNet: Automatically optimizing heuristics for robust network design via large language models. arXiv preprint arXiv:2410.17656 (2024)

Showing first 80 references.