Recognition: 2 theorem links
· Lean TheoremSemantic Reverse Engineering Legacy Software Applications with ChatGPT, Gemini AI, and Claude AI
Pith reviewed 2026-05-08 18:51 UTC · model grok-4.3
The pith
ChatGPT, Gemini, and Claude can semantically reverse engineer legacy database applications.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Large language models supplied with legacy database source code can generate accurate semantic models, entity-relationship descriptions, and functional summaries through targeted prompting.
What carries the argument
Prompting large language models to perform semantic analysis and abstraction on legacy database code.
If this is right
- Legacy database maintenance can shift from line-by-line reading to reviewing AI-generated semantic summaries.
- Migration projects gain an automated first pass at understanding the original intent.
- Documentation gaps in old systems can be filled faster than manual reverse engineering allows.
Where Pith is reading between the lines
- The method might extend to non-database legacy systems if the prompting patterns generalize.
- Cross-checking outputs from multiple models could reduce individual hallucinations without added human effort.
- Over time this could create a library of reusable semantic templates for common legacy patterns.
Load-bearing premise
The AI models can reliably extract accurate semantic understanding from legacy code without significant hallucinations or the need for extensive human correction.
What would settle it
Run the same legacy application through both human experts and the AI pipeline and compare the resulting semantic models for systematic mismatches on data flows or business rules.
Figures
read the original abstract
This research paper describes our research results on using ChatGPT, Gemini, and Claude AI to semantically reverse engineer legacy database software applications.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript consists of a single-sentence abstract claiming to describe research results on using ChatGPT, Gemini, and Claude AI to semantically reverse engineer legacy database software applications. No methods, legacy code artifacts, prompts, sample outputs, evaluation metrics, or results sections are present in the full text.
Significance. Application of large language models to semantic reverse engineering of legacy database applications addresses a real industrial need for recovering meaning from undocumented code. If the work had supplied concrete artifacts, reproducible prompts, and correctness assessments, it could have offered a useful case study; the current document supplies none of these elements, so no contribution can be evaluated.
major comments (1)
- The full manuscript text contains only the abstract; there is no Methods section, no description of the specific legacy database applications examined, no prompting strategies or workflows, no example AI-generated semantic models, and no evaluation against ground truth. This absence makes the central claim of 'research results' impossible to assess.
Simulated Author's Rebuttal
We thank the referee for their review and for highlighting the gaps in the current manuscript. We agree that the submitted version is incomplete and does not provide sufficient detail to evaluate the claimed research results.
read point-by-point responses
-
Referee: The full manuscript text contains only the abstract; there is no Methods section, no description of the specific legacy database applications examined, no prompting strategies or workflows, no example AI-generated semantic models, and no evaluation against ground truth. This absence makes the central claim of 'research results' impossible to assess.
Authors: We acknowledge that the referee is correct: the submitted manuscript consists only of a single-sentence description and lacks all of the elements listed. This was an oversight during submission. In the revised manuscript we will add a complete Methods section, descriptions of the legacy database applications studied, the prompting strategies and workflows used with ChatGPT, Gemini, and Claude, representative AI-generated outputs, and evaluation metrics against ground truth. revision: yes
Circularity Check
No circularity: purely descriptive abstract with no derivation chain
full rationale
The manuscript consists solely of a one-sentence abstract stating that it 'describes our research results on using ChatGPT, Gemini, and Claude AI to semantically reverse engineer legacy database software applications.' No equations, models, parameters, predictions, uniqueness theorems, or methodological steps are present. Consequently none of the enumerated circularity patterns (self-definitional, fitted-input-as-prediction, self-citation load-bearing, etc.) can be instantiated, as there is no derivational content whose internal logic could reduce to its own inputs by construction. The document is self-contained as a high-level claim of having performed research; any evaluation of whether that claim is substantiated belongs to correctness or completeness assessment, not circularity.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
From Legacy Databases to (Elementary) Mathematical Data Model Schemas with MatBase, ChatGPT , Gemini AI, and Claude AI
Mancas C. and Mancas DC. “From Legacy Databases to (Elementary) Mathematical Data Model Schemas with MatBase, ChatGPT , Gemini AI, and Claude AI” . Current Trends in Comp. Sci. & App 3.5 (2026): 451-469
2026
-
[2]
ChatGPT (2026)
OpenAI. ChatGPT (2026). https://chatgpt.com/
2026
-
[3]
Gemini AI (2026)
Google LLC. Gemini AI (2026)
2026
-
[4]
Claude AI (2026)
Anthropic. Claude AI (2026). https://claude.ai
2026
-
[5]
MatBase Metadata Catalog Management
Mancas C. “MatBase Metadata Catalog Management”. Acta Scientific Computer Sciences 2.4 (2020): 25-29. PriMera Scientific Engineering https://primerascientific.com/psen Semantic Reverse Engineering Legacy Software Applications with ChatGPT, Gemini AI, and Claude AI 23
2020
-
[6]
The (Elementary) Mathematical Data Model revisited
Mancas C. “The (Elementary) Mathematical Data Model revisited” . PriMera Scientific Engineering 5.4 (2024): 78-91
2024
-
[7]
MatBase - A Tool for Transparent Programming while Modeling Data at Conceptual Levels
Mancas C. “MatBase - A Tool for Transparent Programming while Modeling Data at Conceptual Levels” . In: Proc. 5th Int. Conf. on Comp. Sci. & Inf. Techn. (CSITEC 2019), AIRCC Pub. Corp. Chennai, India (2019): 15-27
2019
-
[8]
Foundations of Databases
Abiteboul S, Hull R and Vianu V. “Foundations of Databases” . Addison-Wesley, Reading, MA (1995)
1995
-
[9]
A relational model for large shared data banks
Codd EF. “ A relational model for large shared data banks” . CACM 13.6 (1970): 377-387
1970
-
[10]
Conceptual Data Modeling and Database Design: A Completely Algorithmic Approach. Volume 1: The Shortest Advis able Path
Mancas C. “Conceptual Data Modeling and Database Design: A Completely Algorithmic Approach. Volume 1: The Shortest Advis able Path” . Apple Academic Press, Waretown, NJ (2015)
2015
-
[11]
The entity-relationship model. Toward a unified view of data
Chen PP . “The entity-relationship model. Toward a unified view of data” . ACM TODS 1.1 (1976): 9-36
1976
-
[12]
Entity-Relationship Modeling: Foundations of Database Technology
Thalheim B. “Entity-Relationship Modeling: Foundations of Database Technology” . Springer-Verlag, Berlin, Germany (2000)
2000
-
[13]
On Modelware as the 5th Generation of Programming Languages
Mancas C. “On Modelware as the 5th Generation of Programming Languages” . Acta Scientific Computer Sciences 2.9 (2020): 24- 26
2020
-
[14]
A Book of Set Theory
Pinter CC. “ A Book of Set Theory” . Dover Pub. Inc., Mineola, NY (2014)
2014
-
[15]
First-Order Logic. A Concise Introduction. 2nd Edition
Heil J. “First-Order Logic. A Concise Introduction. 2nd Edition” . Hackett Pub. Co. Inc., Indianapolis / Cambridge, U.S.A (2021)
2021
-
[16]
Temporal Logic and State Systems
Kroger F. and Merz S. “Temporal Logic and State Systems” . Springer-Verlag, Berlin Heidelberg, Germany (2008)
2008
-
[17]
Should Reverse Engineering Remain a Computer Science Cinderella?
Mancas C. “Should Reverse Engineering Remain a Computer Science Cinderella?” . J. Inform. Tech. Soft. Eng. S5 (2013): e001
2013
-
[18]
Reverse Engineering and Design Recovery: A Taxonomy
Chikofsky E. and Cross II J. “Reverse Engineering and Design Recovery: A Taxonomy” . IEEE Softw 7.1 (1190): 13-17
-
[19]
The Art of Memory Forensis
Ligh MH., et al. “The Art of Memory Forensis” . Wiley, Indianapolis, IN (2014)
2014
-
[20]
Reverse Engineering for Beginners
Yurichev D. “Reverse Engineering for Beginners” . https://bdigital.uvhm.edu.mx/wp-content/uploads/2020/07/Reverse-Engi- neering-for-Beginners.pdf
2020
-
[21]
Rootkits and Bootkits. Reversing Modern Malware and Next Generation Threats
Matrosov A., et al. “Rootkits and Bootkits. Reversing Modern Malware and Next Generation Threats” . no starch press, San Fran- cisco, CA (2019)
2019
-
[22]
The Ghidra Book, 2nd Edition
Nance K and Eagle C. “The Ghidra Book, 2nd Edition” . no starch press, San Francisco, CA (2026)
2026
-
[23]
Reversing: Secrets of Reverse Engineering
Eilam E. “Reversing: Secrets of Reverse Engineering” . Wiley, Indianapolis, IN (2005)
2005
-
[24]
Blue Fox: ARM Assembly Internals & Reverse Engineering
Markstedter MA. “Blue Fox: ARM Assembly Internals & Reverse Engineering” . Wiley, Hoboken, NJ (2023)
2023
-
[25]
EvilClippy
outflankln. EvilClippy. https://github.com/outflanknl/EvilClippy/pulls
-
[26]
VB Decompiler
DotFix Software. VB Decompiler. https://www.vb-decompiler.org/products.htm
-
[27]
MatBase E-RD Cycles Associated Non-relational Constraints Discovery Assistance Algorithm
Mancas C. “MatBase E-RD Cycles Associated Non-relational Constraints Discovery Assistance Algorithm” . In: Arai, K. et al. (eds). Intelligent Computing. CompCom 2019. Advances in Intelligent Systems and Computing, Springer, Cham, Switzerland 997 (2019)
2019
-
[28]
On enforcing dyadic-type self-map constraints in MatBase
Mancas C. “On enforcing dyadic-type self-map constraints in MatBase” . Intl. J. Front. In Eng. & Techn. Res. (IJFETR) 05.01 (2023): 014-026
2023
-
[29]
Design and development of a database software application for managing genealogical trees
Mancas DC. “Design and development of a database software application for managing genealogical trees” . [Unpublished M.Sc. Dissertation Thesis]. Ovidius University at Constanta, Romania, Mathematics and Informatics Department (2023)
2023
-
[30]
From (Elementary) Mathematical Data Model Schemas to Safe Blazor Web Applications with Claude AI
Mancas C and Mancas DC. “From (Elementary) Mathematical Data Model Schemas to Safe Blazor Web Applications with Claude AI” . PriMera Scientific Engineering J. 8.4 (2026): 16-37
2026
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.