Building Specialized Software-Assistant ChatBot with Graph-Based Retrieval-Augmented Generation
Pith reviewed 2026-05-21 19:50 UTC · model grok-4.3
The pith
Enterprise web applications can be automatically converted into state-action knowledge graphs that enable LLMs to generate accurate, context-aware assistance without hallucinations.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The framework automatically converts enterprise web applications into state-action knowledge graphs, enabling LLMs to generate grounded and context-aware assistance. An engineering pipeline extracts software interfaces and structures them as graphs, after which a graph-based retrieval process supplies relevant states and actions to the model during generation, all integrated into digital adoption platform workflows.
What carries the argument
State-action knowledge graphs extracted from enterprise web application interfaces, which represent possible user interactions and software states for retrieval-augmented generation.
If this is right
- LLMs can deliver reliable assistance for navigating complex enterprise systems such as CRM, ERP, or HRMS without producing hallucinations.
- Manual creation and maintenance of interactive guides in digital adoption platforms is reduced through automation.
- The approach works with black-box production LLMs since no access to model weights or fine-tuning is needed.
- Scalability improves for large enterprise applications via the automated graph extraction pipeline.
- Industrial deployment lessons on robustness can guide integration into real company workflows.
Where Pith is reading between the lines
- The extraction pipeline might be adapted to handle mobile or desktop interfaces if similar parsing techniques are developed for those environments.
- State-action graphs could be enriched with logged user behavior data to support more predictive forms of assistance.
- This retrieval method may combine with other knowledge sources to cover software features that are hard to extract from the interface alone.
Load-bearing premise
The automatic extraction process produces state-action graphs that faithfully capture the relevant user interactions and interface states of the target enterprise software.
What would settle it
A direct test in which the assistant is asked for navigation steps on a specific feature of the enterprise software and the generated response includes incorrect or incomplete actions that do not match the actual interface.
Figures
read the original abstract
Digital Adoption Platforms (DAPs) have become essential tools for helping employees navigate complex enterprise software such as CRM, ERP, or HRMS systems. Companies like LemonLearning have shown how digital guidance can reduce training costs and accelerate onboarding. However, building and maintaining these interactive guides still requires extensive manual effort. Leveraging Large Language Models as virtual assistants is an appealing alternative, yet without a structured understanding of the target software, LLMs often hallucinate and produce unreliable answers. Moreover, most production-grade LLMs are black-box APIs, making fine-tuning impractical due to the lack of access to model weights. In this work, we introduce a Graph-based Retrieval-Augmented Generation framework that automatically converts enterprise web applications into state-action knowledge graphs, enabling LLMs to generate grounded and context-aware assistance. The framework was co-developed with the AI enterprise RAKAM, in collaboration with Lemon Learning. We detail the engineering pipeline that extracts and structures software interfaces, the design of the graph-based retrieval process, and the integration of our approach into production DAP workflows. Finally, we discuss scalability, robustness, and deployment lessons learned from industrial use cases.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces a Graph-based Retrieval-Augmented Generation framework that automatically converts enterprise web applications (CRM, ERP, HRMS) into state-action knowledge graphs. These graphs are intended to ground LLM responses for digital adoption platforms, reducing manual effort in building interactive guides while preventing hallucinations. The work describes the extraction pipeline (DOM analysis, state identification, graph construction), the retrieval mechanism, integration into production DAP workflows, and lessons from industrial deployments developed with RAKAM and Lemon Learning.
Significance. If the automatic extraction produces complete and accurate state-action graphs, the framework could meaningfully lower the cost of maintaining digital guidance systems and improve the reliability of LLM assistants in complex enterprise software. The industrial collaboration and reported deployment lessons offer practical value for scalability and robustness in real-world settings.
major comments (1)
- [Extraction and structuring pipeline (as described in the abstract and engineering pipeline section)] The central claim that the framework enables grounded, hallucination-free assistance rests on the assumption that the automatic extraction pipeline faithfully captures relevant user interactions and interface states. The manuscript describes the pipeline (DOM analysis, state identification, graph construction) but provides no quantitative metrics on extraction fidelity, such as state coverage, action recall on multi-step workflows, or accuracy against manually annotated ground truth for the target CRM/ERP systems. Dynamic elements (JS-driven views, conditional rendering, role-based states) are acknowledged as challenges but receive no empirical evaluation.
minor comments (2)
- [Graph design] Clarify the exact definition of 'state' and 'action' nodes in the knowledge graph, including how conditional or role-specific views are represented.
- [Deployment lessons] The abstract and text refer to 'industrial use cases' and 'deployment lessons'; adding a brief table summarizing the systems tested and qualitative outcomes would improve readability.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback and for highlighting the importance of empirical validation for the extraction pipeline. We agree that quantitative metrics would strengthen the central claims regarding grounded assistance and hallucination reduction. Below we address the major comment in detail and outline the revisions we will make.
read point-by-point responses
-
Referee: [Extraction and structuring pipeline (as described in the abstract and engineering pipeline section)] The central claim that the framework enables grounded, hallucination-free assistance rests on the assumption that the automatic extraction pipeline faithfully captures relevant user interactions and interface states. The manuscript describes the pipeline (DOM analysis, state identification, graph construction) but provides no quantitative metrics on extraction fidelity, such as state coverage, action recall on multi-step workflows, or accuracy against manually annotated ground truth for the target CRM/ERP systems. Dynamic elements (JS-driven views, conditional rendering, role-based states) are acknowledged as challenges but receive no empirical evaluation.
Authors: We acknowledge that the manuscript, as currently written, does not report quantitative metrics on extraction fidelity such as state coverage, action recall, or accuracy against manually annotated ground truth. The paper's primary contributions center on the overall Graph-RAG framework, its integration into production DAP workflows, and lessons from industrial deployments with RAKAM and Lemon Learning rather than a controlled benchmark study of the extraction component. Dynamic elements are discussed qualitatively as engineering challenges. We agree that adding empirical evaluation would better support the claims. In the revised manuscript we will include a new evaluation subsection that reports state coverage and action recall on a representative subset of the CRM/ERP interfaces, using manually annotated ground truth where available, and we will discuss limitations for fully dynamic JS-driven views. revision: yes
Circularity Check
No circularity: engineering pipeline presented without derivations or self-referential reductions
full rationale
The paper describes a practical engineering framework for automatically extracting state-action knowledge graphs from enterprise web applications and integrating them into a graph-based RAG system for LLM assistants. No mathematical derivations, equations, fitted parameters, or predictions are present that could reduce outputs to inputs by construction. The central claims rest on the described extraction pipeline (DOM analysis, state identification, graph construction) and industrial deployment lessons rather than any self-citation chain, uniqueness theorem, or ansatz imported from prior work. The work is self-contained as an applied construction; external validation of extraction fidelity is a separate evidence question, not a circularity issue.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Enterprise web applications can be automatically parsed into a faithful state-action graph that captures all relevant user interactions.
Reference graph
Works this paper leans on
-
[1]
Asai, A., Wu, Z., Wang, Y ., Sil, A., and Hajishirzi, H
URL https://www.anthropic.com/ news/claude-3-family. Asai, A., Wu, Z., Wang, Y ., Sil, A., and Hajishirzi, H. Self- rag: Learning to retrieve, generate, and critique through self-reflection. InThe Twelfth International Conference on Learning Representations, 2023. Batz, G. V . An optimization technique for subgraph match- ing strategies. Technical report,...
work page 2023
-
[2]
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
URL https://proceedings.neurips. cc/paper_files/paper/2020/file/ 1457c0d6bfcb4967418bfb8ac142f64a-Paper. pdf. Cheng, D., Huang, S., Bi, J., Zhan, Y ., Liu, J., Wang, Y ., Sun, H., Wei, F., Deng, W., and Zhang, Q. Uprise: Universal prompt retrieval for improving zero-shot evaluation. In Proceedings of the 2023 Conference on Empirical Meth- ods in Natural L...
work page internal anchor Pith review Pith/arXiv arXiv 2020
-
[3]
URL https://api.semanticscholar. org/CorpusID:3144218. Latka, G. Lemon learning: Company profile and revenue data, 2024. URL https://getlatka.com/ companies/lemon-learning#:˜:text=In% 202024%2C%20Lemon%20Learning%E2%80% 99s%20revenue,increasing%20adoption% 20across%20various%20industries. Ac- cessed: 2025-10-26. Lewis, P., Perez, E., Piktus, A., Petroni, ...
work page 2024
-
[4]
URL https://arxiv.org/abs/2005. 11401. Li, X. L. and Liang, P. Prefix-tuning: Optimizing continuous prompts for generation, 2021. URL https://arxiv. org/abs/2101.00190. Meta AI. Introducing meta llama 3: The most capable openly available llm to date, 2024. URL https://ai. meta.com/blog/meta-llama-3. Mortaheb, M., Khojastepour, M. A. A., Chakradhar, S. T.,...
work page internal anchor Pith review Pith/arXiv arXiv 2005
-
[5]
URL https://api.semanticscholar. org/CorpusID:201646309. Saha, S., Yadav, P., Bauer, L., and Bansal, M. Explagraphs: An explanation graph generation task for structured com- monsense reasoning.arXiv preprint arXiv:2104.07644, 2021. Sawarkar, K., Mangal, A., and Solanki, S. R. Blended rag: Improving rag (retriever-augmented generation) accuracy with semant...
-
[6]
Go to the Opportunities tab
-
[7]
Find and open the opportunity you want to modify
-
[8]
Click Edit, update the details and click Save. 0.91 LLM+GRAG 1. Go to the Opportunities section from the home page. This displays a list of recently viewed opportunities, with details such as the account name, stage, closing date, and owner alias. 2. From the list of opportunities, select the opportunity you want to edit by clicking on its name. This will...
-
[9]
Allez dans l’onglet Tˆaches ou dans la section Activit´es
-
[10]
Trouvez et ouvrez la tˆache `a modifier
-
[11]
Cliquez sur Modifier, mettez `a jour les informations n´ecessaires, puis cliquez sur Enregistrer. 0.75 LLM+G-RAG 1. Depuis le tableau de bord principal, acc´edez `a la section de gestion des tˆaches via la barre de navigation. 2. Dans la liste des t ˆaches r´ecemment consult´ees, s´electionnez la tˆache `a modifier afin d’afficher ses d´etails. 3. Utilise...
-
[12]
Find and open the contact you want to modify
-
[13]
Click Edit, update the details (e.g., Name, Email, Phone, Account), and click Save. 0.91 LLM+G-RAG 1. Go to the Contacts section from the main navigation bar
-
[14]
Find the contact you want to edit using the search bar or by browsing the list of recently viewed contacts
-
[15]
Select the desired contact from the list to access its details
-
[16]
Look for the option to edit the contact’s information (e.g., an edit button or icon)
-
[17]
Edit the necessary fields, such as full name, associated account name, phone number, or email address
-
[18]
Save your changes to update the contact’s information. 2.47 Table 4. Qualitative comparison between Graph-RAG (G-RAG) and a standalone LLM onSalesforceuser queries. Each example shows the user question in French and in English, the model-generated response, and the corresponding response time in second. Each block of three rows represents one query instan...
-
[19]
Locate the contract you want to delete in the list
-
[20]
Click on the contract name to open its details
-
[21]
Confirm the deletion when prompted
Scroll to the bottom (or top, depending on your version) and click Delete. Confirm the deletion when prompted. 0.79 LLM+GRAG 1. From the home page, go to the Commerce section. 2. In the dedicated menu, select the option to display the list of contracts or subscriptions. On the contract management page, you can: - View the list of existing contracts. - Use...
-
[22]
Click New Intervention
-
[23]
Select the customer and fill in date, staff, and description
-
[24]
Click Save. 0.88 LLM+G-RAG 1. From the home page, go to the Commerce section
-
[25]
In the menu dedicated to interventions, select the option to create a new intervention
-
[26]
Use the add button if necessary to manage third parties
On the intervention creation page, choose a third party from the drop-down list provided for this purpose. Use the add button if necessary to manage third parties
-
[27]
Confirm the creation of the draft intervention using the appropriate action
-
[28]
6.78 User query How to modify a client invoice LLM 1
If necessary, cancel the process using the cancel option. 6.78 User query How to modify a client invoice LLM 1. Log in to Dolibarr and go to Commercial→Invoices→Customer Invoices
-
[29]
Find and open the invoice you want to modify
-
[30]
Click Modify, update the details such as products, quantities, prices, or dates
-
[31]
Click Save to apply the changes. 0.72 LLM+G-RAG 1. Go to the Home section from the main menu to view the general dashboard. 2. From the dashboard, go to the Customer Invoices section to view the list of customer invoices
-
[32]
From the list of customer invoices, select the invoice you want to modify. 4. Use the available options to edit the invoice details, subject to restrictions related to the invoice status (for example, modifying a “Paid” invoice may be limited). 5. Apply the desired changes and save them. Features available on the customer invoice management page: - Displa...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.