Odyssey: Constructing Verifiable Local Truth-Preserving Foundation Models

Sridhar Mahadevan

arxiv: 2606.27593 · v1 · pith:X2L5IMDVnew · submitted 2026-06-25 · 💻 cs.AI · cs.LG

Odyssey: Constructing Verifiable Local Truth-Preserving Foundation Models

Sridhar Mahadevan This is my paper

Pith reviewed 2026-06-29 01:09 UTC · model grok-4.3

classification 💻 cs.AI cs.LG

keywords categorical frameworkfoundriesKan extensionsfoundation modelssheavesverifiabilitytruth preservationargumentation

0 comments

The pith

ODYSSEY composes foundries via left and right Kan extensions to build verifiable local truth-preserving foundation models.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces ODYSSEY, a categorical framework that treats foundation models as compositions of foundries. Each foundry functions as an organized sheaf specifying local contexts, representation families, restriction maps, gluing rules, obstruction policies, update obligations, and an embedded argumentation component. Universal Foundry Learning constructs these via left Kan extensions that integrate local artifacts into candidate foundries and right Kan extensions that enforce consistency conditions. The approach is implemented and tested across concrete foundries, supporting domain construction, artifact replay, sheaf diagnostics, Toulmin scrutiny, obstruction ledgers, and TICKET-certified causal-claim extraction from heterogeneous sources.

Core claim

Foundation models can be constructed as compositions of foundries using left and right Kan extensions, where foundries are organized sheaves of knowledge carrying argumentation components, and the extensions enforce restriction, gluing, obstruction, and argumentation conditions required for local truth preservation and verifiability.

What carries the argument

Foundries as organized sheaves of knowledge with covers of local contexts, restriction maps, gluing rules, obstruction policies, and argumentation components, composed through left and right Kan extensions in Universal Foundry Learning.

If this is right

The same categorical machinery supports domain construction, artifact replay, sheaf diagnostics, grounded Toulmin scrutiny, residual-obstruction ledgers, and causal-claim extraction.
Foundry SQL provides a typed query surface for slicing maintained foundry artifacts using TICKET certification.
Concrete foundries can be assembled from generic templates such as evidence/argument, scientific challenge, and evaluation-harness foundries.
External or pre-built models can be admitted into durable ODYSSEY state through TICKET-compatible certification.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The framework could scale verification to models trained on mixed institutional and market data sources.
Obstruction ledgers might serve as persistent records for auditing model updates over time.
TICKET certification could allow selective incorporation of existing LLMs without rebuilding entire foundries.

Load-bearing premise

Left and right Kan extensions applied to foundries defined via restriction maps, gluing rules, and obstruction policies will enforce local truth preservation and verifiability in actual foundation model construction.

What would settle it

A constructed ODYSSEY model that produces inconsistent argumentation or fails obstruction policies when applied to a concrete domain with heterogeneous data sources.

Figures

Figures reproduced from arXiv: 2606.27593 by Sridhar Mahadevan.

**Figure 1.** Figure 1: Foundries as reusable building blocks for foundation models. The “basis vector” language is an algebraic [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗

**Figure 2.** Figure 2: TICKET as ODYSSEY’s admission and consistency operator. The left Kan extension rolls a structured source artifact into the target foundry as a document-claim candidate. The right Kan / pullback pass supplies the UDL-style consistency check: the same interface asks whether the candidate claim, its grounds, and its warrant survive Toulmin scrutiny against the target cover and maintained foundry state. Promot… view at source ↗

**Figure 3.** Figure 3: ODYSSEY turns a user request into maintained foundry state through typed Scylla, Homer, Athena, Prometheus, Toulmin, TICKET, and SkillOpt contracts. In causal foundries, Prometheus may emit BRIDGE/SKFM residualobstruction artifacts, TICKET may apply IDC-style local causal diagnostics, and SkillOpt can optimize the admission skill using the resulting rollout and gate traces [PITH_FULL_IMAGE:figures/full_f… view at source ↗

**Figure 4.** Figure 4: Screenshots of the current ODYSSEY interface and generated foundry artifacts. The visual surfaces are intentionally artifact-backed: each screen is a view over JSON, HTML, and audit files emitted by the foundry pipeline. 28 [PITH_FULL_IMAGE:figures/full_fig_p028_4.png] view at source ↗

**Figure 5.** Figure 5: Grounded Toulmin/local-LLM Scylla interface. A local LLM run over Democritus/Prometheus claims [PITH_FULL_IMAGE:figures/full_fig_p029_5.png] view at source ↗

read the original abstract

We introduce a categorical framework called ODYSSEY for constructing verifiable, local truth-preserving foundation models as compositions of foundries: building-block architectural components that specify a cover of local contexts, local representation families, restriction maps, gluing rules, obstruction policies, update obligations, and human-facing views. A foundry is an organized sheaf of knowledge that carries within it an argumentation component. Concrete foundries are built from generic foundries such as evidence/argument, operational decision, institutional/financial, market meaning, scientific challenge, research-program, assistant-build, and evaluation-harness foundries. Universal Foundry Learning (UFL) formalizes foundry construction as a composition of left and right Kan extensions, with left Kan extension rolling local artifacts into candidate foundries and right Kan extension enforcing the restriction, gluing, obstruction, and argumentation conditions required for promotion. Foundry SQL (FSQL) is a small typed query surface for slicing maintained foundry artifacts that uses TICKET (Topos Integration using Causal Kan Extension Transformers) certification for admitting external or pre-built models into durable ODYSSEY state. ODYSSEY is fully implemented and tested across a wide spectrum of concrete foundries, showing that the same categorical machinery supports domain construction, artifact replay, sheaf diagnostics, grounded Toulmin/local-LLM scrutiny, residual-obstruction ledgers, and optimized TICKET-compatible causal-claim extraction across heterogeneous sources. This paper is to be presented as a 2.5 hour tutorial at ICML 2026. The tutorial home page is at https://bit.ly/4ajS0nA.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Odyssey proposes a categorical framework for verifiable foundation models via sheaves and Kan extensions on 'foundries' but asserts full implementation without any code, derivations, examples, or results.

read the letter

The main takeaway is that this paper sketches a framework called ODYSSEY that treats foundation models as compositions of 'foundries'—sheaves equipped with restriction maps, gluing rules, obstruction policies, and argumentation structures—and uses left and right Kan extensions to build and enforce local consistency. It adds terminology like UFL for the extensions, FSQL for queries, and TICKET for certification, plus links to Toulmin-style scrutiny and causal claims.

The paper does a clear job mapping standard categorical constructions onto AI concerns. Defining foundries as sheaves that carry both representation families and human-facing views, then using Kan extensions to roll up local artifacts while enforcing gluing and obstruction conditions, is a coherent way to formalize compositionality and local truth preservation. The attempt to integrate argumentation components directly into the sheaf data is a reasonable direction if the goal is verifiable models.

The soft spot is the mismatch between the claims and the delivered content. The abstract states that ODYSSEY is fully implemented and tested across a wide spectrum of concrete foundries and supports domain construction, artifact replay, sheaf diagnostics, and TICKET-compatible extraction. Yet the text supplies no pseudocode for the Kan extensions, no worked example on an actual dataset or model, no error metrics, and no repository. Verifiability is presented as following from the construction itself rather than demonstrated against external benchmarks. This leaves the central promise untested.

The paper is aimed at the small set of researchers already comfortable with topos theory and looking for ways to apply it to AI safety or modular models. A reader wanting new theorems, reproducible experiments, or even basic implementation details will not find them.

I would not recommend sending this to peer review. The framework needs concrete computations and evidence before it merits referee time.

Referee Report

2 major / 1 minor

Summary. The manuscript introduces ODYSSEY, a categorical framework for constructing verifiable, local truth-preserving foundation models as compositions of foundries (sheaves carrying argumentation components, defined via restriction maps, gluing rules, obstruction policies, and update obligations). Universal Foundry Learning (UFL) is formalized via left and right Kan extensions, with left extensions rolling artifacts into candidate foundries and right extensions enforcing conditions for promotion; Foundry SQL (FSQL) and TICKET certification are introduced for querying and admitting models. The central claim is that ODYSSEY is fully implemented and tested across concrete foundries (evidence/argument, operational decision, etc.), supporting domain construction, artifact replay, sheaf diagnostics, Toulmin scrutiny, residual-obstruction ledgers, and causal-claim extraction.

Significance. If the implementation claims and Kan-extension enforcement were demonstrated with concrete computations and metrics, the work could offer a structured categorical approach to verifiability in foundation models that integrates local contexts and argumentation. The manuscript, however, contains no derivations, code, datasets, error analysis, or examples, so no assessment of significance is possible.

major comments (2)

[Abstract] Abstract: the assertion that ODYSSEY 'is fully implemented and tested across a wide spectrum of concrete foundries' is load-bearing for the central claim of verifiability and truth-preservation, yet the manuscript supplies no pseudocode, no explicit computation of a left or right Kan extension on any model or dataset, no experimental metrics, and no reproducibility artifact.
[Abstract] Abstract: verifiability and local truth-preservation are enforced solely by the Kan-extension conditions on the foundries themselves (restriction maps, gluing rules, obstruction policies); this reduces the enforcement to a definitional property with no described external benchmark or independent check.

minor comments (1)

[Abstract] Abstract: the statement that the paper 'is to be presented as a 2.5 hour tutorial at ICML 2026' should be clarified to indicate whether the submission is intended as a research article or a tutorial description.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the comments. We address each major comment below, noting where revisions to the manuscript are appropriate.

read point-by-point responses

Referee: [Abstract] Abstract: the assertion that ODYSSEY 'is fully implemented and tested across a wide spectrum of concrete foundries' is load-bearing for the central claim of verifiability and truth-preservation, yet the manuscript supplies no pseudocode, no explicit computation of a left or right Kan extension on any model or dataset, no experimental metrics, and no reproducibility artifact.

Authors: The manuscript is structured as the foundation for a 2.5-hour ICML 2026 tutorial, with the linked homepage (https://bit.ly/4ajS0nA) providing the detailed implementations, pseudocode, explicit Kan extension computations, and examples across foundries. The abstract summarizes results from those implementations. We agree the manuscript text itself does not contain these elements and will add selected pseudocode for UFL, a worked Kan extension example, and basic reproducibility notes in revision. revision: yes
Referee: [Abstract] Abstract: verifiability and local truth-preservation are enforced solely by the Kan-extension conditions on the foundries themselves (restriction maps, gluing rules, obstruction policies); this reduces the enforcement to a definitional property with no described external benchmark or independent check.

Authors: The framework defines verifiability and local truth-preservation via the Kan extension conditions, restriction maps, gluing rules, and obstruction policies as a deliberate categorical guarantee by construction. This is not an oversight but the intended formal mechanism. TICKET certification and the argumentation components within foundries provide interfaces for external validation where desired, though the core enforcement remains internal to the sheaf structure. revision: no

Circularity Check

1 steps flagged

Truth-preservation and verifiability defined into Kan-extension construction by fiat

specific steps

self definitional [Abstract (UFL paragraph)]
"Universal Foundry Learning (UFL) formalizes foundry construction as a composition of left and right Kan extensions, with left Kan extension rolling local artifacts into candidate foundries and right Kan extension enforcing the restriction, gluing, obstruction, and argumentation conditions required for promotion."

The right Kan extension is introduced precisely to enforce the restriction/gluing/obstruction/argumentation conditions. These conditions are the paper's own definition of what makes a foundry 'verifiable' and 'local truth-preserving.' Therefore the claim that the construction enforces truth-preservation reduces directly to the definitional choice of the extension operator; no additional derivation or external validation is supplied.

full rationale

The paper's central claim is that UFL (left/right Kan extensions) constructs verifiable, local truth-preserving models. However, the right Kan extension is explicitly defined to enforce the very restriction/gluing/obstruction/argumentation conditions that constitute truth-preservation and verifiability in the framework. No independent external check or benchmark is described; the enforcement is the definition. The 'fully implemented and tested' assertion is stated without any exhibited computation or metric that could falsify it. This matches self-definitional circularity at the core of the derivation.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 4 invented entities

The central claim rests on numerous invented terms and the unproven transfer of categorical constructions to AI model verifiability; no free parameters are specified but the framework itself introduces many ad-hoc entities.

axioms (1)

domain assumption Categorical constructions such as sheaves and left/right Kan extensions can be used to enforce local truth preservation in foundation models.
Invoked throughout the abstract as the basis for the ODYSSEY framework without further justification.

invented entities (4)

Foundry no independent evidence
purpose: Organized sheaf of knowledge carrying an argumentation component for local contexts in foundation models.
Core new building block introduced to structure the framework; no independent evidence provided.
Universal Foundry Learning (UFL) no independent evidence
purpose: Formalization of foundry construction via composition of left and right Kan extensions.
New named process; reduces to the Kan extension definitions by construction.
Foundry SQL (FSQL) no independent evidence
purpose: Typed query surface for slicing maintained foundry artifacts.
New query language tied to the framework; no external validation.
TICKET no independent evidence
purpose: Topos Integration using Causal Kan Extension Transformers for certifying external models.
New certification mechanism; defined within the paper's own terms.

pith-pipeline@v0.9.1-grok · 5815 in / 1631 out tokens · 34115 ms · 2026-06-29T01:09:51.024847+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

19 extracted references · 6 canonical work pages · 2 internal anchors

[1]

Rishi Bommasani, Drew A

URL https://arxiv.org/abs/ 2402.01602. Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, et al. On the opportunities and risks of foundation models,

arXiv
[2]

Haolong Chen, Hanzhi Chen, Zijian Zhao, Kaifeng Han, Guangxu Zhu, Yichen Zhao, Ying Du, Wei Xu, and Qingjiang Shi

URLhttps://arxiv.org/abs/2108.07258. Haolong Chen, Hanzhi Chen, Zijian Zhao, Kaifeng Han, Guangxu Zhu, Yichen Zhao, Ying Du, Wei Xu, and Qingjiang Shi. An overview of domain-specific foundation model: Key technologies, applications and challenges,

Pith/arXiv arXiv
[3]

Qing Cheng, Zefan Zeng, Xingchen Hu, Yuehang Si, and Zhong Liu

URL https://arxiv.org/abs/2409.04267. Qing Cheng, Zefan Zeng, Xingchen Hu, Yuehang Si, and Zhong Liu. A survey of event causality identification: Taxonomy, challenges, assessment, and prospects.ACM Computing Surveys, 58(3):1–37,

arXiv
[4]

URLhttps://dl.acm.org/doi/10.1145/3756009

doi:10.1145/3756009. URLhttps://dl.acm.org/doi/10.1145/3756009. Amir Feder, Katherine A. Keith, Emaad Manzoor, Reid Pryzant, Dhanya Sridhar, Zach Wood-Doughty, Jacob Eisenstein, Justin Grimmer, Roi Reichart, Margaret E. Roberts, Brandon M. Stewart, Victor Veitch, and Diyi Yang. Causal inference in natural language processing: Estimation, prediction, inter...

work page doi:10.1145/3756009
[5]

Thomas McCoy, Paul Smolensky, Tal Linzen, Jianfeng Gao, and Asli Celikyilmaz

doi:10.1162/tacl a 00511. URL https: //aclanthology.org/2022.tacl-1.66/. Prashant Garg and Thiemo Fetzer. Testing causal claims in economics.arXiv preprint arXiv:2501.06873,

work page internal anchor Pith review doi:10.1162/tacl 2022
[6]

Dataset and analysis of causal claims extracted from economics papers

URL https://arxiv.org/abs/2501.06873. Dataset and analysis of causal claims extracted from economics papers. Ankita Gupta, Ethan Zuckerman, and Brendan O’Connor. Harnessing Toulmin’s theory for zero-shot argument explication. InProceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 10259–1027...

arXiv
[7]

doi:10.18653/v1/2024.acl-long.552

Association for Computational Linguistics. doi:10.18653/v1/2024.acl-long.552. URLhttps://aclanthology.org/2024.acl-long.552/. Oktie Hassanzadeh, Debarun Bhattacharjya, Mark Feblowitz, Michael Perrone, Shirin Sohrabi, Kavitha Srinivas, and Michael Katz. Causal knowledge extraction through large-scale text mining. InProceedings of the AAAI Conference on Art...

work page doi:10.18653/v1/2024.acl-long.552 2024
[8]

Bridging language and items for retrieval and recommendation.arXiv preprint arXiv:2403.03952,

30 APREPRINT- JUNE29, 2026 Yupeng Hou, Jiacheng Li, Zhankui He, An Yan, Xiusi Chen, and Julian McAuley. Bridging language and items for retrieval and recommendation.arXiv preprint arXiv:2403.03952,

Pith/arXiv arXiv 2026
[9]

Accepted by TMLR; arXiv:2305.00050

URL https: //openreview.net/forum?id=mqoxLkX210. Accepted by TMLR; arXiv:2305.00050. Yundong Kim and Heyoung Yang. TRACE: Toulmin-based reasoning assessment through constructive elements for LLM CoT evaluation,

arXiv
[10]

Hao Duong Le, Xin Xia, and Zhang Chen

URLhttps://arxiv.org/abs/2605.29656. Hao Duong Le, Xin Xia, and Zhang Chen. Multi-agent causal discovery using large language models.arXiv preprint arXiv:2407.15073,

Pith/arXiv arXiv
[11]

Sridhar Mahadevan

URL https://arxiv.org/abs/2405.10467. Sridhar Mahadevan. Large causal models from large language models, 2025a. URL https://arxiv.org/abs/2512. 07796. Sridhar Mahadevan. CLIFF CatAgi: Categories for AGI local research interface. GitHub repository, 2025b. URL https://github.com/sridharmahadevan/CLIFF_CatAgi. Sridhar Mahadevan. Categories for AGI. Book manu...

arXiv
[12]

Asko Parpola.Deciphering the Indus Script

URL https: //arxiv.org/abs/2604.17828. Asko Parpola.Deciphering the Indus Script. Cambridge University Press, reissue edition,

Pith/arXiv arXiv
[13]

Stephen E

doi:10.1145/2187836.2187958. Stephen E. Toulmin.The Uses of Argument. Cambridge University Press,

work page doi:10.1145/2187836.2187958
[14]

Xinyi Xu, Zhaoxuan Wu, Rui Qiao, Arun Verma, Yao Shu, Jingtan Wang, Xinyuan Niu, Zhenfeng He, Jiangwei Chen, Zijian Zhou, et al

URLhttps://arxiv.org/abs/2403.13784. Xinyi Xu, Zhaoxuan Wu, Rui Qiao, Arun Verma, Yao Shu, Jingtan Wang, Xinyuan Niu, Zhenfeng He, Jiangwei Chen, Zijian Zhou, et al. Data-centric AI in the age of large language models,

arXiv
[15]

Yutaro Yamada, Robert Tjarko Lange, Cong Lu, Shengran Hu, Chris Lu, Jakob Foerster, Jeff Clune, and David Ha

URL https://arxiv.org/abs/ 2406.14473. Yutaro Yamada, Robert Tjarko Lange, Cong Lu, Shengran Hu, Chris Lu, Jakob Foerster, Jeff Clune, and David Ha. The AI scientist-v2: Workshop-level automated scientific discovery via agentic tree search.arXiv preprint arXiv:2504.08066,

arXiv
[16]

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

doi:10.48550/arXiv.2504.08066. Jie Yang, Soyeon Caren Han, and Josiah Poon. A survey on extraction of causal relations from natural language text. Knowledge and Information Systems, 64(5):1161–1186,

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2504.08066
[17]

Yifan Yang, Ziyang Gong, Weiquan Huang, Qihao Yang, Ziwei Zhou, Zisu Huang, Yan Li, Xuemei Gao, Qi Dai, Bei Liu, Kai Qiu, Yuqing Yang, Dongdong Chen, Xue Yang, and Chong Luo

doi:10.1007/s10115-022-01665-w. Yifan Yang, Ziyang Gong, Weiquan Huang, Qihao Yang, Ziwei Zhou, Zisu Huang, Yan Li, Xuemei Gao, Qi Dai, Bei Liu, Kai Qiu, Yuqing Yang, Dongdong Chen, Xue Yang, and Chong Luo. Skillopt: Executive strategy for self-evolving agent skills,

work page doi:10.1007/s10115-022-01665-w
[18]

31 APREPRINT- JUNE29, 2026 Jiahang Zhou, Yanyu Chen, Zicong Hong, Wuhui Chen, Yue Yu, Tao Zhang, Hui Wang, Chuanfu Zhang, and Zibin Zheng

URLhttps://arxiv.org/abs/2605.23904. 31 APREPRINT- JUNE29, 2026 Jiahang Zhou, Yanyu Chen, Zicong Hong, Wuhui Chen, Yue Yu, Tao Zhang, Hui Wang, Chuanfu Zhang, and Zibin Zheng. Training and serving system of foundation models: A comprehensive survey,

Pith/arXiv arXiv 2026
[19]

URL https: //arxiv.org/abs/2401.02643. 32 APREPRINT- JUNE29, 2026 Foundry Current gluing artifact Further validation Prometheus transport The ingestion console exposes 129 Prometheus-family cases: 6 admit- ted, 6 quarantined, and 117 legacy candidates awaiting TICKET re- view; DKS and MyFixIt compile back into Prometheus-compatible sheaf explorers. Full T...

arXiv 2026

[1] [1]

Rishi Bommasani, Drew A

URL https://arxiv.org/abs/ 2402.01602. Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, et al. On the opportunities and risks of foundation models,

arXiv

[2] [2]

Haolong Chen, Hanzhi Chen, Zijian Zhao, Kaifeng Han, Guangxu Zhu, Yichen Zhao, Ying Du, Wei Xu, and Qingjiang Shi

URLhttps://arxiv.org/abs/2108.07258. Haolong Chen, Hanzhi Chen, Zijian Zhao, Kaifeng Han, Guangxu Zhu, Yichen Zhao, Ying Du, Wei Xu, and Qingjiang Shi. An overview of domain-specific foundation model: Key technologies, applications and challenges,

Pith/arXiv arXiv

[3] [3]

Qing Cheng, Zefan Zeng, Xingchen Hu, Yuehang Si, and Zhong Liu

URL https://arxiv.org/abs/2409.04267. Qing Cheng, Zefan Zeng, Xingchen Hu, Yuehang Si, and Zhong Liu. A survey of event causality identification: Taxonomy, challenges, assessment, and prospects.ACM Computing Surveys, 58(3):1–37,

arXiv

[4] [4]

URLhttps://dl.acm.org/doi/10.1145/3756009

doi:10.1145/3756009. URLhttps://dl.acm.org/doi/10.1145/3756009. Amir Feder, Katherine A. Keith, Emaad Manzoor, Reid Pryzant, Dhanya Sridhar, Zach Wood-Doughty, Jacob Eisenstein, Justin Grimmer, Roi Reichart, Margaret E. Roberts, Brandon M. Stewart, Victor Veitch, and Diyi Yang. Causal inference in natural language processing: Estimation, prediction, inter...

work page doi:10.1145/3756009

[5] [5]

Thomas McCoy, Paul Smolensky, Tal Linzen, Jianfeng Gao, and Asli Celikyilmaz

doi:10.1162/tacl a 00511. URL https: //aclanthology.org/2022.tacl-1.66/. Prashant Garg and Thiemo Fetzer. Testing causal claims in economics.arXiv preprint arXiv:2501.06873,

work page internal anchor Pith review doi:10.1162/tacl 2022

[6] [6]

Dataset and analysis of causal claims extracted from economics papers

URL https://arxiv.org/abs/2501.06873. Dataset and analysis of causal claims extracted from economics papers. Ankita Gupta, Ethan Zuckerman, and Brendan O’Connor. Harnessing Toulmin’s theory for zero-shot argument explication. InProceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 10259–1027...

arXiv

[7] [7]

doi:10.18653/v1/2024.acl-long.552

Association for Computational Linguistics. doi:10.18653/v1/2024.acl-long.552. URLhttps://aclanthology.org/2024.acl-long.552/. Oktie Hassanzadeh, Debarun Bhattacharjya, Mark Feblowitz, Michael Perrone, Shirin Sohrabi, Kavitha Srinivas, and Michael Katz. Causal knowledge extraction through large-scale text mining. InProceedings of the AAAI Conference on Art...

work page doi:10.18653/v1/2024.acl-long.552 2024

[8] [8]

Bridging language and items for retrieval and recommendation.arXiv preprint arXiv:2403.03952,

30 APREPRINT- JUNE29, 2026 Yupeng Hou, Jiacheng Li, Zhankui He, An Yan, Xiusi Chen, and Julian McAuley. Bridging language and items for retrieval and recommendation.arXiv preprint arXiv:2403.03952,

Pith/arXiv arXiv 2026

[9] [9]

Accepted by TMLR; arXiv:2305.00050

URL https: //openreview.net/forum?id=mqoxLkX210. Accepted by TMLR; arXiv:2305.00050. Yundong Kim and Heyoung Yang. TRACE: Toulmin-based reasoning assessment through constructive elements for LLM CoT evaluation,

arXiv

[10] [10]

Hao Duong Le, Xin Xia, and Zhang Chen

URLhttps://arxiv.org/abs/2605.29656. Hao Duong Le, Xin Xia, and Zhang Chen. Multi-agent causal discovery using large language models.arXiv preprint arXiv:2407.15073,

Pith/arXiv arXiv

[11] [11]

Sridhar Mahadevan

URL https://arxiv.org/abs/2405.10467. Sridhar Mahadevan. Large causal models from large language models, 2025a. URL https://arxiv.org/abs/2512. 07796. Sridhar Mahadevan. CLIFF CatAgi: Categories for AGI local research interface. GitHub repository, 2025b. URL https://github.com/sridharmahadevan/CLIFF_CatAgi. Sridhar Mahadevan. Categories for AGI. Book manu...

arXiv

[12] [12]

Asko Parpola.Deciphering the Indus Script

URL https: //arxiv.org/abs/2604.17828. Asko Parpola.Deciphering the Indus Script. Cambridge University Press, reissue edition,

Pith/arXiv arXiv

[13] [13]

Stephen E

doi:10.1145/2187836.2187958. Stephen E. Toulmin.The Uses of Argument. Cambridge University Press,

work page doi:10.1145/2187836.2187958

[14] [14]

Xinyi Xu, Zhaoxuan Wu, Rui Qiao, Arun Verma, Yao Shu, Jingtan Wang, Xinyuan Niu, Zhenfeng He, Jiangwei Chen, Zijian Zhou, et al

URLhttps://arxiv.org/abs/2403.13784. Xinyi Xu, Zhaoxuan Wu, Rui Qiao, Arun Verma, Yao Shu, Jingtan Wang, Xinyuan Niu, Zhenfeng He, Jiangwei Chen, Zijian Zhou, et al. Data-centric AI in the age of large language models,

arXiv

[15] [15]

Yutaro Yamada, Robert Tjarko Lange, Cong Lu, Shengran Hu, Chris Lu, Jakob Foerster, Jeff Clune, and David Ha

URL https://arxiv.org/abs/ 2406.14473. Yutaro Yamada, Robert Tjarko Lange, Cong Lu, Shengran Hu, Chris Lu, Jakob Foerster, Jeff Clune, and David Ha. The AI scientist-v2: Workshop-level automated scientific discovery via agentic tree search.arXiv preprint arXiv:2504.08066,

arXiv

[16] [16]

The AI Scientist-v2: Workshop-Level Automated Scientific Discovery via Agentic Tree Search

doi:10.48550/arXiv.2504.08066. Jie Yang, Soyeon Caren Han, and Josiah Poon. A survey on extraction of causal relations from natural language text. Knowledge and Information Systems, 64(5):1161–1186,

work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.2504.08066

[17] [17]

Yifan Yang, Ziyang Gong, Weiquan Huang, Qihao Yang, Ziwei Zhou, Zisu Huang, Yan Li, Xuemei Gao, Qi Dai, Bei Liu, Kai Qiu, Yuqing Yang, Dongdong Chen, Xue Yang, and Chong Luo

doi:10.1007/s10115-022-01665-w. Yifan Yang, Ziyang Gong, Weiquan Huang, Qihao Yang, Ziwei Zhou, Zisu Huang, Yan Li, Xuemei Gao, Qi Dai, Bei Liu, Kai Qiu, Yuqing Yang, Dongdong Chen, Xue Yang, and Chong Luo. Skillopt: Executive strategy for self-evolving agent skills,

work page doi:10.1007/s10115-022-01665-w

[18] [18]

31 APREPRINT- JUNE29, 2026 Jiahang Zhou, Yanyu Chen, Zicong Hong, Wuhui Chen, Yue Yu, Tao Zhang, Hui Wang, Chuanfu Zhang, and Zibin Zheng

URLhttps://arxiv.org/abs/2605.23904. 31 APREPRINT- JUNE29, 2026 Jiahang Zhou, Yanyu Chen, Zicong Hong, Wuhui Chen, Yue Yu, Tao Zhang, Hui Wang, Chuanfu Zhang, and Zibin Zheng. Training and serving system of foundation models: A comprehensive survey,

Pith/arXiv arXiv 2026

[19] [19]

URL https: //arxiv.org/abs/2401.02643. 32 APREPRINT- JUNE29, 2026 Foundry Current gluing artifact Further validation Prometheus transport The ingestion console exposes 129 Prometheus-family cases: 6 admit- ted, 6 quarantined, and 117 legacy candidates awaiting TICKET re- view; DKS and MyFixIt compile back into Prometheus-compatible sheaf explorers. Full T...

arXiv 2026