ISBN 979-8-89176-251-0

Moxin Li, Yong Zhao, Wenxuan Zhang, Shuaiyi Li, Wenya Xie, See-Kiong Ng · 2025 · DOI 10.18653/v1/2025.acl-long.256

7 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

PhantomBench: Benchmarking the Non-existential Threat of Language Models

cs.CL · 2026-06-09 · unverdicted · novelty 7.0

PhantomBench is a new benchmark of 60K+ non-existent terms showing language models hallucinate at rates up to 86.7 percent even when inputs assume the concepts exist.

Code-on-Graph: Iterative Programmatic Reasoning via Large Language Models on Knowledge Graphs

cs.AI · 2026-06-02 · unverdicted · novelty 7.0

Code-on-Graph lets LLMs turn retrieved KG facts into Python class instances and generate executable code for reasoning, outperforming prior LLM-KG methods by up to 10.5% on WebQSP, CWQ, and GrailQA.

Decision Potential Surface: A Theoretical and Practical Approximation of Large Language Model Decision Boundary

cs.LG · 2025-09-27 · unverdicted · novelty 7.0

Defines Decision Potential Surface (DPS) whose zero isohypse equals an LLM decision boundary and supplies a K-sample approximation algorithm with derived upper bounds on absolute, expected, and concentration errors.

Localizing RL-Induced Tool Use to a Single Crosscoder Feature

cs.LG · 2026-06-25 · unverdicted · novelty 6.0

Dedicated Feature Crosscoders localize RL-induced tool use to a compact feature set in Qwen2.5-3B, yielding +31.1 pp tool correctness gains and +6.8 pp spillover to the base model.

Staying In Character: Perspective-Bounded Memory For Book-Based Role-Playing Agents

cs.CL · 2026-06-24 · unverdicted · novelty 6.0

REVERIEMEM is a three-layer perspective-bounded memory system that raises knowledge boundary fidelity by 34.6 points and wins ~79% of narrative comparisons on a new book-based role-playing benchmark.

A Case Study on the Impact of Anonymization Along the RAG Pipeline

cs.CR · 2026-04-17 · unverdicted · novelty 6.0

Anonymization placement in RAG—at the dataset or at the generated answer—creates observable differences in privacy protection versus response utility.

An End-to-End Ukrainian RAG for Local Deployment. Optimized Hybrid Search and Lightweight Generation

cs.CL · 2026-04-23 · unverdicted · novelty 4.0

A two-stage hybrid search pipeline paired with a synthetic-data fine-tuned and compressed Ukrainian language model delivers competitive local question answering under strict compute limits.

citing papers explorer

PhantomBench: Benchmarking the Non-existential Threat of Language Models · cs.CL · 2026-06-09 · unverdicted · none · ref 43
Code-on-Graph: Iterative Programmatic Reasoning via Large Language Models on Knowledge Graphs · cs.AI · 2026-06-02 · unverdicted · none · ref 21
Decision Potential Surface: A Theoretical and Practical Approximation of Large Language Model Decision Boundary · cs.LG · 2025-09-27 · unverdicted · none · ref 10
Localizing RL-Induced Tool Use to a Single Crosscoder Feature · cs.LG · 2026-06-25 · unverdicted · none · ref 19
Staying In Character: Perspective-Bounded Memory For Book-Based Role-Playing Agents · cs.CL · 2026-06-24 · unverdicted · none · ref 38
A Case Study on the Impact of Anonymization Along the RAG Pipeline · cs.CR · 2026-04-17 · unverdicted · none · ref 21
An End-to-End Ukrainian RAG for Local Deployment. Optimized Hybrid Search and Lightweight Generation · cs.CL · 2026-04-23 · unverdicted · none · ref 15
