Yuxi Sun, Aoqi Zuo, Wei Gao, and Jing Ma

ConflictBank: A benchmark for evaluating the influence of knowledge conflicts in LLM · 2025 · arXiv 2408.12076

4 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Diagnosing LLM Arbitration Behavior over Pre-evidence Epistemic States in RAG-based Fact-Checking

cs.AI · 2026-05-31 · unverdicted · novelty 6.0

PAVE evaluates LLM verifiers across four pre-evidence epistemic states in RAG fact-checking, revealing model-dependent unreliable arbitration and proposing a JSD-based test-time method to improve reliability.

Trust or Abstain? A Self-Aware RAG Approach

cs.IR · 2026-05-11 · unverdicted · novelty 6.0

SABER combines self-prior with multi-trace PK and CK reasoning representations to estimate reliability beliefs and drive trust-or-abstain decisions in knowledge-conflict RAG, improving accuracy over baselines.

The Override Gap: A Magnitude Account of Knowledge Conflict Failure in Hypernetwork-Based Instant LLM Adaptation

cs.LG · 2026-04-26 · conditional · novelty 6.0 · 2 refs

Knowledge conflicts in hypernetwork LLM adaptation stem from constant adapter margins losing to frequency-dependent pretrained margins; selective layer boosting and conflict-aware triggering raise deep-conflict accuracy to 71-72.5% on Gemma-2B and Mistral-7B.

Beyond Math: Stories as a Testbed for Memorization-Constrained Reasoning in LLMs

cs.CL · 2024-12-18 · unverdicted · novelty 6.0

Introduces a memorization-mitigation technique that lowers LLM accuracy on popular fiction character tasks from 96% to 72% by limiting verbatim memorization while retaining gist cues.

citing papers explorer

Showing 4 of 4 citing papers.

Diagnosing LLM Arbitration Behavior over Pre-evidence Epistemic States in RAG-based Fact-Checking

cs.AI · 2026-05-31 · unverdicted · none · ref 5

Trust or Abstain? A Self-Aware RAG Approach

cs.IR · 2026-05-11 · unverdicted · none · ref 24

The Override Gap: A Magnitude Account of Knowledge Conflict Failure in Hypernetwork-Based Instant LLM Adaptation

cs.LG · 2026-04-26 · conditional · none · ref 32 · 2 links

Beyond Math: Stories as a Testbed for Memorization-Constrained Reasoning in LLMs

cs.CL · 2024-12-18 · unverdicted · none · ref 6
