CoRR , volume =

· 1909 · arXiv 1909.12434

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

representative citing papers

Don't Start What You Can't Finish: A Counterfactual Audit of Support-State Triage in LLM Agents

cs.AI · 2026-04-17 · unverdicted · novelty 7.0

LLM agents overcommit on non-complete tasks at 41.7% unless given explicit support-state categories, which raise typed deferral accuracy to 91.7%.

Faithfulness Serum: Mitigating the Faithfulness Gap in Textual Explanations of LLM Decisions via Attribution Guidance

cs.CL · 2026-04-15 · unverdicted · novelty 6.0

A training-free method improves epistemic faithfulness of LLM textual explanations by guiding generation with attribution-based attention interventions.

Aligning AI With Shared Human Values

cs.CY · 2020-08-05 · conditional · novelty 6.0

Introduces ETHICS benchmark showing current language models have promising but incomplete ability to predict basic human ethical judgments on text scenarios.

Medical Model Synthesis Architectures: A Case Study

cs.AI · 2026-05-10 · unverdicted · novelty 5.0

MedMSA framework retrieves knowledge via language models then builds formal probabilistic models to produce uncertainty-weighted differential diagnoses from symptoms.

Case-Grounded Evidence Verification: A Framework for Constructing Evidence-Sensitive Supervision

cs.CL · 2026-04-10 · unverdicted · novelty 5.0

A supervision construction procedure generates explicit support and controlled non-support examples (counterfactual and topic-related negatives) without manual annotation, producing verifiers that demonstrate genuine evidence dependence in radiology tasks.

citing papers explorer

Showing 5 of 5 citing papers.

Don't Start What You Can't Finish: A Counterfactual Audit of Support-State Triage in LLM Agents cs.AI · 2026-04-17 · unverdicted · none · ref 19
LLM agents overcommit on non-complete tasks at 41.7% unless given explicit support-state categories, which raise typed deferral accuracy to 91.7%.
Faithfulness Serum: Mitigating the Faithfulness Gap in Textual Explanations of LLM Decisions via Attribution Guidance cs.CL · 2026-04-15 · unverdicted · none · ref 4
A training-free method improves epistemic faithfulness of LLM textual explanations by guiding generation with attribution-based attention interventions.
Aligning AI With Shared Human Values cs.CY · 2020-08-05 · conditional · none · ref 15
Introduces ETHICS benchmark showing current language models have promising but incomplete ability to predict basic human ethical judgments on text scenarios.
Medical Model Synthesis Architectures: A Case Study cs.AI · 2026-05-10 · unverdicted · none · ref 183
MedMSA framework retrieves knowledge via language models then builds formal probabilistic models to produce uncertainty-weighted differential diagnoses from symptoms.
Case-Grounded Evidence Verification: A Framework for Constructing Evidence-Sensitive Supervision cs.CL · 2026-04-10 · unverdicted · none · ref 35
A supervision construction procedure generates explicit support and controlled non-support examples (counterfactual and topic-related negatives) without manual annotation, producing verifiers that demonstrate genuine evidence dependence in radiology tasks.

CoRR , volume =

fields

years

verdicts

representative citing papers

citing papers explorer