Title resolution pending

Li, B · 2025 · arXiv 2503.02854

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

WMF-AM: Probing LLM Working Memory via Depth-Parameterized Cumulative State Tracking

cs.AI · 2026-03-28 · unverdicted · novelty 7.0

WMF-AM is a depth-parameterized benchmark that measures LLMs' cumulative state tracking ability without scratchpads, validated on 28 models across arithmetic and non-arithmetic tasks with ablations confirming the construct.

Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility

cs.CL · 2025-07-16 · unverdicted · novelty 5.0

Language models encode modal categories via linear difference vectors in their activations that predict fine-grained human plausibility judgments better than prior reports suggested.

citing papers explorer

Showing 2 of 2 citing papers.

WMF-AM: Probing LLM Working Memory via Depth-Parameterized Cumulative State Tracking cs.AI · 2026-03-28 · unverdicted · none · ref 35
WMF-AM is a depth-parameterized benchmark that measures LLMs' cumulative state tracking ability without scratchpads, validated on 28 models across arithmetic and non-arithmetic tasks with ablations confirming the construct.
Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility cs.CL · 2025-07-16 · unverdicted · none · ref 8
Language models encode modal categories via linear difference vectors in their activations that predict fine-grained human plausibility judgments better than prior reports suggested.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer