arXiv preprint arXiv:2601.22510 , year=

Shattered Compositionality: Counterintuitive Learning Dynamics of Transformers for Arithmetic , author= · arXiv 2601.22510

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

representative citing papers

OpenSafeIntent: Evaluating Intent-Calibrated Safe Completion Across Dual-Use Prompt Sets

cs.CL · 2026-07-02 · unverdicted · novelty 7.0

OpenSafeIntent benchmark shows models fail to calibrate safety across intent shifts in matched dual-use prompts, indicating current evaluations are insufficient.

citing papers explorer

Showing 1 of 1 citing paper after filters.

OpenSafeIntent: Evaluating Intent-Calibrated Safe Completion Across Dual-Use Prompt Sets cs.CL · 2026-07-02 · unverdicted · none · ref 41
OpenSafeIntent benchmark shows models fail to calibrate safety across intent shifts in matched dual-use prompts, indicating current evaluations are insufficient.

arXiv preprint arXiv:2601.22510 , year=

fields

years

verdicts

representative citing papers

citing papers explorer