arXiv preprint arXiv:2512.22712 , year =

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Safety is Contextual, LLM-Judges Are Not: Navigating the Rigid Priors of Evaluators

cs.AI · 2026-06-05 · unverdicted · novelty 5.0

LLM safety judges resist adjusting evaluations when given contradictory context or new safety definitions, despite some ability to learn from new information.

Improving Cross-Lingual Factual Recall via Consistency-Driven Reinforcement Learning

cs.CL · 2026-06-04 · unverdicted · novelty 5.0

GRPO reinforcement learning on the new PolyFact dataset outperforms SFT and CPT for cross-lingual factual consistency in Qwen-2.5-7B and OLMo-2-7B by reducing language specialization in MLP and attention layers.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Safety is Contextual, LLM-Judges Are Not: Navigating the Rigid Priors of Evaluators cs.AI · 2026-06-05 · unverdicted · none · ref 46
LLM safety judges resist adjusting evaluations when given contradictory context or new safety definitions, despite some ability to learn from new information.

arXiv preprint arXiv:2512.22712 , year =

fields

years

verdicts

representative citing papers

citing papers explorer