pith. sign in

Grains: Gradient-based attribution for inference-time steering of llms and vlms.CoRR, abs/2507.18043, 2025a

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

citation-role summary

dataset 1

citation-polarity summary

fields

cs.CL 2 cs.LG 1

years

2026 3

verdicts

UNVERDICTED 3

roles

dataset 1

polarities

use dataset 1

representative citing papers

Continuous Interpretive Steering for Scalar Diversity

cs.CL · 2026-04-08 · unverdicted · novelty 6.0

Continuous Interpretive Steering and the GraSD dataset reveal that LLMs encode graded sensitivity to scalar diversity in their internal representations, recoverable via controlled activation interventions.

citing papers explorer

Showing 3 of 3 citing papers.