pith. sign in

arXiv preprint arXiv:2410.17245 , year=

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

years

2026 3

representative citing papers

Detecting and Controlling Sycophancy with Cascading Linear Features

cs.AI · 2026-06-23 · conditional · novelty 6.0

Cascading linear features extracted from graded sycophancy samples form separable subspaces that enable detection, scoring, and steering of sycophantic behavior in LLMs, matching or exceeding LLM-judge and prompting baselines.

citing papers explorer

Showing 3 of 3 citing papers.