cc/paper_files/paper/1991/file/ff4d5fbbafdf976cfdc032e3bde78de5-Paper
1 Pith paper cites this work. Polarity classification is still being indexed.
Fields: cs.AI (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
Domain Restriction via Multi SAE Layer Transitions
Multi-layer SAE transitions capture domain-specific signatures that distinguish OOD texts in Gemma-2 models.