pith. sign in

Mechanistic? arXiv preprint arXiv:2410.09087

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

years

2026 6 2025 2

verdicts

UNVERDICTED 8

clear filters

representative citing papers

Radical AI Interpretability

cs.AI · 2026-06-25 · unverdicted · novelty 6.0

A framework is proposed for solving for an AI system's beliefs and desires from its computational facts, with criteria for success tied to interpretability tests and emphasis on holistic attribution.

Mechanistic Interpretability Needs Philosophy

cs.CL · 2025-06-23 · unverdicted · novelty 4.0

The paper claims that mechanistic interpretability needs philosophy as a partner to clarify concepts, refine methods, and navigate epistemic and ethical complexities in AI systems.

citing papers explorer

Showing 1 of 1 citing paper after filters.