pith. sign in

A theory of state abstraction for reinforcement learning

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

fields

cs.LG 1 cs.LO 1

years

2026 2

verdicts

UNVERDICTED 2

representative citing papers

Bayesian updates from coalgebraic determinisation

cs.LO · 2026-06-24 · unverdicted · novelty 7.0

Unifilarisation of stochastic Mealy machines is an instance of coalgebraic determinisation over monads with support structure, producing causal stochastic behaviours rather than Moore-style output distributions.

Adaptive state-action abstractions via rate-distortion

cs.LG · 2026-06-04 · unverdicted · novelty 6.0

A rate-distortion based switching strategy for adaptive state-action abstractions in RL decomposes value error into Bellman residual and bisimulation metric terms to achieve near-optimal performance under lossy compression in tabular settings.

citing papers explorer

Showing 2 of 2 citing papers.

  • Bayesian updates from coalgebraic determinisation cs.LO · 2026-06-24 · unverdicted · none · ref 15

    Unifilarisation of stochastic Mealy machines is an instance of coalgebraic determinisation over monads with support structure, producing causal stochastic behaviours rather than Moore-style output distributions.

  • Adaptive state-action abstractions via rate-distortion cs.LG · 2026-06-04 · unverdicted · none · ref 3

    A rate-distortion based switching strategy for adaptive state-action abstractions in RL decomposes value error into Bellman residual and bisimulation metric terms to achieve near-optimal performance under lossy compression in tabular settings.