METRA: Scalable unsupervised RL with metric-aware abstraction

Park, S · 2023 · arXiv 2310.08887

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Unifying Goal-Conditioned RL and Unsupervised Skill Learning via Control-Maximization

cs.LG · 2026-05-07 · unverdicted · novelty 8.0

GCRL and MISL are unified as control maximization, with three inequivalent GCRL formulations each matched to a MISL objective via bounds on goal-sensitivity.

Intention-Conditioned Flow Occupancy Models

cs.LG · 2025-06-10 · unverdicted · novelty 5.0

InFOM applies flow matching to model intention-conditioned occupancy measures for RL pre-training, reporting 1.8x median return gains and 36% higher success rates on benchmarks.

citing papers explorer

Showing 2 of 2 citing papers.

Unifying Goal-Conditioned RL and Unsupervised Skill Learning via Control-Maximization · cs.LG · 2026-05-07 · unverdicted · none · ref 19
Intention-Conditioned Flow Occupancy Models · cs.LG · 2025-06-10 · unverdicted · none · ref 86
