RLMM decouples person-level choice sensitivity from task-level value functions via a parametric RL model with Boltzmann choice and MAP estimation, outperforming tabular MDP-MM in simulations and linking person parameters to performance in real gameplay data.
Behavior Research Methods, year =
1 Pith paper cites this work. Polarity classification is still indexing.
fields: stat.ME (1)
years: 2026 (1)
verdicts: UNVERDICTED (1)
representative citing papers
Reinforcement Learning Measurement Model
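The summary names two ingredients, a Boltzmann choice rule and MAP estimation, without giving their form. The following is a minimal sketch of how such a pairing can look, not the RLMM parameterization itself. It assumes a single person-level sensitivity parameter `beta` (an inverse temperature), fixed task-level action values supplied by the caller, a normal prior on `beta`, and a plain grid search in place of a real optimizer. All function names, prior settings, and trial values are hypothetical.

```python
import math


def boltzmann_probs(q_values, beta):
    """Boltzmann (softmax) choice probabilities for one decision.

    q_values: task-level action values (assumed given here).
    beta: person-level choice sensitivity; beta = 0 yields uniform choice.
    """
    m = max(q_values)  # subtract the max for numerical stability
    weights = [math.exp(beta * (q - m)) for q in q_values]
    total = sum(weights)
    return [w / total for w in weights]


def log_posterior(beta, trials, prior_mean=1.0, prior_sd=1.0):
    """Unnormalized log-posterior of beta.

    trials: list of (q_values, chosen_action_index) pairs.
    The normal prior is an illustrative assumption, not taken from the paper.
    """
    log_lik = sum(math.log(boltzmann_probs(q, beta)[a]) for q, a in trials)
    log_prior = -0.5 * ((beta - prior_mean) / prior_sd) ** 2
    return log_lik + log_prior


def map_beta(trials, lo=0.0, hi=10.0, steps=1001):
    """MAP estimate of beta by grid search over [lo, hi]."""
    grid = [lo + (hi - lo) * i / (steps - 1) for i in range(steps)]
    return max(grid, key=lambda b: log_posterior(b, trials))


# Hypothetical data: three two-option trials with the chosen option's index.
trials = [([1.0, 0.0], 0), ([0.2, 0.8], 1), ([0.5, 0.4], 1)]
beta_hat = map_beta(trials)
print(f"MAP estimate of beta: {beta_hat:.2f}")
```

Keeping `q_values` as an input and `beta` as the only estimated quantity mirrors the separation the summary describes between task-level value functions and person-level sensitivity. A full model would also estimate or learn the values themselves.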