Maximum entropy reinforcement learning via energy-based normalizing flow

Chen-Hao Chao, Chien Feng, Wei-Fang Sun, Cheng-Kuang Lee, Simon See, Chun-Yi Lee · 2024 · arXiv 2405.13629

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

GenPO++: Generative Policy Optimization with Jacobian-free Likelihood Ratios

cs.LG · 2026-06-05 · unverdicted · novelty 6.0

GenPO++ achieves exact Jacobian-free likelihood ratio computation for generative flow policies by embedding history states as auxiliary memory in a high-order reversible ODE solver.

Frictional Q-Learning

cs.LG · 2025-09-24

citing papers explorer

Showing 1 of 1 citing paper after filters.

GenPO++: Generative Policy Optimization with Jacobian-free Likelihood Ratios cs.LG · 2026-06-05 · unverdicted · none · ref 5
GenPO++ achieves exact Jacobian-free likelihood ratio computation for generative flow policies by embedding history states as auxiliary memory in a high-order reversible ODE solver.

Maximum entropy reinforcement learning via energy-based normalizing flow

fields

years

verdicts

representative citing papers

citing papers explorer