Addressing function approximation error in actor-critic methods

· 2018

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Reinforcement Learning with Reward Machines for Sleep Control in Mobile Networks

cs.LG · 2026-04-08 · unverdicted · novelty 5.0

Reinforcement learning with reward machines enables sleep control in mobile networks that accounts for history-dependent, time-averaged quality of service constraints.

Biologically Inspired Event-Based Perception and Sample-Efficient Learning for High-Speed Table Tennis Robots

cs.RO · 2026-04-06 · unverdicted · novelty 5.0

Event-based perception combined with progressive low-to-high speed training improves robotic table tennis return accuracy by 35.8% using the same number of training episodes.

citing papers explorer

Showing 2 of 2 citing papers.

Reinforcement Learning with Reward Machines for Sleep Control in Mobile Networks cs.LG · 2026-04-08 · unverdicted · none · ref 18
Reinforcement learning with reward machines enables sleep control in mobile networks that accounts for history-dependent, time-averaged quality of service constraints.
Biologically Inspired Event-Based Perception and Sample-Efficient Learning for High-Speed Table Tennis Robots cs.RO · 2026-04-06 · unverdicted · none · ref 46
Event-based perception combined with progressive low-to-high speed training improves robotic table tennis return accuracy by 35.8% using the same number of training episodes.

Addressing function approximation error in actor-critic methods

fields

years

verdicts

representative citing papers

citing papers explorer