Proximal Policy Optimization Algorithms

2017

2 Pith papers cite this work. Polarity classification is still being indexed.

representative citing papers

Sim-to-Real Transfer and Robustness Evaluation of Reinforcement Learning Control with Integrated Perception on an ASV for Floating Waste Capture

cs.RO · 2026-05-04 · unverdicted · novelty 5.0

A DRL controller for ASV floating waste capture, trained in simulation with a perception abstraction module, achieves centimeter-level accuracy in real-world field experiments across 14 disturbance regimes.

Empirical Evaluation of Policy-Based Reinforcement Learning for Dynamic Service Control in an M/M/1 Queue

math.OC · 2026-04-15 · unverdicted · novelty 3.0

REINFORCE, A2C, and PPO are compared for service rate control in an M/M/1 queue modeled as an SMDP, using queue length states and assessing convergence and regret.

citing papers explorer

Showing 2 of 2 citing papers.

Sim-to-Real Transfer and Robustness Evaluation of Reinforcement Learning Control with Integrated Perception on an ASV for Floating Waste Capture

cs.RO · 2026-05-04 · unverdicted · none · ref 57

A DRL controller for ASV floating waste capture, trained in simulation with a perception abstraction module, achieves centimeter-level accuracy in real-world field experiments across 14 disturbance regimes.

Empirical Evaluation of Policy-Based Reinforcement Learning for Dynamic Service Control in an M/M/1 Queue

math.OC · 2026-04-15 · unverdicted · none · ref 10

REINFORCE, A2C, and PPO are compared for service rate control in an M/M/1 queue modeled as an SMDP, using queue length states and assessing convergence and regret.
