Anderson Acceleration for Reinforcement Learning

· 2018 · cs.LG · arXiv 1809.09501

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Anderson acceleration is an old and simple method for accelerating the computation of a fixed point. However, as far as we know and quite surprisingly, it has never been applied to dynamic programming or reinforcement learning. In this paper, we explain briefly what Anderson acceleration is and how it can be applied to value iteration, this being supported by preliminary experiments showing a significant speed up of convergence, that we critically discuss. We also discuss how this idea could be applied more generally to (deep) reinforcement learning.

representative citing papers

Fault Tolerance of Accelerated Asynchronous Fixed-Point Iterations on Flexible Computing Infrastructure

cs.DC · 2026-05-27 · unverdicted · novelty 6.0

Asynchronous execution yields 2.9x-16.9x speedups across Jacobi, value iteration, and SCF methods; Anderson acceleration succeeds only under evaluation-level perturbation, not iterate-level corruption.

citing papers explorer

Showing 1 of 1 citing paper.

Fault Tolerance of Accelerated Asynchronous Fixed-Point Iterations on Flexible Computing Infrastructure cs.DC · 2026-05-27 · unverdicted · none · ref 18 · internal anchor
Asynchronous execution yields 2.9x-16.9x speedups across Jacobi, value iteration, and SCF methods; Anderson acceleration succeeds only under evaluation-level perturbation, not iterate-level corruption.

Anderson Acceleration for Reinforcement Learning

fields

years

verdicts

representative citing papers

citing papers explorer