Path Integral Policy Improvement with Covariance Matrix Adaptation

· 2012 · cs.LG · arXiv 1206.4621

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

There has been a recent focus in reinforcement learning on addressing continuous state and action problems by optimizing parameterized policies. PI2 is a recent example of this approach. It combines a derivation from first principles of stochastic optimal control with tools from statistical estimation theory. In this paper, we consider PI2 as a member of the wider family of methods which share the concept of probability-weighted averaging to iteratively update parameters to optimize a cost function. We compare PI2 to other members of the same family - Cross-Entropy Methods and CMAES - at the conceptual level and in terms of performance. The comparison suggests the derivation of a novel algorithm which we call PI2-CMA for "Path Integral Policy Improvement with Covariance Matrix Adaptation". PI2-CMA's main advantage is that it determines the magnitude of the exploration noise automatically.

representative citing papers

Industrial Dual-Arm Box Handling via Online Inertial Estimation and Convex Wrench Optimization

cs.RO · 2026-05-21 · unverdicted · novelty 5.0

A dual-arm robot framework performs online inertial estimation from contact wrenches and uses SOCP under ellipsoidal friction constraints to lift boxes with unknown properties while maintaining stable contact.

citing papers explorer

Showing 1 of 1 citing paper.

Industrial Dual-Arm Box Handling via Online Inertial Estimation and Convex Wrench Optimization cs.RO · 2026-05-21 · unverdicted · none · ref 24 · internal anchor
A dual-arm robot framework performs online inertial estimation from contact wrenches and uses SOCP under ellipsoidal friction constraints to lift boxes with unknown properties while maintaining stable contact.

Path Integral Policy Improvement with Covariance Matrix Adaptation

fields

years

verdicts

representative citing papers

citing papers explorer