Policy Gradient Adaptive Control for the

2025 · arXiv 2505.03706

5 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

A Data-Enabled Primal-Dual Approach for Policy Learning with SDP Formulations

eess.SY · 2026-07-01 · unverdicted · novelty 7.0

A primal-dual online framework updates policies from closed-loop data for SDP-based control synthesis in linear discrete-time systems, with local linear tracking and global ergodic convergence guarantees under persistency of excitation and slow data variation.

Direct Data-Driven Linear Quadratic Tracking via Policy Optimization

eess.SY · 2026-05-15 · unverdicted · novelty 7.0

A reference-decoupled reformulation makes direct data-driven LQT equivalent to certainty-equivalence solutions and supports convergent offline and online DeePO algorithms.

Global Convergence of Policy Gradient Methods for ReLU Controllers in Linear Quadratic Regulation

math.OC · 2026-04-24 · unverdicted · novelty 6.0

Model-based policy gradient converges globally to the optimal scalar LQR gain for discounted LQR using overparameterized ReLU networks by reducing the controller to two effective gains on positive and negative half-lines.

Sample-Efficient Model-Free Policy Gradient Methods for Stochastic LQR via Robust Linear Regression

eess.SY · 2025-12-03 · unverdicted · novelty 6.0

Primal-dual robust linear regression enables O(1/epsilon) sample complexity for model-free policy gradient methods on stochastic LQR.

Stability of Certainty-Equivalent Adaptive LQR for Linear Systems with Unknown Time-Varying Parameters

eess.SY · 2025-11-11 · unverdicted · novelty 5.0

LMS estimation paired with certainty-equivalent LQR delivers finite-gain ℓ²-stability for linear systems with unknown time-varying parameters and disturbances.

citing papers explorer

A Data-Enabled Primal-Dual Approach for Policy Learning with SDP Formulations · eess.SY · 2026-07-01 · unverdicted · none · ref 12
Direct Data-Driven Linear Quadratic Tracking via Policy Optimization · eess.SY · 2026-05-15 · unverdicted · none · ref 78
Global Convergence of Policy Gradient Methods for ReLU Controllers in Linear Quadratic Regulation · math.OC · 2026-04-24 · unverdicted · none · ref 7
Sample-Efficient Model-Free Policy Gradient Methods for Stochastic LQR via Robust Linear Regression · eess.SY · 2025-12-03 · unverdicted · none · ref 2
Stability of Certainty-Equivalent Adaptive LQR for Linear Systems with Unknown Time-Varying Parameters · eess.SY · 2025-11-11 · unverdicted · none · ref 5
