Convergence and sample complexity of policy gradient methods for stabilizing linear systems

· 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Multitask LQG Control: Performance and Generalization Bounds

math.OC · 2026-04-17 · unverdicted · novelty 5.0

Multitask LQG control via history-dependent lifting to LQR yields generalization bounds tied to bisimulation heterogeneity and reduces policy gradient variance proportionally to the number of training tasks.

citing papers explorer

Showing 1 of 1 citing paper.

Multitask LQG Control: Performance and Generalization Bounds math.OC · 2026-04-17 · unverdicted · none · ref 34
Multitask LQG control via history-dependent lifting to LQR yields generalization bounds tied to bisimulation heterogeneity and reduces policy gradient variance proportionally to the number of training tasks.

Convergence and sample complexity of policy gradient methods for stabilizing linear systems

fields

years

verdicts

representative citing papers

citing papers explorer