Title resolution pending

2012

3 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Bridging Continuous-time LQR and Reinforcement Learning via Gradient Flow of the Bellman Error

eess.SY · 2025-06-11 · unverdicted · novelty 7.0

A gradient flow on a continuous-time Bellman error parametrized by feedback gain converges to the optimal LQR controller and stays inside the stabilizing region.

Data-driven Linear Quadratic Integral Control: A Convex Formulation and Policy Gradient Approach

eess.SY · 2026-04-16 · unverdicted · novelty 5.0

A convex data-driven formulation yields the optimal LQI feedback gain for continuous-time systems directly from measured data without system matrices.

Data-Driven Continuous-Time Linear Quadratic Regulator via Closed-Loop and Reinforcement Learning Parameterizations

math.OC · 2026-04-30 · unverdicted · novelty 4.0

The authors adapt closed-loop and IRL parameterizations to continuous time, deriving policy iteration schemes, a data-driven continuous-time algebraic Riccati equation (CARE), convex reformulations, and a policy gradient flow, and unify the two approaches.

citing papers explorer

Showing 3 of 3 citing papers.

Bridging Continuous-time LQR and Reinforcement Learning via Gradient Flow of the Bellman Error

eess.SY · 2025-06-11 · unverdicted · none · ref 28

Data-driven Linear Quadratic Integral Control: A Convex Formulation and Policy Gradient Approach

eess.SY · 2026-04-16 · unverdicted · none · ref 26

Data-Driven Continuous-Time Linear Quadratic Regulator via Closed-Loop and Reinforcement Learning Parameterizations

math.OC · 2026-04-30 · unverdicted · none · ref 47
