Title resolution pending

· 2018

2 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries the DOI and OpenAlex lookups on its next pass.

representative citing papers

Bridging Continuous-time LQR and Reinforcement Learning via Gradient Flow of the Bellman Error

eess.SY · 2025-06-11 · unverdicted · novelty 7.0

A gradient flow on a continuous-time Bellman error parametrized by feedback gain converges to the optimal LQR controller and stays inside the stabilizing region.

Data-Driven Continuous-Time Linear Quadratic Regulator via Closed-Loop and Reinforcement Learning Parameterizations

math.OC · 2026-04-30 · unverdicted · novelty 4.0

The authors adapt closed-loop and IRL parameterizations to continuous time, deriving policy iteration schemes, a data-driven CARE, convex reformulations, and a policy gradient flow while unifying the two approaches.

citing papers explorer

Showing 2 of 2 citing papers.

Bridging Continuous-time LQR and Reinforcement Learning via Gradient Flow of the Bellman Error

eess.SY · 2025-06-11 · unverdicted · none · ref 9

Data-Driven Continuous-Time Linear Quadratic Regulator via Closed-Loop and Reinforcement Learning Parameterizations

math.OC · 2026-04-30 · unverdicted · none · ref 1
