Then, (24) implies that $d\lambda^{-1}\nu\,(\mu + \kappa^{2}\gamma^{-1}\eta D^{2}) \le \frac{\epsilon^{2}}{8\,\Phi^{-1}(1-\delta)^{2}}$

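First, note that $\exp(-\tilde{\gamma}\rho) \lesssim \frac{1}{T^{2}}$, $\zeta \lesssim \frac{1}{\sqrt{T}}$, $\eta\lambda^{-1} \lesssim 1$, and therefore $C_{1} \lesssim \frac{1}{\sqrt{T}}$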


representative citing papers

Rate-Optimal Regret for the Safe Learning-based Control of the Constrained Linear Quadratic Regulator

math.OC · 2026-04-24 · unverdicted · novelty 8.0

An algorithm for constrained stochastic LQR achieves $\tilde{O}(\sqrt{T})$ regret and chance-constraint satisfaction via SDP-based optimistic policies scaled for safety.
