2020.9304386

URLhttps://doi · 2020 · DOI 10.1109/cdc42340

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Preference-Based Reward Learning under Partial Observability with Inexact Dynamics

math.OC · 2026-06-29 · unverdicted · novelty 6.0

Establishes stability of belief filters to model error in log-linear and neural-softmax POMDPs under mixing conditions and derives finite-sample guarantees for preference-based reward learning that decouple statistical error from model-mismatch bias.

A Bias-Corrected Weighted Logistic Model for Gene Regulatory Networks: Functional Equivalence with the Product-of-Logistics and Comparison with Weighted-Sum Formulations

math.DS · 2026-06-03 · unverdicted · novelty 6.0

Introduces a bias-corrected weighted (bcw) single-sigmoid model for GRNs that recovers the product-of-logistics critical value 1/2^{m_i} and the shared equilibrium, with Jacobian and stability comparisons.

Permissive Safety Through Trusted Inference: Verifiable Belief-Space Neural Safety Filters for Assured Interactive Robotics

cs.RO · 2026-06-01 · unverdicted · novelty 6.0

A conformal-prediction certification for belief-space safety filters focuses verification on regions where inference is reliable, producing filters that are less conservative than standard baselines yet safe with high probability in human-vehicle simulations.

citing papers explorer

Preference-Based Reward Learning under Partial Observability with Inexact Dynamics math.OC · 2026-06-29 · unverdicted · none · ref 15
