Popcorn: Partially observed prediction constrained reinforcement learning

Joseph Futoma, Michael C Hughes, Finale Doshi-Velez · 2020 · arXiv 2001.04032

3 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background: 1

citation-polarity summary

background: 1

representative citing papers

An adaptive variance estimator for relative sparsity

stat.ME · 2026-05-04 · unverdicted · novelty 6.0

A new adaptive variance estimator for relative sparsity coefficients is introduced that fully utilizes the prior asymptotic normality theorem and incorporates variable selection effects.

VentAgent: When LLMs Learn to Breathe -- Multi-Objective Arbitration for ARDS Ventilation

cs.LG · 2026-06-03 · unverdicted · novelty 5.0

VentAgent uses LLMs in a three-stage Perception-Planning-Orchestration hierarchy to perform multi-objective arbitration for mechanical ventilation in ARDS, outperforming RL baselines on a simulator while producing human-readable reasoning.

Treatment, evidence, imitation, and chat

stat.OT · 2025-06-29 · unverdicted · novelty 4.0

LLMs cannot solve the medical treatment problem through imitation alone because it requires evidence from experiments or observations, posing ethical challenges for training such systems.

citing papers explorer

An adaptive variance estimator for relative sparsity

stat.ME · 2026-05-04 · unverdicted · none · ref 205

VentAgent: When LLMs Learn to Breathe -- Multi-Objective Arbitration for ARDS Ventilation

cs.LG · 2026-06-03 · unverdicted · none · ref 11

Treatment, evidence, imitation, and chat

stat.OT · 2025-06-29 · unverdicted · none · ref 29
