Title resolution pending

Alexander Sasha Vezhnevets, Simon Osindero, Tom Schaul, Nicolas Heess, Max Jaderberg, David Silver, Koray Kavukcuoglu · 2017

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Moira: Language-driven Hierarchical Reinforcement Learning for Pair Trading

cs.AI · 2026-05-03 · unverdicted · novelty 7.0

Moira parameterizes hierarchical RL policies for pair trading with LLMs and adapts them via prompt updates based on trajectory and episode feedback, outperforming baselines on real market data.

Distill-Belief: Closed-Loop Inverse Source Localization and Characterization in Physical Fields

cs.AI · 2026-04-28 · unverdicted · novelty 6.0

Distill-Belief distills Bayesian information-gain signals from a particle-filter teacher into a compact student policy for fast closed-loop source localization and parameter estimation while avoiding reward hacking.

citing papers explorer

Showing 2 of 2 citing papers.

Moira: Language-driven Hierarchical Reinforcement Learning for Pair Trading cs.AI · 2026-05-03 · unverdicted · none · ref 47
Moira parameterizes hierarchical RL policies for pair trading with LLMs and adapts them via prompt updates based on trajectory and episode feedback, outperforming baselines on real market data.
Distill-Belief: Closed-Loop Inverse Source Localization and Characterization in Physical Fields cs.AI · 2026-04-28 · unverdicted · none · ref 78
Distill-Belief distills Bayesian information-gain signals from a particle-filter teacher into a compact student policy for fast closed-loop source localization and parameter estimation while avoiding reward hacking.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer