Guidelines: • The answer NA means that the paper poses no such risks

Safeguards Question: Does the paper describe safeguards that have been put in place for responsible release of data or models that have a high risk for misuse (e

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

browse 3 citing papers

representative citing papers

Closed-Loop Vision-Language Planning for Multi-Agent Coordination

cs.AI · 2025-02-14 · unverdicted · novelty 7.0

COMPASS uses VLMs to generate and refine code-based strategies with structured communication, achieving 57% win rate on SMACv2 Protoss 5v5 versus 27% for QMIX.

Learning to Reason at the Frontier of Learnability

cs.LG · 2025-02-17 · unverdicted · novelty 4.0

A curriculum sampling questions with high variance in success rate improves reinforcement learning performance for LLM reasoning tasks.

Evaluating and Learning Robust Bandit Policies Under Uncertain Causal Mechanisms

cs.LG · 2025-08-04

citing papers explorer

Showing 3 of 3 citing papers.

Closed-Loop Vision-Language Planning for Multi-Agent Coordination cs.AI · 2025-02-14 · unverdicted · none · ref 54
COMPASS uses VLMs to generate and refine code-based strategies with structured communication, achieving 57% win rate on SMACv2 Protoss 5v5 versus 27% for QMIX.
Learning to Reason at the Frontier of Learnability cs.LG · 2025-02-17 · unverdicted · none · ref 84
A curriculum sampling questions with high variance in success rate improves reinforcement learning performance for LLM reasoning tasks.
Evaluating and Learning Robust Bandit Policies Under Uncertain Causal Mechanisms cs.LG · 2025-08-04 · unreviewed · ref 16

Guidelines: • The answer NA means that the paper poses no such risks

fields

years

verdicts

representative citing papers

citing papers explorer