3 Pith papers cite this work. Representative citing papers (2026):
-
Plasticity-Enhanced Multi-Agent Mixture of Experts for Dynamic Objective Adaptation in UAVs-Assisted Emergency Communication Networks
PE-MAMoE combines sparsely gated mixture-of-experts actors with a non-parametric phase controller in MAPPO to maintain plasticity under dynamic user mobility and traffic, yielding 26.3% higher normalized IQM return in simulations.
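The "normalized IQM return" in this summary refers to the interquartile mean, a robust aggregate over training runs that discards the top and bottom quartiles before averaging. A minimal sketch (the per-seed return values below are hypothetical, chosen only to illustrate the computation):

```python
import numpy as np

def iqm(scores):
    """Interquartile mean: mean of the values between the 25th and 75th
    percentiles, i.e. with the bottom and top quartiles dropped."""
    s = np.sort(np.asarray(scores, dtype=float))
    n = len(s)
    lo, hi = n // 4, n - n // 4  # indices bounding the middle 50%
    return s[lo:hi].mean()

# hypothetical normalized returns from 8 seeds
runs = [0.2, 0.5, 0.55, 0.6, 0.62, 0.7, 0.72, 0.95]
score = iqm(runs)  # robust to the outlier seeds at 0.2 and 0.95
```

Compared with the plain mean, the IQM is far less sensitive to a single diverged or lucky seed, which is why it is a common choice for reporting RL results.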
-
Long-Horizon Q-Learning: Accurate Value Learning via n-Step Inequalities
LQL turns n-step action-sequence lower bounds into a practical hinge-loss stabilizer for off-policy Q-learning without extra networks or forward passes.
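The idea of using n-step returns as lower bounds can be sketched as a hinge penalty: an observed n-step return underestimates the optimal action value, so the learned Q should not fall below it. This is an illustrative sketch of that principle, not the paper's exact objective; the function and variable names are hypothetical:

```python
import numpy as np

def lower_bound_hinge(q_values, n_step_returns):
    """Squared hinge penalty enforcing Q(s, a) >= G_n, treating each
    observed n-step return G_n as a lower bound on the optimal value.
    Both arrays have shape (batch,)."""
    violation = np.maximum(0.0, n_step_returns - q_values)  # > 0 where the bound is violated
    return np.mean(violation ** 2)

# hypothetical batch: the 2nd estimate already exceeds its bound, so only
# the 1st and 3rd contribute to the penalty
penalty = lower_bound_hinge(np.array([1.0, 3.0, 0.5]),
                            np.array([2.0, 2.5, 1.5]))
```

Because the bound is computed from returns already stored in the replay buffer, a term like this adds no extra networks or forward passes on top of the usual TD loss, which matches the efficiency claim in the summary.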
-
How to Do Statistical Evaluations in ECE/CS Papers: A Practical Playbook for Defensible Results
A tutorial playbook that organizes statistical evaluation into a workflow of claim, hypothesis, unit of analysis, baselines, sweeps, uncertainty, validation, and reporting, illustrated with Python code and a job-scheduling example.
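The "uncertainty" step of a workflow like this is often a bootstrap confidence interval over per-run results. A minimal sketch, assuming hypothetical per-seed speedups of a job scheduler over a baseline (the data and helper name are illustrative, not from the paper):

```python
import numpy as np

rng = np.random.default_rng(0)

def bootstrap_ci(samples, n_boot=10_000, alpha=0.05):
    """Percentile bootstrap confidence interval for the mean of `samples`."""
    samples = np.asarray(samples, dtype=float)
    means = np.array([
        rng.choice(samples, size=len(samples), replace=True).mean()
        for _ in range(n_boot)
    ])
    return np.quantile(means, [alpha / 2, 1 - alpha / 2])

# hypothetical per-seed speedups vs. baseline (1.0 = no improvement)
speedups = [1.04, 1.11, 0.98, 1.07, 1.15, 1.02, 1.09, 1.05]
lo, hi = bootstrap_ci(speedups)
```

If the resulting interval excludes 1.0, the claimed improvement survives the uncertainty check; reporting the interval rather than a bare mean is exactly the kind of defensible result the playbook argues for.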