Ciresan and Ueli Meier and J

Dan C · 2012 · arXiv 2012.624811

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Repetition Mismatch: Why Data Mixture Experiments Don't Scale and How to Fix Them

cs.LG · 2026-05-29 · conditional · novelty 7.0

Repetition rate mismatch between small-scale proxies and target budgets is the main reason data mixture experiments do not scale; a subsampling procedure that equalizes repetition rates recovers optimal mixtures from 1/16-scale experiments.

sGPO: Trading Inference FLOPs for Training Efficiency in RLVR

cs.LG · 2026-06-07 · unverdicted · novelty 6.0

sGPO uses an initial-policy success-rate profiling pass to adaptively set rollout group sizes, filter data, and build a curriculum, cutting total RLVR training compute by 3x while matching baseline performance.

citing papers explorer

Repetition Mismatch: Why Data Mixture Experiments Don't Scale and How to Fix Them · cs.LG · 2026-05-29 · conditional · none · ref 28
sGPO: Trading Inference FLOPs for Training Efficiency in RLVR · cs.LG · 2026-06-07 · unverdicted · none · ref 175
