Specialist agents in an autonomous research loop with lineage feedback improve ML training recipes, delivering 0.81% better validation bpb on Parameter Golf, 38.7% higher CORE on NanoChat-D12, and 4.59% lower wallclock on CIFAR-10 Airbench96 across 1797 trials with no human intervention after setup.
AAAI Conference on Artificial Intelligence , volume=
2 Pith papers cite this work. Polarity classification is still indexing.
years
2026 2verdicts
UNVERDICTED 2representative citing papers
Sparse MoE vision models show positive accuracy gaps only when routing a substantial compute fraction ρ and using k≥2 experts at large scale; batch-axis dispatch is identified as a key failure mode.
citing papers explorer
-
Auto Research with Specialist Agents Develops Effective and Non-Trivial Training Recipes
Specialist agents in an autonomous research loop with lineage feedback improve ML training recipes, delivering 0.81% better validation bpb on Parameter Golf, 38.7% higher CORE on NanoChat-D12, and 4.59% lower wallclock on CIFAR-10 Airbench96 across 1797 trials with no human intervention after setup.
-
When Does Sparse MoE Help in Vision? The Role of Backbone Compute Leverage in Sparse Routing
Sparse MoE vision models show positive accuracy gaps only when routing a substantial compute fraction ρ and using k≥2 experts at large scale; batch-axis dispatch is identified as a key failure mode.