Latent prototype routing: Achieving near-perfect load balancing in mixture-of-experts

3 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

fields

cs.LG 2 · cs.DC 1

years

2026 3

verdicts

UNVERDICTED 3

polarities

background 1

representative citing papers

$\phi$-Balancing for Mixture-of-Experts Training

cs.LG · 2026-05-14 · unverdicted · novelty 7.0

φ-balancing is a convex optimization method for population-level expert balance in MoE training that derives an online EMA adjustment and outperforms heuristic baselines.

citing papers explorer

Showing 3 of 3 citing papers.