arXiv preprint arXiv:1804.09028
4 Pith papers cite this work.
Year: 2026 (4)
Verdict: unverdicted (4)
Citing papers
- Muon is Not That Special: Random or Inverted Spectra Work Just as Well
  Muon succeeds by guaranteeing local step-size optimality rather than by tracking any ideal global geometry, as random-spectrum and quasi-norm variants match its performance on language models.
- Letting the neural code speak: Automated characterization of monkey visual neurons through human language
  Natural-language descriptions generated and verified through generative models and digital twins capture the selectivity of most neurons in macaque V1 and V4.
- Liberata -- Graph Scientometrics for a Share Based System of Academic Publishing
  Liberata introduces a graph-based system using continuous contribution shares and weighted citations to derive metrics for impact, risk, collaboration, and quality control in academic publishing.
- Student Classroom Behavior Recognition Based on Improved YOLOv8s
  An updated YOLOv8s detector improves student behavior recognition in crowded classrooms by 1.8-2.1% mAP through added feature modules and a reweighted loss.