A framework that builds tractable structured Hessian approximations by averaging over user-chosen weight-space symmetry groups, recovering Shampoo-like estimates for one choice of group.
Yurii Nesterov et al.Lectures on convex optimization, vol- ume 137
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
citation-role summary
background 1