Riemannian Adaptive Optimization Methods

Absil, P · 2018 · cs.LG · arXiv 1810.00760

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

open full Pith review browse 8 citing papers arXiv PDF

abstract

Several first order stochastic optimization methods commonly used in the Euclidean domain such as stochastic gradient descent (SGD), accelerated gradient descent or variance reduced methods have already been adapted to certain Riemannian settings. However, some of the most popular of these optimization tools - namely Adam , Adagrad and the more recent Amsgrad - remain to be generalized to Riemannian manifolds. We discuss the difficulty of generalizing such adaptive schemes to the most agnostic Riemannian setting, and then provide algorithms and convergence proofs for geodesically convex objectives in the particular case of a product of Riemannian manifolds, in which adaptivity is implemented across manifolds in the cartesian product. Our generalization is tight in the sense that choosing the Euclidean space as Riemannian manifold yields the same algorithms and regret bounds as those that were already known for the standard algorithms. Experimentally, we show faster convergence and to a lower train loss value for Riemannian adaptive methods over their corresponding baselines on the realistic task of embedding the WordNet taxonomy in the Poincare ball.

citation-role summary

method 2

citation-polarity summary

use method 2

representative citing papers

Dead-Direction Conditioners: Gauge-Equivariant Preconditioning for Deep Networks

cs.LG · 2026-06-28 · unverdicted · novelty 7.0

Dead-Direction Conditioners provide gauge-equivariant preconditioning by conditioning optimizer state on symmetry orbits, yielding improved resistance to over-training collapse and higher detection of dead directions compared to AdamW and Muon.

Riemannian Stochastic Optimization for Sufficient Dimension Reduction

stat.ML · 2026-05-29 · unverdicted · novelty 7.0

SMAVE recasts MAVE for SDR as Riemannian optimization on the Stiefel manifold, yielding a stochastic algorithm with almost-sure convergence and improved runtime over OPG and RMAVE.

Learning Variable-Length Tokenization for Generative Recommendation

cs.LG · 2026-05-18 · unverdicted · novelty 7.0

VarLenRec learns variable-length semantic IDs for generative recommendation by allocating longer codes to tail items via popularity-weighted information budget allocation, hyperbolic residual quantization, and a differentiable soft length controller.

LA-Sign: Looped Transformers with Geometry-aware Alignment for Skeleton-based Sign Language Recognition

cs.CV · 2026-03-30 · unverdicted · novelty 7.0

LA-Sign achieves state-of-the-art skeleton-based sign language recognition on WLASL and MSASL by using recurrent looped transformers with adaptive hyperbolic geometry alignment.

New non-Euclidean neural quantum states from additional types of hyperbolic recurrent neural networks

quant-ph · 2026-04-27 · unverdicted · novelty 7.0

Hyperbolic RNN and GRU neural quantum states outperform Euclidean versions on Heisenberg J1J2 and J1J2J3 models with 100 spins.

Deflation-Free Optimal Scoring

stat.ML · 2026-04-28 · unverdicted · novelty 6.0

DFSOS computes all sparse discriminant vectors at once with global orthogonality via Bregman iteration and augmented Lagrangian, achieving classification accuracy comparable to or better than deflation-based sparse optimal scoring on synthetic and real time series data.

Pion: A Spectrum-Preserving Optimizer via Orthogonal Equivalence Transformation

cs.LG · 2026-05-12 · unverdicted · novelty 5.0

Pion is an optimizer that preserves the singular values of weight matrices in LLM training by applying orthogonal equivalence transformations.

Inversion-Free Natural Gradient Descent on Riemannian Manifolds

stat.ML · 2026-04-03

citing papers explorer

Showing 8 of 8 citing papers.

Dead-Direction Conditioners: Gauge-Equivariant Preconditioning for Deep Networks cs.LG · 2026-06-28 · unverdicted · none · ref 4 · internal anchor
Dead-Direction Conditioners provide gauge-equivariant preconditioning by conditioning optimizer state on symmetry orbits, yielding improved resistance to over-training collapse and higher detection of dead directions compared to AdamW and Muon.
Riemannian Stochastic Optimization for Sufficient Dimension Reduction stat.ML · 2026-05-29 · unverdicted · none · ref 125 · internal anchor
SMAVE recasts MAVE for SDR as Riemannian optimization on the Stiefel manifold, yielding a stochastic algorithm with almost-sure convergence and improved runtime over OPG and RMAVE.
Learning Variable-Length Tokenization for Generative Recommendation cs.LG · 2026-05-18 · unverdicted · none · ref 3 · internal anchor
VarLenRec learns variable-length semantic IDs for generative recommendation by allocating longer codes to tail items via popularity-weighted information budget allocation, hyperbolic residual quantization, and a differentiable soft length controller.
LA-Sign: Looped Transformers with Geometry-aware Alignment for Skeleton-based Sign Language Recognition cs.CV · 2026-03-30 · unverdicted · none · ref 3 · internal anchor
LA-Sign achieves state-of-the-art skeleton-based sign language recognition on WLASL and MSASL by using recurrent looped transformers with adaptive hyperbolic geometry alignment.
New non-Euclidean neural quantum states from additional types of hyperbolic recurrent neural networks quant-ph · 2026-04-27 · unverdicted · none · ref 37
Hyperbolic RNN and GRU neural quantum states outperform Euclidean versions on Heisenberg J1J2 and J1J2J3 models with 100 spins.
Deflation-Free Optimal Scoring stat.ML · 2026-04-28 · unverdicted · none · ref 3
DFSOS computes all sparse discriminant vectors at once with global orthogonality via Bregman iteration and augmented Lagrangian, achieving classification accuracy comparable to or better than deflation-based sparse optimal scoring on synthetic and real time series data.
Pion: A Spectrum-Preserving Optimizer via Orthogonal Equivalence Transformation cs.LG · 2026-05-12 · unverdicted · none · ref 6
Pion is an optimizer that preserves the singular values of weight matrices in LLM training by applying orthogonal equivalence transformations.
Inversion-Free Natural Gradient Descent on Riemannian Manifolds stat.ML · 2026-04-03 · unreviewed · ref 1

Riemannian Adaptive Optimization Methods

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer