Title resolution pending

ISSN 0041-5553 · 2023 · DOI 10.1016/0041-5553(63)90382-3

4 Pith papers cite this work. Polarity classification is still indexing.


Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Transformers Provably Learn Sparse XOR with Polylogarithmic Parameters

cs.LG · 2025-02-11 · unverdicted · novelty 7.0

Single-layer two-head Transformers learn sparse XOR with O(polylog(d)) parameters in one gradient step, breaking the Omega(d) parameter bottleneck of FFNNs.

Adapt or Forget: Provable Tradeoffs Between Adam and SGD in Nonstationary Optimization

stat.ML · 2026-05-05 · unverdicted · novelty 6.0

Adam's adaptive preconditioning and first-moment averaging improve high-probability tracking error in noise-dominated nonstationary regimes but can increase it under strong drift, where SGD achieves a smaller floor, with explicit beta-dependent bounds.

On the Connectedness of Sublevel Sets in Invex Optimization

math.OC · 2026-04-13 · unverdicted · novelty 6.0

Sublevel sets of invex functions are connected under mild assumptions, with the result extended to solution sets in invex-incave minimax problems and incave games.

Metriplectic relaxation to equilibria

math-ph · 2025-06-11 · unverdicted · novelty 5.0

Metriplectic systems converge to entropy extrema at fixed Hamiltonian under stated conditions; a Landau-inspired class reduces the check to two simpler conditions for use in equilibrium relaxation schemes.

citing papers explorer


Transformers Provably Learn Sparse XOR with Polylogarithmic Parameters
cs.LG · 2025-02-11 · unverdicted · none · ref 16
Adapt or Forget: Provable Tradeoffs Between Adam and SGD in Nonstationary Optimization
stat.ML · 2026-05-05 · unverdicted · none · ref 72
On the Connectedness of Sublevel Sets in Invex Optimization
math.OC · 2026-04-13 · unverdicted · none · ref 35
Metriplectic relaxation to equilibria
math-ph · 2025-06-11 · unverdicted · none · ref 86
