Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks , volume=

· 2022 · DOI 10.1145/3510413

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

representative citing papers

HORST: Composing Optimizer Geometries for Sparse Transformer Training

cs.LG · 2026-05-20 · unverdicted · novelty 6.0

HORST uses non-commutative operator composition and a hyperbolic mirror map to combine stability from adaptive optimizers with L1 sparsity bias, outperforming AdamW across sparsity levels on vision and language tasks.

A solution to generalized learning from small training sets found in infant repeated visual experiences of individual objects

cs.CV · 2025-10-16 · unverdicted · novelty 6.0

Infant daily visual experiences of objects are dominated by repeated instances of few exemplars in lumpy similarity clusters, enabling category generalization from small training sets in computational models.

Recurrent Transformer-Based Near- and Far-Field THz Wideband Channel Estimation for UM-MIMO

eess.SP · 2026-05-12 · unverdicted · novelty 5.0

A single recurrent transformer block trained once delivers 5 dB and 7.5 dB NMSE gains over prior methods for narrowband and wideband hybrid near-far field THz UM-MIMO channel estimation.

citing papers explorer

Showing 3 of 3 citing papers.

HORST: Composing Optimizer Geometries for Sparse Transformer Training cs.LG · 2026-05-20 · unverdicted · none · ref 9
HORST uses non-commutative operator composition and a hyperbolic mirror map to combine stability from adaptive optimizers with L1 sparsity bias, outperforming AdamW across sparsity levels on vision and language tasks.
A solution to generalized learning from small training sets found in infant repeated visual experiences of individual objects cs.CV · 2025-10-16 · unverdicted · none · ref 62
Infant daily visual experiences of objects are dominated by repeated instances of few exemplars in lumpy similarity clusters, enabling category generalization from small training sets in computational models.
Recurrent Transformer-Based Near- and Far-Field THz Wideband Channel Estimation for UM-MIMO eess.SP · 2026-05-12 · unverdicted · none · ref 42
A single recurrent transformer block trained once delivers 5 dB and 7.5 dB NMSE gains over prior methods for narrowband and wideband hybrid near-far field THz UM-MIMO channel estimation.

Avoiding Overfitting: A Survey on Regularization Methods for Convolutional Neural Networks , volume=

fields

years

verdicts

representative citing papers

citing papers explorer