pith. sign in

hub Canonical reference

Improved Regularization of Convolutional Neural Networks with Cutout

Canonical reference. 88% of citing Pith papers cite this work as background.

55 Pith papers citing it
Background 88% of classified citations
abstract

Convolutional neural networks are capable of learning powerful representational spaces, which are necessary for tackling complex learning tasks. However, due to the model capacity required to capture such representations, they are often susceptible to overfitting and therefore require proper regularization in order to generalize well. In this paper, we show that the simple regularization technique of randomly masking out square regions of input during training, which we call cutout, can be used to improve the robustness and overall performance of convolutional neural networks. Not only is this method extremely easy to implement, but we also demonstrate that it can be used in conjunction with existing forms of data augmentation and other regularizers to further improve model performance. We evaluate this method by applying it to current state-of-the-art architectures on the CIFAR-10, CIFAR-100, and SVHN datasets, yielding new state-of-the-art results of 2.56%, 15.20%, and 1.30% test error respectively. Code is available at https://github.com/uoguelph-mlrg/Cutout

hub tools

citation-role summary

background 7 method 1

citation-polarity summary

representative citing papers

Navigating Potholes with Geometry-Aware Sharpness Minimization

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

LLQR+SAM pairs a slow learned geometry preconditioner with fast SAM perturbations to amplify escape from locally sharp 'potholes' while stabilizing flat basins, producing consistent gains over SAM and LLQR alone.

Layerwise LQR for Geometry-Aware Optimization of Deep Networks

cs.LG · 2026-05-05 · unverdicted · novelty 7.0

Steepest descent under divergence-induced quadratic models equals an LQR problem, enabling learning of diagonal or Kronecker-factored inverse preconditioners via a global layerwise objective for scalable geometry-aware training.

Point Cloud Sequence Encoding for Material-conditioned Graph Network Simulators

cs.LG · 2026-05-20 · unverdicted · novelty 6.0

PEACH uses a novel spatio-temporal point cloud sequence encoder plus auxiliary supervision to enable zero-shot adaptation of graph network simulators to unseen physical properties, outperforming mesh-based baselines in simulation accuracy while being more deployable for real scenes.

citing papers explorer

Showing 50 of 55 citing papers.