Learning multiple layers of features from tiny images

Alex Krizhevsky, Geoffrey Hinton, et al · 2009

12 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning

cs.AI · 2023-06-05 · conditional · novelty 8.0

LIBERO is a new benchmark for lifelong robot learning that evaluates transfer of declarative, procedural, and mixed knowledge across 130 manipulation tasks with provided demonstration data.

Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow

cs.LG · 2022-09-07 · unverdicted · novelty 8.0

Rectified flow learns straight-path neural ODEs for distribution transport, yielding efficient generative models and domain transfers that work well even with a single simulation step.

Backdoor Channels Hidden in Latent Space: Cryptographic Undetectability in Modern Neural Networks

cs.CR · 2026-05-13 · unverdicted · novelty 7.0

Backdoors can be realized as statistically natural latent directions in modern neural networks, achieving high attack success with negligible clean accuracy loss and resisting existing defenses.

Classification Fields: Arbitrarily Fine Recursive Hierarchical Clustering From Few Examples

stat.ML · 2026-05-08 · unverdicted · novelty 7.0

Classification fields are infinite recursive hierarchical cluster structures generated by a local refinement rule, and a ReLU network predictor learned from finite prefixes can approximate the generator and extend it to deeper levels with exponential convergence in the completed cell metric.

DSBD: Dual-Aligned Structural Basis Distillation for Graph Domain Adaptation

cs.LG · 2026-04-03 · unverdicted · novelty 7.0

DSBD distills a dual-aligned structural basis to adapt GNNs across graphs with structural distribution shifts, outperforming prior methods on benchmarks.

Understanding Generalization through Decision Pattern Shift

cs.LG · 2026-05-13 · unverdicted · novelty 6.0

DPS quantifies deviation of per-sample decision patterns from class averages and shows linear correlation with generalization gaps while unifying degradation scenarios into a continuous trajectory.

Learning to Perceive "Where": Spatial Pretext Tasks for Robust Self-Supervised Learning

cs.CV · 2026-05-11 · unverdicted · novelty 6.0

The Spatial Prediction pretext task learns spatial structure in self-supervised learning by regressing relative position and scale between image views, yielding more structured representations and better generalization.

When Losses Align: Gradient-Based Composite Loss Weighting for Efficient Pretraining

cs.LG · 2026-05-08 · unverdicted · novelty 6.0

A bilevel method learns composite pretraining loss weights online via gradient alignment with a downstream objective, matching tuned baselines at roughly 30% extra cost over one training run.

Robust stochastic first order methods in heavy-tailed noise via medoid mini-batch gradient sampling

math.OC · 2026-05-08 · unverdicted · novelty 6.0

R-SGD-Mini achieves O(1/T) convergence of expected squared gradient norm to a noise-dependent neighborhood in heavy-tailed settings by selecting the medoid gradient from M data chunks.

Hierarchical Dual-Subspace Decoupling for Continual Learning in Vision-Language Models

cs.CV · 2026-05-08 · unverdicted · novelty 6.0

HDSD decouples parameter subspaces in vision-language models via a Feature Modulation Module, General Fusion Module with adaptive thresholds, and Hierarchical Learning Module with SVD scaling to minimize cross-task interference and achieve state-of-the-art class-incremental learning performance.

A Composite Activation Function for Learning Stable Binary Representations

cs.LG · 2026-05-12 · unverdicted · novelty 5.0

HTAF is a sigmoid-tanh composite that approximates the Heaviside function to allow stable gradient training of binary activation networks, yielding ICBMs with stable discretization and competitive performance on image tasks.

TINS: Test-time ID-prototype-separated Negative Semantics Learning for OOD Detection

cs.CV · 2026-05-11 · unverdicted · novelty 5.0

TINS improves OOD detection by learning negative semantics at test time with ID-prototype separation, cutting average FPR95 from 14.04% to 6.72% on the Four-OOD benchmark with ImageNet-1K.

citing papers explorer

Showing 12 of 12 citing papers.

LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot Learning · cs.AI · 2023-06-05 · conditional · none · ref 34
Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow · cs.LG · 2022-09-07 · unverdicted · none · ref 36
Backdoor Channels Hidden in Latent Space: Cryptographic Undetectability in Modern Neural Networks · cs.CR · 2026-05-13 · unverdicted · none · ref 21
Classification Fields: Arbitrarily Fine Recursive Hierarchical Clustering From Few Examples · stat.ML · 2026-05-08 · unverdicted · none · ref 23
DSBD: Dual-Aligned Structural Basis Distillation for Graph Domain Adaptation · cs.LG · 2026-04-03 · unverdicted · none · ref 21
Understanding Generalization through Decision Pattern Shift · cs.LG · 2026-05-13 · unverdicted · none · ref 20
Learning to Perceive "Where": Spatial Pretext Tasks for Robust Self-Supervised Learning · cs.CV · 2026-05-11 · unverdicted · none · ref 1
When Losses Align: Gradient-Based Composite Loss Weighting for Efficient Pretraining · cs.LG · 2026-05-08 · unverdicted · none · ref 19
Robust stochastic first order methods in heavy-tailed noise via medoid mini-batch gradient sampling · math.OC · 2026-05-08 · unverdicted · none · ref 15
Hierarchical Dual-Subspace Decoupling for Continual Learning in Vision-Language Models · cs.CV · 2026-05-08 · unverdicted · none · ref 27
A Composite Activation Function for Learning Stable Binary Representations · cs.LG · 2026-05-12 · unverdicted · none · ref 36
TINS: Test-time ID-prototype-separated Negative Semantics Learning for OOD Detection · cs.CV · 2026-05-11 · unverdicted · none · ref 27