Billion-scale semi-supervised learning for image classification

I. Zeki Yalniz , Herv\'e J\'egou , Kan Chen , Manohar Paluri , Dhruv Mahajan

Authors on Pith no claims yet

classification 💻 cs.CV

keywords classificationimagelearningsemi-supervisedapproachbillionimageslarge

read the original abstract

This paper presents a study of semi-supervised learning with large convolutional networks. We propose a pipeline, based on a teacher/student paradigm, that leverages a large collection of unlabelled images (up to 1 billion). Our main goal is to improve the performance for a given target architecture, like ResNet-50 or ResNext. We provide an extensive analysis of the success factors of our approach, which leads us to formulate some recommendations to produce high-accuracy models for image classification with semi-supervised learning. As a result, our approach brings important gains to standard architectures for image, video and fine-grained classification. For instance, by leveraging one billion unlabelled images, our learned vanilla ResNet-50 achieves 81.2% top-1 accuracy on the ImageNet benchmark.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI
cs.CV 2021-09 accept novelty 8.0

HM3D offers 1000 building-scale 3D environments that are larger and higher-fidelity than existing datasets, enabling better-performing embodied AI agents for tasks like PointGoal navigation.
Vision Transformers Need Registers
cs.CV 2023-09 unverdicted novelty 6.0

Adding register tokens to Vision Transformers eliminates high-norm background artifacts and raises state-of-the-art performance on dense visual prediction tasks.
DINOv2: Learning Robust Visual Features without Supervision
cs.CV 2023-04 unverdicted novelty 5.0

Pith review generated a malformed one-line summary.
SatReg: Regression-based Neural Architecture Search for Lightweight Satellite Image Segmentation
cs.CV 2026-04 unverdicted novelty 4.0

SatReg uses regression surrogates on two width variables from CM-UNet students to select near-optimal lightweight segmentation architectures for edge satellite deployment without exhaustive search.