Active Learning for Convolutional Neural Networks: A Core-Set Approach

Ozan Sener , Silvio Savarese

Authors on Pith no claims yet

classification 📊 stat.ML cs.CVcs.LG

keywords learningactivelargesubsetveryappliedapproachchoosing

read the original abstract

Convolutional neural networks (CNNs) have been successfully applied to many recognition and learning tasks using a universal recipe; training a deep model on a very large dataset of supervised examples. However, this approach is rather restrictive in practice since collecting a large set of labeled images is very expensive. One way to ease this problem is coming up with smart ways for choosing images to be labelled from a very large collection (ie. active learning). Our empirical study suggests that many of the active learning heuristics in the literature are not effective when applied to CNNs in batch setting. Inspired by these limitations, we define the problem of active learning as core-set selection, ie. choosing set of points such that a model learned over the selected subset is competitive for the remaining data points. We further present a theoretical result characterizing the performance of any selected subset using the geometry of the datapoints. As an active learning algorithm, we choose the subset which is expected to yield best result according to our characterization. Our experiments show that the proposed method significantly outperforms existing approaches in image classification experiments by a large margin.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 22 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization
cs.LG 2026-05 unverdicted novelty 7.0

MASS-DPO derives a Plackett-Luce-specific log-determinant Fisher information objective to select non-redundant negative samples, matching or exceeding multi-negative DPO performance with substantially fewer negatives ...
Clip-level Uncertainty and Temporal-aware Active Learning for End-to-End Multi-Object Tracking
cs.CV 2026-05 unverdicted novelty 7.0

CUTAL scores multi-frame clips for uncertainty and enforces temporal diversity to train transformer MOT models to near full-supervision performance with 50% of the labels.
ContextualJailbreak: Evolutionary Red-Teaming via Simulated Conversational Priming
cs.CL 2026-05 unverdicted novelty 7.0

ContextualJailbreak uses evolutionary search over simulated primed dialogues with novel mutations to reach 90-100% attack success on open LLMs and transfers to some closed frontier models at 15-90% rates.
Dynamic Class-Aware Active Learning for Unbiased Satellite Image Segmentation
cs.CV 2026-04 unverdicted novelty 7.0

DCAU-AL is a new active learning strategy that dynamically weights samples by real-time class-wise segmentation performance gaps to improve per-class accuracy under imbalance in satellite imagery.
Positive-First Most Ambiguous: A Simple Active Learning Criterion for Interactive Retrieval of Rare Categories
cs.CV 2026-03 unverdicted novelty 7.0

PF-MA is a new active learning rule that favors likely-positive uncertain samples to speed up discovery of rare categories in imbalanced visual retrieval.
LiBaGS: Lightweight Boundary Gap Synthesis for Targeted Synthetic Data Selection
cs.LG 2026-05 unverdicted novelty 6.0

LiBaGS scores and selects synthetic data near decision boundaries using proximity, uncertainty, density, and validity, with boundary-gap allocation and marginal stopping to improve training accuracy.
Active Testing of Large Language Models via Approximate Neyman Allocation
cs.AI 2026-05 unverdicted novelty 6.0

Active testing via surrogate semantic entropy stratification and approximate Neyman allocation reduces MSE by up to 28% versus uniform sampling and saves about 23% of the labeling budget on language and multimodal benchmarks.
Gradient-Discrepancy Acquisition for Pool-Based Active Learning
cs.LG 2026-05 unverdicted novelty 6.0

A new gradient-discrepancy acquisition function derived from a generalization bound enables more effective pool-based active learning by selecting informative samples.
Boundary-Centric Active Learning for Temporal Action Segmentation
cs.CV 2026-04 unverdicted novelty 6.0

B-ACT improves label efficiency in temporal action segmentation by selecting only boundary frames for annotation via a two-stage uncertainty-driven process that fuses neighborhood uncertainty, class ambiguity, and tem...
Select Smarter, Not More: Prompt-Aware Evaluation Scheduling with Submodular Guarantees
cs.AI 2026-04 unverdicted novelty 6.0

POES frames prompt evaluation as online adaptive testing and uses a provably submodular objective to pick informative examples, delivering 6.2% higher average accuracy and 35-60% token savings versus naive full-set scoring.
ADAM: A Systematic Data Extraction Attack on Agent Memory via Adaptive Querying
cs.CR 2026-04 unverdicted novelty 6.0

ADAM extracts data from LLM agent memory with up to 100% attack success rate by estimating data distribution and selecting queries via entropy guidance.
Scaling-Aware Data Selection for End-to-End Autonomous Driving Systems
cs.LG 2026-04 unverdicted novelty 6.0

MOSAIC is a scaling-aware data selection framework that outperforms baselines in training end-to-end autonomous driving planners, achieving comparable or better EPDMS scores with up to 80% less data.
Are Candidate Models Really Needed for Active Learning?
cs.CV 2026-05 unverdicted novelty 5.0

Active learning with randomly initialized models achieves comparable results to traditional candidate-model methods, with low-confidence sampling proving most effective.
LiBaGS: Lightweight Boundary Gap Synthesis for Targeted Synthetic Data Selection
cs.LG 2026-05 unverdicted novelty 5.0

LiBaGS is a lightweight method that picks synthetic data near decision boundaries while checking density and validity to improve training accuracy over standard oversampling or uncertainty sampling.
Portable Active Learning for Object Detection
cs.CV 2026-05 unverdicted novelty 5.0

PAL is a portable active learning method for object detection that uses class-specific logistic classifiers for uncertainty and image-level diversity to select annotation batches, showing better label efficiency than ...
Evidence-based Decision Modeling for Synthetic Face Detection with Uncertainty-driven Active Learning
cs.CV 2026-05 unverdicted novelty 5.0

EMSFD models synthetic face detection via Dirichlet evidence and uncertainty-driven active learning, reporting 15% higher accuracy than prior state-of-the-art methods while improving reliability on out-of-distribution images.
Evidence-based Decision Modeling for Synthetic Face Detection with Uncertainty-driven Active Learning
cs.CV 2026-05 unverdicted novelty 5.0

EMSFD uses Dirichlet-based evidence modeling to capture prediction uncertainty in synthetic face detection and applies uncertainty-driven active learning to achieve 15% higher accuracy than prior methods.
Uncertainty-Guided Edge Learning for Deep Image Regression in Remote Sensing
cs.CV 2026-05 unverdicted novelty 5.0

UGEL employs deep beta regression to estimate uncertainty in one forward pass, enabling faster convergence in edge learning for remote sensing image regression than active or semi-supervised baselines.
Selective Prediction from Agreement: A Lipschitz-Consistent Version Space Approach
cs.LG 2026-05 unverdicted novelty 5.0

Selective prediction abstains unless all Lipschitz-consistent heads in the version space agree on a certified label for each pool point.
When Active Learning Falls Short: An Empirical Study on Chemical Reaction Extraction
cs.LG 2026-04 unverdicted novelty 5.0

Active learning for chemical reaction extraction frequently produces non-monotonic learning curves and fails to deliver stable gains over random sampling because of strong pretraining, structured CRF decoding, and lab...
Neural Operator Representation of Granular Micromechanics-based Failure Envelope
physics.comp-ph 2026-04 unverdicted novelty 5.0

A differentiable neural operator learns the mapping from granular microstructure configurations to failure envelopes, with physics-informed convexity enforcement and active learning for efficient training.
Labeled TrustSet Guided: Batch Active Learning with Reinforcement Learning
cs.LG 2026-04 unverdicted novelty 5.0

BRAL-T uses TrustSet-guided reinforcement learning for batch active learning and reports state-of-the-art results on 10 image classification benchmarks plus 2 fine-tuning tasks.