
arxiv: 1707.09835 · v2 · submitted 2017-07-31 · 💻 cs.LG


Meta-SGD: Learning to Learn Quickly for Few-Shot Learning

classification 💻 cs.LG
keywords learning · meta-sgd · learn · meta-learner · few-shot · learner · compared · just

Few-shot learning is challenging for learning algorithms that learn each task in isolation and from scratch. In contrast, meta-learning learns from many related tasks a meta-learner that can learn a new task more accurately and faster with fewer examples, where the choice of meta-learners is crucial. In this paper, we develop Meta-SGD, an SGD-like, easily trainable meta-learner that can initialize and adapt any differentiable learner in just one step, on both supervised learning and reinforcement learning. Compared to the popular meta-learner LSTM, Meta-SGD is conceptually simpler, easier to implement, and can be learned more efficiently. Compared to the latest meta-learner MAML, Meta-SGD has a much higher capacity by learning to learn not just the learner initialization, but also the learner update direction and learning rate, all in a single meta-learning process. Meta-SGD shows highly competitive performance for few-shot learning on regression, classification, and reinforcement learning.
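The one-step adaptation described in the abstract can be illustrated with a minimal sketch. It assumes the update takes the form θ' = θ − α ∘ ∇L(θ), where the initialization θ and the per-parameter vector α (same shape as θ) are both meta-learned, so that α sets the step size and, through the signs and relative magnitudes of its entries, the update direction. The toy linear learner, the data points, the values of `theta` and `alpha`, and the function names are all hypothetical and chosen only for illustration. The outer meta-training loop that would learn θ and α across many tasks is omitted.

```python
from typing import Callable, List

Vector = List[float]


def meta_sgd_adapt(theta: Vector, alpha: Vector,
                   grad_fn: Callable[[Vector], Vector]) -> Vector:
    """One adaptation step: theta' = theta - alpha * grad, elementwise.

    alpha is a vector with one entry per parameter (assumed meta-learned),
    unlike a single scalar learning rate shared by all parameters.
    """
    grad = grad_fn(theta)
    return [t - a * g for t, a, g in zip(theta, alpha, grad)]


def mse(theta: Vector, xs: List[float], ys: List[float]) -> float:
    """Mean squared error of the toy learner y = w * x + b."""
    w, b = theta
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / len(xs)


def make_mse_grad(xs: List[float], ys: List[float]) -> Callable[[Vector], Vector]:
    """Return a function computing the analytic gradient of mse w.r.t. (w, b)."""
    n = len(xs)

    def grad_fn(theta: Vector) -> Vector:
        w, b = theta
        residuals = [w * x + b - y for x, y in zip(xs, ys)]
        dw = 2.0 * sum(r * x for r, x in zip(residuals, xs)) / n
        db = 2.0 * sum(residuals) / n
        return [dw, db]

    return grad_fn


# Hypothetical few-shot task: three training points drawn from y = 2x + 1.
xs = [0.0, 1.0, 2.0]
ys = [1.0, 3.0, 5.0]

theta = [0.0, 0.0]   # stand-in for a meta-learned initialization
alpha = [0.1, 0.3]   # stand-in for meta-learned per-parameter rates

adapted = meta_sgd_adapt(theta, alpha, make_mse_grad(xs, ys))
loss_before = mse(theta, xs, ys)
loss_after = mse(adapted, xs, ys)
```

In this sketch, fixing every entry of `alpha` to the same constant would reduce the step to plain gradient descent from a learned initialization, which is the sense in which the abstract contrasts Meta-SGD with MAML.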



Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Binomial Gradient-Based Meta-Learning for Enhanced Meta-Gradient Estimation

    cs.LG · 2026-04 · unverdicted · novelty 6.0

    BinomMAML uses a binomial expansion to estimate meta-gradients more accurately than prior approximations, with error bounds that improve on existing methods and decay super-exponentially under mild conditions.

  2. AHC: Meta-Learned Adaptive Compression for Continual Object Detection on Memory-Constrained Microcontrollers

    cs.AI · 2026-02 · unverdicted · novelty 6.0

    AHC applies meta-learned hierarchical compression with dual memory banks to enable continual object detection on MCUs under a 100KB budget, backed by a forgetting bound of O(ε√T + 1/√M) and competitive results on CORe...

  3. Frequency-Enhanced Dual-Subspace Networks for Few-Shot Fine-Grained Image Classification

    cs.CV · 2026-04 · unverdicted · novelty 5.0

    FEDSNet improves few-shot fine-grained image classification by fusing spatial texture and frequency-based structural subspaces to reduce noise overfitting.