pith. sign in

arXiv preprint arXiv:1902.09229 , year=

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it
abstract

Recent empirical works have successfully used unlabeled data to learn feature representations that are broadly useful in downstream classification tasks. Several of these methods are reminiscent of the well-known word2vec embedding algorithm: leveraging availability of pairs of semantically "similar" data points and "negative samples," the learner forces the inner product of representations of similar pairs with each other to be higher on average than with negative samples. The current paper uses the term contrastive learning for such algorithms and presents a theoretical framework for analyzing them by introducing latent classes and hypothesizing that semantically similar points are sampled from the same latent class. This framework allows us to show provable guarantees on the performance of the learned representations on the average classification task that is comprised of a subset of the same set of latent classes. Our generalization bound also shows that learned representations can reduce (labeled) sample complexity on downstream tasks. We conduct controlled experiments in both the text and image domains to support the theory.

citation-role summary

background 1

citation-polarity summary

roles

background 1

polarities

background 1

representative citing papers

The Platonic Representation Hypothesis

cs.LG · 2024-05-13 · unverdicted · novelty 5.0

Representations learned by large AI models are converging toward a shared statistical model of reality.

On the Power of Foundation Models

cs.AI · 2022-11-29 · unverdicted · novelty 5.0

Category theory proves prompt-based learning on perfect foundation models works only for representable tasks, fine-tuning solves tasks in the pretext category, and models can represent unseen target-category objects using source-category structure.

citing papers explorer

Showing 6 of 6 citing papers.

  • Optimal Representations for Generalized Contrastive Learning with Imbalanced Datasets cs.LG · 2026-05-11 · unverdicted · none · ref 47

    In generalized contrastive learning with imbalanced classes, optimal representations collapse to class means whose angular geometry is determined by class proportions via convex optimization, and extreme imbalance causes all minority classes to collapse to one vector.

  • Optimal Representation Size: High-Dimensional Analysis of Pretraining and Linear Probing cs.LG · 2026-05-19 · unverdicted · none · ref 58 · internal anchor

    In high-dimensional analysis, pretrained PCA representations for linear probing generalize best at low dimensionality when pretraining data is plentiful but labeled data scarce, with an exact trade-off showing how much unlabeled data replaces one labeled sample.

  • Revisiting Feature Prediction for Learning Visual Representations from Video cs.CV · 2024-02-15 · conditional · none · ref 7

    V-JEPA models trained only on feature prediction from 2 million public videos achieve 81.9% on Kinetics-400, 72.2% on Something-Something-v2, and 77.9% on ImageNet-1K using frozen ViT-H/16 backbones.

  • The Platonic Representation Hypothesis cs.LG · 2024-05-13 · unverdicted · none · ref 210 · internal anchor

    Representations learned by large AI models are converging toward a shared statistical model of reality.

  • On the Power of Foundation Models cs.AI · 2022-11-29 · unverdicted · none · ref 6 · internal anchor

    Category theory proves prompt-based learning on perfect foundation models works only for representable tasks, fine-tuning solves tasks in the pretext category, and models can represent unseen target-category objects using source-category structure.

  • Supervise Thyself: Examining Self-Supervised Representations in Interactive Environments cs.LG · 2019-06-27 · unverdicted · none · ref 5 · internal anchor

    Empirical comparison finds that self-supervised representations vary in capturing agent state and generalizing to new levels or textures depending on environment visuals and dynamics.