A Simple Framework for Contrastive Learning of Visual Representations

Ting Chen, Simon Kornblith, Mohammad Norouzi, Geoffrey Hinton

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Grounding Hierarchical Vision-Language-Action Models Through Explicit Language-Action Alignment

cs.RO · 2026-04-07 · unverdicted · novelty 6.0

A contrastive alignment model plus offline preference learning explicitly grounds hierarchical VLA language descriptions to actions and visuals on LanguageTable, achieving performance comparable to fully supervised fine-tuning while reducing annotation needs.

Rethinking Transfer Learning for Industrial Inspection: DINOv3 vs. ImageNet Pretraining Across RGB and X-ray Tasks

cs.CV · 2026-05-22 · unverdicted · novelty 4.0

DINOv3 pretraining yields no frozen advantage and underperforms ImageNet on X-ray but improves convergence and final performance after full finetuning on RGB industrial inspection tasks.

citing papers explorer

Showing 2 of 2 citing papers.

Grounding Hierarchical Vision-Language-Action Models Through Explicit Language-Action Alignment
cs.RO · 2026-04-07 · unverdicted · none · ref 10
Rethinking Transfer Learning for Industrial Inspection: DINOv3 vs. ImageNet Pretraining Across RGB and X-ray Tasks
cs.CV · 2026-05-22 · unverdicted · none · ref 9
