Proposes Adaptive Tail-Head Alignment (ATHA) that breaks alignment for low-similarity 'tail tokens' in CLIP to boost source-free cross-domain few-shot learning.
Contrastive localized language-image pre-training
3 Pith papers cite this work. Polarity classification is still indexing.
3
Pith papers citing it
years
2026 3representative citing papers
ROGLE introduces automated pseudo region-sentence pairs via RSM and multi-granular learning to boost fine-grained alignment in text-based person search, plus the P-VLG benchmark with over 100k annotated regions.
citing papers explorer
-
Improving CLIP Adaptation by Breaking Tail Alignment for Source-Free Cross-Domain Few-Shot Learning
Proposes Adaptive Tail-Head Alignment (ATHA) that breaks alignment for low-similarity 'tail tokens' in CLIP to boost source-free cross-domain few-shot learning.
-
ROGLE: Robust Global-Local Alignment with Automated Region Supervision for Text-Based Person Search
ROGLE introduces automated pseudo region-sentence pairs via RSM and multi-granular learning to boost fine-grained alignment in text-based person search, plus the P-VLG benchmark with over 100k annotated regions.