pith. sign in

Tips: Text-image pretraining with spatial awareness.arXiv:2410.16512

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

citation-role summary

background 1 dataset 1

citation-polarity summary

years

2026 3 2025 2

representative citing papers

Multilingual Vision-Language Models, A Survey

cs.CL · 2025-09-26 · accept · novelty 3.0

The survey identifies a key tension in multilingual vision-language models between language neutrality via contrastive learning and cultural awareness via diverse data, with most benchmarks relying on translation-based evaluation.

citing papers explorer

Showing 5 of 5 citing papers.