pith. sign in

Colorswap: A color and word order dataset for multimodal evaluation.arXiv preprint arXiv:2402.04492,

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

fields

cs.AI 1 cs.LG 1

years

2025 2

verdicts

UNVERDICTED 2

representative citing papers

Test-Time Matching: Unlocking Compositional Reasoning in Multimodal Models

cs.AI · 2025-10-09 · unverdicted · novelty 6.0

Introduces group matching score for better evaluation of compositional reasoning and Test-Time Matching (TTM) algorithm for unsupervised self-improvement in multimodal models, achieving SOTA gains including surpassing GPT-4.1 and estimated human performance.

citing papers explorer

Showing 2 of 2 citing papers.