Enc-bench: A benchmark for evaluating multimodal large language models in electronic navigational chart understanding.arXiv preprint arXiv:2603.22763, 2026

Ao Cheng, Xingming Li, Xuanyu Ji, Xixiang He, Qiyao Sun, Chunping Qiu, Runke Huang, Qingyong Hu · 2026 · arXiv 2603.22763

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

representative citing papers

StemBind: When MLLMs Get Lost Between Rules and Instances in Abstract Visual Reasoning

cs.CV · 2026-05-29 · unverdicted · novelty 7.0

StemBind benchmark diagnoses MLLM failures in abstract visual reasoning by separating perception, rule induction, and answer selection on shared stems, finding a persistent rule-to-instance binding gap even when perception and rule are correct.

citing papers explorer

Showing 1 of 1 citing paper.

StemBind: When MLLMs Get Lost Between Rules and Instances in Abstract Visual Reasoning cs.CV · 2026-05-29 · unverdicted · none · ref 9
StemBind benchmark diagnoses MLLM failures in abstract visual reasoning by separating perception, rule induction, and answer selection on shared stems, finding a persistent rule-to-instance binding gap even when perception and rule are correct.

Enc-bench: A benchmark for evaluating multimodal large language models in electronic navigational chart understanding.arXiv preprint arXiv:2603.22763, 2026

fields

years

verdicts

representative citing papers

citing papers explorer