IFBench: Granular Instruction-Following Evaluation

· 2025 · arXiv 2503.07879

2 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

DataComp-VLM: Improved Open Datasets for Vision-Language Models

cs.CV · 2026-06-26 · conditional · novelty 8.0 · 2 refs

DataComp-VLM benchmark shows instruction-heavy data mixing outperforms filtering for VLM training, with DCVLM-Baseline achieving 63.6% on 33 tasks for 8B models (+5.4pp over FineVision).

Brick: Spatial Capability Routing for the Mixture-of-Models (MoM) Paradigm

cs.AI · 2026-06-11 · unverdicted · novelty 4.0

Brick routes queries to LLMs using capability scores and difficulty estimates, reaching 76.98% accuracy at max-quality and 4.71x lower cost at neutral profile on 5,504 queries versus always using the strongest model.

citing papers explorer

Showing 2 of 2 citing papers.

DataComp-VLM: Improved Open Datasets for Vision-Language Models
cs.CV · 2026-06-26 · conditional · none · ref 69 · 2 links
Brick: Spatial Capability Routing for the Mixture-of-Models (MoM) Paradigm
cs.AI · 2026-06-11 · unverdicted · none · ref 5
