Review history

arxiv: 2606.24155 · 2 revisions

MedBench v5: A Dynamic, Process-Oriented, and Hallucination-Aware Benchmark for Clinical Multimodal Models

  1. 2026-06-26 UNVERDICTED LOW v0.9.1-grok novelty 5.0
    18668 ms 5794 in 1010 out 2026-06-26T05:40:27.169694+00:00
  2. 2026-06-26 UNVERDICTED LOW v0.9.1-grok novelty 5.0
    18500 ms 5791 in 1455 out 2026-06-26T00:38:49.976144+00:00