Review history

arxiv: 2605.07647 · 2 revisions

Quality-Conditioned Agreement in Automated Short Answer Scoring: Mid-Range Degradation and the Impact of Task-Specific Adaptation

  1. 2026-06-30 UNVERDICTED LOW v0.9.1-grok novelty 4.0
    31714 ms 5805 in 1281 out 2026-06-30T23:16:53.801737+00:00
  2. 2026-05-11 UNVERDICTED LOW v0.9.0 novelty 4.0
    32751 ms 5574 in 1216 out 2026-05-11T01:49:18.189353+00:00