A majorization-minimization framework turns IRT into scalable matrix factorization subproblems for LLM evaluation, delivering orders-of-magnitude speedups with identifiability guarantees.
Latency-response theory model: Evaluating large language models via response accuracy and chain-of-thought length
1 Pith paper cites this work. Polarity classification is still indexing.

Fields: stat.ML
Years: 2026
Verdicts: unverdicted
Representative citing papers:
- An Interpretable and Scalable Framework for Evaluating Large Language Models
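To make the TL;DR concrete, here is a minimal illustrative sketch of the general idea, not the paper's actual algorithm: a Rasch-style IRT model of a model-by-item correctness matrix fit by majorization-minimization, where Böhning's quadratic bound on the Bernoulli log-likelihood turns each step into a closed-form least-squares update (the matrix-factorization-style subproblem the summary alludes to). The data are synthetic and every name below is an assumption.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic correctness matrix Y: rows = LLMs, columns = benchmark items,
# generated from a Rasch (1-parameter logistic IRT) model.
n_models, n_items = 40, 60
theta_true = rng.normal(size=n_models)   # latent model abilities
beta_true = rng.normal(size=n_items)     # latent item difficulties

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

P = sigmoid(theta_true[:, None] - beta_true[None, :])
Y = (rng.random((n_models, n_items)) < P).astype(float)

# Majorization-minimization: Bohning's bound says the Bernoulli
# log-likelihood's curvature is at most 1/4, so each MM step minimizes a
# quadratic surrogate -- a least-squares problem with closed-form
# row/column updates, like one sweep of an alternating matrix factorization.
theta = np.zeros(n_models)
beta = np.zeros(n_items)
for _ in range(300):
    Z = theta[:, None] - beta[None, :]
    W = Z + 4.0 * (Y - sigmoid(Z))            # surrogate "working response"
    theta = (W + beta[None, :]).mean(axis=1)  # row update (closed form)
    beta = (theta[:, None] - W).mean(axis=0)  # column update (closed form)
    shift = beta.mean()                       # gauge-fix the shared shift:
    beta -= shift                             # theta - beta is unchanged,
    theta -= shift                            # pinning down the solution

# Recovered abilities should track the true ones closely.
corr = np.corrcoef(theta, theta_true)[0, 1]
print(f"ability correlation: {corr:.2f}")
```

The gauge fix at the end of each sweep is one simple way to handle the identifiability issue the TL;DR mentions: the likelihood depends only on theta - beta, so a shared constant must be pinned down.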