$\beta^3$-IRT: A New Item Response Model and its Applications

Peter Flach; Ricardo B. C. Prud\^encio; Telmo Silva Filho; Tom Diethe; Yu Chen

arxiv: 1903.04016 · v3 · pith:SCYZEU4Gnew · submitted 2019-03-10 · 📊 stat.ML · cs.LG

β³-IRT: A New Item Response Model and its Applications

Yu Chen , Telmo Silva Filho , Ricardo B. C. Prud\^encio , Tom Diethe , Peter Flach This is my paper

classification 📊 stat.ML cs.LG

keywords modelbetaitemassessdatadifficultyresponseabilities

0 comments

read the original abstract

Item Response Theory (IRT) aims to assess latent abilities of respondents based on the correctness of their answers in aptitude test items with different difficulty levels. In this paper, we propose the $\beta^3$-IRT model, which models continuous responses and can generate a much enriched family of Item Characteristic Curve (ICC). In experiments we applied the proposed model to data from an online exam platform, and show our model outperforms a more standard 2PL-ND model on all datasets. Furthermore, we show how to apply $\beta^3$-IRT to assess the ability of machine learning classifiers. This novel application results in a new metric for evaluating the quality of the classifier's probability estimates, based on the inferred difficulty and discrimination of data instances.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Item Response Scaling Laws: A Measurement Theory Approach for Efficient and Generalizable Neural Scaling Estimation
cs.LG 2026-05 unverdicted novelty 7.0

IRSL applies IRT to reduce scaling law estimation from O(M×N) to O(M+N) parameters, enabling reliable estimates with only 50 questions per benchmark after calibration and generalizable ability scores across related be...