arXiv:2504.03997

Towards Robust Offline Evaluation: A Causal, Information Theoretic Framework for Debiasing Ranking Systems · arXiv 2504.03997

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

representative citing papers

LLM-as-a-Judge for Reliable and Explainable Offline Evaluation in Top-K Recommendation

cs.IR · 2026-06-22 · unverdicted · novelty 5.0

An LLM judge uses semantic matching on user text to deliver reliable, explainable top-K recommendation scores instead of biased ID-based holdout metrics.

citing papers explorer

Showing 1 of 1 citing paper after filters.

LLM-as-a-Judge for Reliable and Explainable Offline Evaluation in Top-K Recommendation cs.IR · 2026-06-22 · unverdicted · none · ref 30
An LLM judge uses semantic matching on user text to deliver reliable, explainable top-K recommendation scores instead of biased ID-based holdout metrics.

arXiv:2504.03997

fields

years

verdicts

representative citing papers

citing papers explorer