In Findings of the Association for Computational Linguistics, ACL 2024, Bangkok, Thailand and virtual meeting, August 11-16, 2024, pages 12688–12701

LLMs as narcissistic evaluators: When ego inflates evaluation scores · 2024 · arXiv 2505.15365

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

LLMs Judge Themselves: A Game-Theoretic Framework for Human-Aligned Evaluation

cs.CL · 2025-10-17 · unverdicted · novelty 7.0

A mutual evaluation system for LLMs that uses game-theoretic aggregation of peer reviews and validates alignment with human voting on subjective outputs.


