Toward generalizable evaluation in the LLM era: A survey beyond benchmarks

7 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background: 2 · dataset: 1

citation-polarity summary

years

2026: 6 · 2025: 1

representative citing papers

Security in LLM-as-a-Judge: A Comprehensive SoK

cs.CR · 2026-03-31 · accept · novelty 8.0

The first SoK on LLM-as-a-Judge security organizes attacks targeting judges, attacks using judges, defenses leveraging judges, and security-domain applications while flagging vulnerabilities.