Should we respect LLMs? A cross-lingual study on the influence of prompt politeness on LLM performance
2 Pith papers cite this work. Polarity classification is still indexing.
Citation summary
Years: 2026 (2)
Verdicts: unverdicted (2)
Roles: background (1)
Polarities: background (1)
citing papers explorer
- MÖVE: A Holistic LLM Benchmark for the German Public Sector
  MÖVE presents a new German-language benchmark evaluating 39 LLMs on performance and governance criteria using ten public-administration datasets.
- Visual Fingerprints for LLM Generation Comparison
  Visual fingerprints represent distributions of linguistic choices extracted from repeated LLM samples to enable direct comparison of behaviors under different generation conditions.