MÖVE presents a new German-language benchmark evaluating 39 LLMs on performance and governance criteria using ten public-administration datasets.
The political biases of ChatGPT.Social Sciences, 12(3):148
4 Pith papers cite this work. Polarity classification is still indexing.
years
2026 4representative citing papers
LLMs exhibit higher perplexity on far-right and nationalist party texts than social-democratic ones, consistent across models and languages with correlation to translation metrics.
Centralized matching mechanisms outperform free negotiation in stability and efficiency with LLM agents, who also report preferences truthfully more often than humans, though not always in line with strategy-proofness predictions.
citing papers explorer
-
M\"OVE: A Holistic LLM Benchmark for the German Public Sector
MÖVE presents a new German-language benchmark evaluating 39 LLMs on performance and governance criteria using ten public-administration datasets.
-
Large Language Models are Perplexed by some Political Parties
LLMs exhibit higher perplexity on far-right and nationalist party texts than social-democratic ones, consistent across models and languages with correlation to translation metrics.
-
Do Matching Mechanisms Work with LLM Agents?
Centralized matching mechanisms outperform free negotiation in stability and efficiency with LLM agents, who also report preferences truthfully more often than humans, though not always in line with strategy-proofness predictions.
- Reducing Political Manipulation with Consistency Training