YOMI-Bench is a new benchmark of four tasks for kanji reading and phonological understanding in LLMs, showing low performance even for Japanese-specific and commercial models.
Should We Respect LLM s? A Cross-Lingual Study on the Influence of Prompt Politeness on LLM Performance
3 Pith papers cite this work. Polarity classification is still indexing.
3
Pith papers citing it
citation-role summary
background 1
citation-polarity summary
years
2026 3verdicts
UNVERDICTED 3roles
background 1polarities
background 1representative citing papers
MÖVE presents a new German-language benchmark evaluating 39 LLMs on performance and governance criteria using ten public-administration datasets.
Visual fingerprints represent distributions of linguistic choices extracted from repeated LLM samples to enable direct comparison of behaviors under different generation conditions.
citing papers explorer
-
Visual Fingerprints for LLM Generation Comparison
Visual fingerprints represent distributions of linguistic choices extracted from repeated LLM samples to enable direct comparison of behaviors under different generation conditions.