LLMs achieve high-quality translations of Galen’s expository Greek (MQM 95.2/100) but lower and bimodal quality on pharmacological texts (79.9/100), with terminology rarity (corpus frequency) predicting failure at r = -0.97.
ChatGPT 31.4 (± 6.1) 53.4 (± 3.7) 46.4 (± 5.4) 50.9 (± 5.3) 91.0 (± 1.2) 79.9 (± 1.9) 49.8 (± 3.4) Zainaldin et al
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
ACCEPT 1representative citing papers
citing papers explorer
-
Evaluating LLM-Based Translation of a Low-Resource Technical Language: The Medical and Philosophical Greek of Galen
LLMs achieve high-quality translations of Galen’s expository Greek (MQM 95.2/100) but lower and bimodal quality on pharmacological texts (79.9/100), with terminology rarity (corpus frequency) predicting failure at r = -0.97.