A multi-reference audit framework for LLM translations of the Pali Canon uses embedding drift from a human reference centroid to triage candidates for LLM-judge adjudication, showing drift correlates with major error rates and model-specific differences in the high-drift tail.
(2024) ’One Model Is All Y ou Need: ByT5- Sanskrit, a Unified Model for Sanskrit NLP Tasks’ in Findings of the Association for Computational Lin- guistics: EMNLP 2024
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CL 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
Proposes treating Pāṇini's Astādhyāyī as a unifying computational architecture and benchmark foundation for Indic language NLP to improve accuracy, data efficiency, and transfer.
citing papers explorer
-
From Outliers to Errors: Auditing Pali-to-English LLM Translations with Multi-Reference Adjudication
A multi-reference audit framework for LLM translations of the Pali Canon uses embedding drift from a human reference centroid to triage candidates for LLM-judge adjudication, showing drift correlates with major error rates and model-specific differences in the high-drift tail.
-
A P\={a}ninian Foundation for Indic Language Processing
Proposes treating Pāṇini's Astādhyāyī as a unifying computational architecture and benchmark foundation for Indic language NLP to improve accuracy, data efficiency, and transfer.