A Paradigm Shift: The Future of Machine Translation Lies with Large Language Models

Alham Fikri Aji; Chenyang Lyu; Derek F. Wong; Jitao Xu; Longyue Wang; Minghao Wu; Siyou Liu; Teresa Lynn; Yitao Duan; Zefeng Du

arxiv: 2305.01181 · v3 · pith:SD6FQNIGnew · submitted 2023-05-02 · 💻 cs.CL

classification 💻 cs.CL

keywords: llms, translation, future, models, offer, emphasizing, language, large

Machine Translation (MT) has greatly advanced over the years due to developments in deep neural networks. However, the emergence of Large Language Models (LLMs) like GPT-4 and ChatGPT is introducing a new phase in the MT domain. In this context, we believe that the future of MT is intricately tied to the capabilities of LLMs. These models not only offer broad linguistic understanding but also bring innovative methodologies, such as prompt-based techniques, that have the potential to further elevate MT. In this paper, we provide an overview of the significant enhancements in MT that are influenced by LLMs and advocate for their pivotal role in upcoming MT research and implementations. We highlight several new MT directions, emphasizing the benefits of LLMs in scenarios such as Long-Document Translation, Stylized Translation, and Interactive Translation. Additionally, we address the important concern of privacy in LLM-driven MT and suggest essential privacy-preserving strategies. By showcasing practical instances, we aim to demonstrate the advantages that LLMs offer, particularly in tasks like translating extended documents. We conclude by emphasizing the critical role of LLMs in guiding the future evolution of MT and offer a roadmap for future exploration in the sector.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Ancient Greek to Modern Greek Machine Translation: A Novel Benchmark and Fine-Tuning Experiments on LLMs and NMT Models
cs.CL 2026-05 unverdicted novelty 7.0

Introduces the AG-MG Parallel Corpus of 132k aligned pairs and benchmarks fine-tuning of NLLB, M2M100, and Llama-Krikri-8B models, reporting up to +10.3 BLEU improvement with a peak score of 13.16.

Mix-MoE: Improving Multilingual Machine Translation of Large Language Models through Mixed MoEs
cs.CL 2026-05 unverdicted novelty 4.0

Mix-MoE applies separate LM and MT expert groups in two post-pretraining stages with Fourier-enhanced routing to reduce parameter interference and improve multilingual MT over baselines.