pith. sign in

arxiv: 1604.02201 · v1 · pith:IMB6CTUGnew · submitted 2016-04-08 · 💻 cs.CL

Transfer Learning for Low-Resource Neural Machine Translation

classification 💻 cs.CL
keywords low-resourcemachinetransfertranslationbleulearninglanguagemodel
0
0 comments X
read the original abstract

The encoder-decoder framework for neural machine translation (NMT) has been shown effective in large data scenarios, but is much less effective for low-resource languages. We present a transfer learning method that significantly improves Bleu scores across a range of low-resource languages. Our key idea is to first train a high-resource language pair (the parent model), then transfer some of the learned parameters to the low-resource pair (the child model) to initialize and constrain training. Using our transfer learning method we improve baseline NMT models by an average of 5.6 Bleu on four low-resource language pairs. Ensembling and unknown word replacement add another 2 Bleu which brings the NMT performance on low-resource machine translation close to a strong syntax based machine translation (SBMT) system, exceeding its performance on one language pair. Additionally, using the transfer learning model for re-scoring, we can improve the SBMT system by an average of 1.3 Bleu, improving the state-of-the-art on low-resource machine translation.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. The Role of Vocabularies in Learning Sparse Representations for Ranking

    cs.IR 2025-09 unverdicted novelty 5.0

    Larger 100K vocabularies in SPLADE models, especially those initialized with ESPLADE pretraining, improve retrieval effectiveness after pruning compared to 32K baselines while keeping similar efficiency.

  2. Opportunities and Challenges of Large Language Models for Low-Resource Languages in Humanities Research

    cs.CL 2024-11 unverdicted novelty 2.0

    This survey paper identifies opportunities for LLMs in low-resource language humanities research along with challenges in data accessibility, model adaptability, and cultural sensitivity.