pith. machine review for the scientific record. sign in

arxiv: 1603.00810 · v3 · submitted 2016-03-02 · 💻 cs.CL · cs.LG· cs.NE· stat.ML

Recognition: unknown

Character-based Neural Machine Translation

Authors on Pith no claims yet
classification 💻 cs.CL cs.LGcs.NEstat.ML
keywords neuralcharacter-basedembeddingsmachinemorphologicallyresultsrichsource
0
0 comments X
read the original abstract

Neural Machine Translation (MT) has reached state-of-the-art results. However, one of the main challenges that neural MT still faces is dealing with very large vocabularies and morphologically rich languages. In this paper, we propose a neural MT system using character-based embeddings in combination with convolutional and highway layers to replace the standard lookup-based word representations. The resulting unlimited-vocabulary and affix-aware source word embeddings are tested in a state-of-the-art neural MT based on an attention-based bidirectional recurrent neural network. The proposed MT scheme provides improved results even when the source language is not morphologically rich. Improvements up to 3 BLEU points are obtained in the German-English WMT task.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation

    cs.CL 2016-09 accept novelty 6.0

    GNMT deploys 8-layer LSTMs with attention, wordpieces, low-precision inference, and coverage-penalized beam search to match state-of-the-art on WMT'14 En-Fr and En-De while cutting translation errors by 60% in human e...