Sequence-to-sequence neural network models for transliteration

Mihaela Rosca; Thomas Breuel

arxiv: 1610.09565 · v1 · pith:RABRFMAHnew · submitted 2016-10-29 · 💻 cs.CL

Sequence-to-sequence neural network models for transliteration

Mihaela Rosca , Thomas Breuel This is my paper

classification 💻 cs.CL

keywords transliterationmodelsmachineneuralsequence-to-sequencestateaccessiblearabic

0 comments

read the original abstract

Transliteration is a key component of machine translation systems and software internationalization. This paper demonstrates that neural sequence-to-sequence models obtain state of the art or close to state of the art results on existing datasets. In an effort to make machine transliteration accessible, we open source a new Arabic to English transliteration dataset and our trained models.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

PheMT: A Phenomenon-wise Dataset for Machine Translation Robustness on User-Generated Contents
cs.CL 2020-11 unverdicted novelty 6.0

PheMT is a phenomenon-wise dataset created to evaluate NMT robustness against linguistic phenomena in Japanese-English UGC translation, with experiments showing major performance drops on certain phenomena.