Sequence-to-sequence neural network models for transliteration
classification
💻 cs.CL
keywords
transliterationmodelsmachineneuralsequence-to-sequencestateaccessiblearabic
read the original abstract
Transliteration is a key component of machine translation systems and software internationalization. This paper demonstrates that neural sequence-to-sequence models obtain state of the art or close to state of the art results on existing datasets. In an effort to make machine transliteration accessible, we open source a new Arabic to English transliteration dataset and our trained models.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
PheMT: A Phenomenon-wise Dataset for Machine Translation Robustness on User-Generated Contents
PheMT is a phenomenon-wise dataset created to evaluate NMT robustness against linguistic phenomena in Japanese-English UGC translation, with experiments showing major performance drops on certain phenomena.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.