pith. machine review for the scientific record. sign in

arxiv: 1605.06353 · v2 · submitted 2016-05-20 · 💻 cs.CL

Recognition: unknown

Phrase-based Machine Translation is State-of-the-Art for Automatic Grammatical Error Correction

Authors on Pith no claims yet
classification 💻 cs.CL
keywords featuressparsetuningautomaticconll-2014correctiondenseerror
0
0 comments X
read the original abstract

In this work, we study parameter tuning towards the M^2 metric, the standard metric for automatic grammar error correction (GEC) tasks. After implementing M^2 as a scorer in the Moses tuning framework, we investigate interactions of dense and sparse features, different optimizers, and tuning strategies for the CoNLL-2014 shared task. We notice erratic behavior when optimizing sparse feature weights with M^2 and offer partial solutions. We find that a bare-bones phrase-based SMT setup with task-specific parameter-tuning outperforms all previously published results for the CoNLL-2014 test set by a large margin (46.37% M^2 over previously 41.75%, by an SMT system with neural features) while being trained on the same, publicly available data. Our newly introduced dense and sparse features widen that gap, and we improve the state-of-the-art to 49.49% M^2.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Neural Grammatical Error Correction for Romanian

    cs.CL 2026-04 unverdicted novelty 6.0

    A new Romanian GEC corpus of 10k pairs plus pretraining a Transformer on artificial errors generated via POS tagger yields F0.5 of 53.76, beating the 44.38 baseline from training only on the corpus.