Large Margin Neural Language Model

· 2018 · cs.CL · arXiv 1808.08987

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

We propose a large margin criterion for training neural language models. Conventionally, neural language models are trained by minimizing perplexity (PPL) on grammatical sentences. However, we demonstrate that PPL may not be the best metric to optimize in some tasks, and further propose a large margin formulation. The proposed method aims to enlarge the margin between the "good" and "bad" sentences in a task-specific sense. It is trained end-to-end and can be widely applied to tasks that involve re-scoring of generated text. Compared with minimum-PPL training, our method gains up to 1.1 WER reduction for speech recognition and 1.0 BLEU increase for machine translation.

representative citing papers

Harnessing non-adversarial robustness in large language models

cs.AI · 2026-05-28 · unverdicted · novelty 3.0

Debiasing via fine-tuning can enhance LLM robustness to semantically neutral prompt perturbations by addressing perturbation-induced bias in neural network outputs.

citing papers explorer

Showing 1 of 1 citing paper.

Harnessing non-adversarial robustness in large language models cs.AI · 2026-05-28 · unverdicted · none · ref 8 · internal anchor
Debiasing via fine-tuning can enhance LLM robustness to semantically neutral prompt perturbations by addressing perturbation-induced bias in neural network outputs.

Large Margin Neural Language Model

fields

years

verdicts

representative citing papers

citing papers explorer