Augmenting Data with Mixup for Sentence Classification: An Empirical Study

Hongyu Guo; Richong Zhang; Yongyi Mao

arxiv: 1905.08941 · v1 · pith:FVW27UD2new · submitted 2019-05-22 · 💻 cs.CL · cs.AI

Augmenting Data with Mixup for Sentence Classification: An Empirical Study

Hongyu Guo , Yongyi Mao , Richong Zhang This is my paper

classification 💻 cs.CL cs.AI

keywords classificationsentencedatamixupaccuracyaugmentationembeddingsinterpolation

0 comments

read the original abstract

Mixup, a recent proposed data augmentation method through linearly interpolating inputs and modeling targets of random samples, has demonstrated its capability of significantly improving the predictive accuracy of the state-of-the-art networks for image classification. However, how this technique can be applied to and what is its effectiveness on natural language processing (NLP) tasks have not been investigated. In this paper, we propose two strategies for the adaption of Mixup on sentence classification: one performs interpolation on word embeddings and another on sentence embeddings. We conduct experiments to evaluate our methods using several benchmark datasets. Our studies show that such interpolation strategies serve as an effective, domain independent data augmentation approach for sentence classification, and can result in significant accuracy improvement for both CNN and LSTM models.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Quantifying and Optimizing Simplicity via Polynomial Representations
cs.AI 2026-05 unverdicted novelty 6.0

Polynomial representations yield an effective-degree simplicity metric that predicts generalization across tasks and serves as a differentiable regularizer improving performance in classification and RL.