pith. sign in

arxiv: 1805.09929 · v1 · pith:HG2UPEVWnew · submitted 2018-05-24 · 💻 cs.CL

DSGAN: Generative Adversarial Training for Distant Supervision Relation Extraction

classification 💻 cs.CL
keywords distantgeneratorrelationsamplessupervisionadversarialextractionpositive
0
0 comments X
read the original abstract

Distant supervision can effectively label data for relation extraction, but suffers from the noise labeling problem. Recent works mainly perform soft bag-level noise reduction strategies to find the relatively better samples in a sentence bag, which is suboptimal compared with making a hard decision of false positive samples in sentence level. In this paper, we introduce an adversarial learning framework, which we named DSGAN, to learn a sentence-level true-positive generator. Inspired by Generative Adversarial Networks, we regard the positive samples generated by the generator as the negative samples to train the discriminator. The optimal generator is obtained until the discrimination ability of the discriminator has the greatest decline. We adopt the generator to filter distant supervision training dataset and redistribute the false positive instances into the negative set, in which way to provide a cleaned dataset for relation classification. The experimental results show that the proposed strategy significantly improves the performance of distant supervision relation extraction comparing to state-of-the-art systems.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Deep Ranking Based Cost-sensitive Multi-label Learning for Distant Supervision Relation Extraction

    cs.CL 2019-07 unverdicted novelty 4.0

    A deep ranking cost-sensitive multi-label model is introduced for distant supervision relation extraction that models class ties between relations via ranking losses and rescales costs for imbalance.