pith. machine review for the scientific record. sign in

arxiv: 1710.04334 · v4 · submitted 2017-10-12 · 💻 cs.CL · cs.AI

Recognition: unknown

DisSent: Sentence Representation Learning from Explicit Discourse Relations

Authors on Pith no claims yet
classification 💻 cs.CL cs.AI
keywords sentencecurateddiscoursehighlearningrelationrelationsachieve
0
0 comments X
read the original abstract

Learning effective representations of sentences is one of the core missions of natural language understanding. Existing models either train on a vast amount of text, or require costly, manually curated sentence relation datasets. We show that with dependency parsing and rule-based rubrics, we can curate a high quality sentence relation task by leveraging explicit discourse relations. We show that our curated dataset provides an excellent signal for learning vector representations of sentence meaning, representing relations that can only be determined when the meanings of two sentences are combined. We demonstrate that the automatically curated corpus allows a bidirectional LSTM sentence encoder to yield high quality sentence embeddings and can serve as a supervised fine-tuning dataset for larger models such as BERT. Our fixed sentence embeddings achieve high performance on a variety of transfer tasks, including SentEval, and we achieve state-of-the-art results on Penn Discourse Treebank's implicit relation prediction task.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding

    cs.CL 2018-04 unverdicted novelty 7.0

    GLUE is a multi-task benchmark for general natural language understanding that includes a diagnostic test suite and finds limited gains from current multi-task learning methods over single-task training.