Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering

Caiming Xiong; Nitish Shirish Keskar; Richard Socher; Victor Zhong

arxiv: 1901.00603 · v2 · pith:6LEPZ7KDnew · submitted 2019-01-03 · 💻 cs.CL · cs.AI

Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering

Victor Zhong , Caiming Xiong , Nitish Shirish Keskar , Richard Socher This is my paper

classification 💻 cs.CL cs.AI

keywords answeringquestionanswercoarse-graincoattentiondocumentsfine-grainacross

0 comments

read the original abstract

End-to-end neural models have made significant progress in question answering, however recent studies show that these models implicitly assume that the answer and evidence appear close together in a single document. In this work, we propose the Coarse-grain Fine-grain Coattention Network (CFC), a new question answering model that combines information from evidence across multiple documents. The CFC consists of a coarse-grain module that interprets documents with respect to the query then finds a relevant answer, and a fine-grain module which scores each candidate answer by comparing its occurrences across all of the documents with the query. We design these modules using hierarchies of coattention and self-attention, which learn to emphasize different parts of the input. On the Qangaroo WikiHop multi-evidence question answering task, the CFC obtains a new state-of-the-art result of 70.6% on the blind test set, outperforming the previous best by 3% accuracy despite not using pretrained contextual encoders.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

The Master-Slave Encoder Model for Improving Patent Text Summarization: A New Approach to Combining Specifications and Claims
cs.CL 2024-11 unverdicted novelty 4.0

MSEA uses a master-slave encoder architecture on patent specifications and claims, enhanced with pointer networks and repetition suppression, to generate better summaries as measured by small ROUGE score gains.