Bidirectional Attention Flow for Machine Comprehension

Ali Farhadi; Aniruddha Kembhavi; Hannaneh Hajishirzi; Minjoon Seo

arxiv: 1611.01603 · v6 · pith:LBX6QMUXnew · submitted 2016-11-05 · 💻 cs.CL

Bidirectional Attention Flow for Machine Comprehension

Minjoon Seo , Aniruddha Kembhavi , Ali Farhadi , Hannaneh Hajishirzi This is my paper

classification 💻 cs.CL

keywords attentioncontextflowansweringbi-directionalcomprehensionmachinequery

0 comments

read the original abstract

Machine comprehension (MC), answering a query about a given context paragraph, requires modeling complex interactions between the context and the query. Recently, attention mechanisms have been successfully extended to MC. Typically these methods use attention to focus on a small portion of the context and summarize it with a fixed-size vector, couple attentions temporally, and/or often form a uni-directional attention. In this paper we introduce the Bi-Directional Attention Flow (BIDAF) network, a multi-stage hierarchical process that represents the context at different levels of granularity and uses bi-directional attention flow mechanism to obtain a query-aware context representation without early summarization. Our experimental evaluations show that our model achieves the state-of-the-art results in Stanford Question Answering Dataset (SQuAD) and CNN/DailyMail cloze test.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 10 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Online Learning-to-Defer with Varying Experts
stat.ML 2026-05 unverdicted novelty 8.0

Presents the first online learning-to-defer algorithm with regret bounds O((n + n_e) T^{2/3}) generally and O((n + n_e) sqrt(T)) under low noise for multiclass classification with varying experts.
Passage Re-ranking with BERT
cs.IR 2019-01 unverdicted novelty 8.0

Fine-tuning BERT for query-passage relevance classification achieves state-of-the-art results on TREC-CAR and MS MARCO, with a 27% relative gain in MRR@10 over prior methods.
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
cs.CL 2017-05 accept novelty 8.0

TriviaQA is a new large-scale dataset for reading comprehension that features complex compositional questions, high lexical variability, and cross-sentence reasoning requirements, where current baselines reach only 40...
Online Learning-to-Defer with Varying Experts
stat.ML 2026-05 unverdicted novelty 7.0

Presents the first online Learning-to-Defer algorithm achieving regret O((n + n_e) T^{2/3}) generally and O((n + n_e) sqrt(T)) under low noise for multiclass classification with varying experts.
MS MARCO: A Human Generated MAchine Reading COmprehension Dataset
cs.CL 2016-11 accept novelty 7.0

MS MARCO is a new large-scale machine reading comprehension dataset built from real Bing search queries, human-generated answers, and web passages, supporting three tasks including answer synthesis and passage ranking.
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval
cs.CL 2024-01 unverdicted novelty 6.0

RAPTOR introduces a tree-organized retrieval method using recursive abstractive summaries, achieving a 20% absolute accuracy improvement on the QuALITY benchmark when paired with GPT-4.
Gated Embeddings in End-to-End Speech Recognition for Conversational-Context Fusion
cs.CL 2019-06 unverdicted novelty 6.0

Gated fusion of fastText and BERT embeddings into an end-to-end ASR model captures multi-sentence conversational context and lowers word error rate on the Switchboard corpus.
Weak Supervision Enhanced Generative Network for Question Generation
cs.CL 2019-07 unverdicted novelty 5.0

WeGen adds a weakly supervised Relation Guider and dynamic multi-interaction transfer to an encoder-decoder question generator to better use whole-passage context around an answer span.
EQuANt (Enhanced Question Answer Network)
cs.CL 2019-06 unverdicted novelty 4.0

EQuANt extends QANet to SQuAD 2, achieving nearly twice the performance of a lightweight QANet baseline while also improving SQuAD 1.1 results via multi-task learning.
Machine Reading Comprehension: a Literature Review
cs.CL 2019-06 unverdicted novelty 1.0

A 2019 survey of machine reading comprehension corpora and methods.