Look wide and interpret twice: Improving performance on interactive instruction-following tasks.arXiv preprint arXiv:2106.00596, 2021

Van-Quang Nguyen, Masanori Suganuma, Takayuki Okatani · 2021 · arXiv 2106.00596

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

RePlan-Bot: Multi-Level Replanning for Embodied Instruction Following

cs.RO · 2026-05-25 · unverdicted · novelty 5.0

RePlan-Bot achieves state-of-the-art results on the ALFRED benchmark for embodied instruction following by integrating LLM-based auditing, commonsense map search, and ViT action correction.

Machine Intelligence that Understands Visual and Linguistic Information and Interacts with Humans and Environments

cs.CV · 2026-05-20 · unverdicted · novelty 4.0

Introduces GRIT, LTMI, and a hierarchical attention framework claiming performance gains on image captioning, visual dialog, and ALFRED instruction following.

citing papers explorer

Showing 1 of 1 citing paper after filters.

RePlan-Bot: Multi-Level Replanning for Embodied Instruction Following cs.RO · 2026-05-25 · unverdicted · none · ref 22
RePlan-Bot achieves state-of-the-art results on the ALFRED benchmark for embodied instruction following by integrating LLM-based auditing, commonsense map search, and ViT action correction.

Look wide and interpret twice: Improving performance on interactive instruction-following tasks.arXiv preprint arXiv:2106.00596, 2021

fields

years

verdicts

representative citing papers

citing papers explorer