arxiv: 1706.00130 · v2 · pith:2OVB4PM2new · submitted 2017-06-01 · cs.CL · cs.AI · cs.CV · cs.HC

Teaching Machines to Describe Images via Natural Language Feedback

classification cs.CL · cs.AI · cs.CV · cs.HC
keywords feedback, human, captioning, descriptive, easily, enable, language, learning

Robots will eventually be part of every household. It is thus critical to enable algorithms to learn from and be guided by non-expert users. In this paper, we bring a human in the loop, and enable a human teacher to give feedback to a learning agent in the form of natural language. We argue that a descriptive sentence can provide a much stronger learning signal than a numeric reward in that it can easily point to where the mistakes are and how to correct them. We focus on the problem of image captioning in which the quality of the output can easily be judged by non-experts. We propose a hierarchical phrase-based captioning model trained with policy gradients, and design a feedback network that provides reward to the learner by conditioning on the human-provided feedback. We show that by exploiting descriptive feedback our model learns to perform better than when given independently written human captions.

This paper has not been read by Pith yet.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Why Build an Assistant in Minecraft?

    cs.AI 2019-07 unverdicted novelty 4.0

    A rationale is presented for developing an assistant in Minecraft to advance natural language understanding and dialogue learning.