Training Classifiers with Natural Language Explanations

Braden Hancock; Christopher R\'e; Martin Bringmann; Paroma Varma; Percy Liang; Stephanie Wang

arxiv: 1805.03818 · v4 · pith:SCM5QKJEnew · submitted 2018-05-10 · 💻 cs.CL

Training Classifiers with Natural Language Explanations

Braden Hancock , Paroma Varma , Stephanie Wang , Martin Bringmann , Percy Liang , Christopher R\'e This is my paper

classification 💻 cs.CL

keywords classifiersexplanationslabelinglabelstrainingfindfunctionslanguage

0 comments

read the original abstract

Training accurate classifiers requires many labels, but each label provides only limited information (one bit for binary classification). In this work, we propose BabbleLabble, a framework for training classifiers in which an annotator provides a natural language explanation for each labeling decision. A semantic parser converts these explanations into programmatic labeling functions that generate noisy labels for an arbitrary amount of unlabeled data, which is used to train a classifier. On three relation extraction tasks, we find that users are able to train classifiers with comparable F1 scores from 5-100$\times$ faster by providing explanations instead of just labels. Furthermore, given the inherent imperfection of labeling functions, we find that a simple rule-based semantic parser suffices.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

A Single Rewrite Suffices: Empirical Lessons from Production Skill Description Optimization
cs.CL 2026-06 unverdicted novelty 5.0

A single LLM rewrite of skill descriptions using false positive and negative cases matches manual optimization performance in production, with most other pipeline components adding little value.