Compositional Generalization in Grounded Language Learning via Induced Model Sparsity

Alexander Ilin; Sam Spilsbury

arxiv: 2207.02518 · v1 · pith:GGHIYSBWnew · submitted 2022-07-06 · 💻 cs.CL · cs.LG

Compositional Generalization in Grounded Language Learning via Induced Model Sparsity

Sam Spilsbury , Alexander Ilin This is my paper

classification 💻 cs.CL cs.LG

keywords agentcompositionalgeneralizationgoallearningattributesenvironmentfind

0 comments

read the original abstract

We provide a study of how induced model sparsity can help achieve compositional generalization and better sample efficiency in grounded language learning problems. We consider simple language-conditioned navigation problems in a grid world environment with disentangled observations. We show that standard neural architectures do not always yield compositional generalization. To address this, we design an agent that contains a goal identification module that encourages sparse correlations between words in the instruction and attributes of objects, composing them together to find the goal. The output of the goal identification module is the input to a value iteration network planner. Our agent maintains a high level of performance on goals containing novel combinations of properties even when learning from a handful of demonstrations. We examine the internal representations of our agent and find the correct correspondences between words in its dictionary and attributes in the environment.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Compositionality Emerges in a Narrow Depth-Connectivity Regime: Architecture Constraints and Solution Manifolds
cs.LG 2026-06 unverdicted novelty 6.0

Compositionality emerges in neural networks only in a narrow depth-connectivity regime, with gradient descent converging to fractured solutions outside it.
A Systematic Study of Behavioral Cloning for Scientific Data Annotation
cs.HC 2026-05 unverdicted novelty 6.0

Introduces 9 synthetic annotation tasks and benchmarks for behavioral cloning, finding hierarchical skill learning, scaling benefits, effective multi-task pretraining, and shared internal representations of task phase...