Ethan Gotlieb Wilcox, Tiago Pimentel, Clara Meister, Ryan Cotterell, and Roger P

Construction grammar provides unique insight into neural language models · 2023 · arXiv 2302.02178

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Do Language Models Know What Not to Say? Causal Evidence for Statistical Preemption in LLMs

cs.CL · 2026-05-21 · unverdicted · novelty 8.0

LLMs show statistical preemption for 120 verb-construction pairs, with surprisal driven by competing-form frequency rather than verb frequency, scaling as a power law with size, and causally shifted by controlled fine-tuning.

Developmental approach reveals the statistical learning of Neural Language Models: Transformers generalize from the most abstract statistical patterns

cs.CL · 2026-06-25 · unverdicted · novelty 5.0

Transformers on synthetic grammar acquire abstract global statistical knowledge first, then local dependencies, showing initial over-generalizations that are later constrained.

Linguistic Productivity in Large Language Models: Models Coerce, but do not Preempt

cs.CL · 2026-06-01 · unverdicted · novelty 5.0

Larger LLMs reproduce constructional productivity via entrenchment in coercion cases with nonce words but fail to use statistical preemption to avoid overgeneralizing semantically plausible but unobserved patterns.

citing papers explorer

Do Language Models Know What Not to Say? Causal Evidence for Statistical Preemption in LLMs cs.CL · 2026-05-21 · unverdicted · none · ref 7
Developmental approach reveals the statistical learning of Neural Language Models: Transformers generalize from the most abstract statistical patterns cs.CL · 2026-06-25 · unverdicted · none · ref 59
Linguistic Productivity in Large Language Models: Models Coerce, but do not Preempt cs.CL · 2026-06-01 · unverdicted · none · ref 200
