Structural Guidance for Transformer Language Models

Peng Qian, Tahira Naseem, Roger Levy · 2021 · DOI 10.18653/v1/2021.acl-long.289

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

representative citing papers

GiLT: Augmenting Transformer Language Models with Dependency Graphs

cs.CL · 2026-05-15 · unverdicted · novelty 6.0

GiLT augments Transformers with semantic dependency graphs by modulating attention to improve syntactic generalization while keeping perplexity competitive and enabling better finetuning on downstream tasks.

Why are language models less surprised than humans? Testing the Parse Multiplicity Mismatch Hypothesis

cs.CL · 2026-05-14 · conditional · novelty 6.0

Varying the number of simultaneous parses in RNNGs increases predicted garden-path effects but does not fully reconcile LM surprisal with human reading times.

Linguistic Frameworks Go Toe-to-Toe at Neuro-Symbolic Language Modeling

cs.CL · 2021-12-15 · unverdicted · novelty 6.0

Semantic constituency graphs outperform syntactic constituency and dependency structures from seven formalisms when added to a Transformer for language modeling.

citing papers explorer

Showing 3 of 3 citing papers.

GiLT: Augmenting Transformer Language Models with Dependency Graphs cs.CL · 2026-05-15 · unverdicted · none · ref 13
GiLT augments Transformers with semantic dependency graphs by modulating attention to improve syntactic generalization while keeping perplexity competitive and enabling better finetuning on downstream tasks.
Why are language models less surprised than humans? Testing the Parse Multiplicity Mismatch Hypothesis cs.CL · 2026-05-14 · conditional · none · ref 247
Varying the number of simultaneous parses in RNNGs increases predicted garden-path effects but does not fully reconcile LM surprisal with human reading times.
Linguistic Frameworks Go Toe-to-Toe at Neuro-Symbolic Language Modeling cs.CL · 2021-12-15 · unverdicted · none · ref 47
Semantic constituency graphs outperform syntactic constituency and dependency structures from seven formalisms when added to a Transformer for language modeling.

Structural Guidance for Transformer Language Models

fields

years

verdicts

representative citing papers

citing papers explorer