Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models

Zhao, Yida, Lou, Chao, Tu, Kewei · 2024 · DOI 10.18653/v1/2024.acl-long.84

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

GiLT: Augmenting Transformer Language Models with Dependency Graphs

cs.CL · 2026-05-15 · unverdicted · novelty 6.0

GiLT augments Transformers with semantic dependency graphs by modulating attention to improve syntactic generalization while keeping perplexity competitive and enabling better finetuning on downstream tasks.

Why are language models less surprised than humans? Testing the Parse Multiplicity Mismatch Hypothesis

cs.CL · 2026-05-14 · conditional · novelty 6.0

Varying the number of simultaneous parses in RNNGs increases predicted garden-path effects but does not fully reconcile LM surprisal with human reading times.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Why are language models less surprised than humans? Testing the Parse Multiplicity Mismatch Hypothesis cs.CL · 2026-05-14 · conditional · none · ref 193
Varying the number of simultaneous parses in RNNGs increases predicted garden-path effects but does not fully reconcile LM surprisal with human reading times.

Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models

fields

years

verdicts

representative citing papers

citing papers explorer