pith. sign in

arxiv: 2606.21203 · v1 · pith:SIVDFI4Vnew · submitted 2026-06-19 · 💻 cs.CL · cs.AI

When Context Misleads: Surprisal, Energy and Attention Entropy as Metrics of Coherence Illusions in LLMs

Pith reviewed 2026-06-26 14:39 UTC · model grok-4.3

classification 💻 cs.CL cs.AI
keywords coherence illusionssurprisalattention entropyenergy metricDutch LLMsdiscourse coherencepsycholinguistics
0
0 comments X

The pith

Dutch language models fall for coherence illusions when a distractor in prior context reduces surprisal at incoherent text.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests whether LLMs mirror human readers by accepting incoherent discourse as coherent when a word in the earlier context matches the next expected item. It uses Dutch models and texts built around link-back words such as 'again' and 'too' to create controlled illusion cases. Surprisal at the critical word falls when the distractor is present, matching patterns seen in human judgments. Attention entropy isolates heads whose behavior changes between coherent and incoherent conditions, and ablating those heads produces effects that carry over between experiments. Energy, drawn from associative-memory models, is introduced as an additional scalar for measuring overall discourse coherence.

Core claim

Coherence illusions arise in Dutch LLMs because a distractor in the prior context that matches the critical word reduces the surprisal at that word even when the overall discourse is incoherent. Attention entropy identifies heads that differ under coherent versus incoherent conditions, and ablating them shows transfer effects across experiments, suggesting a shared mechanism. Energy from the associative-memory literature serves as a metric that quantifies discourse coherence across the tested settings.

What carries the argument

Surprisal at the critical word, together with attention entropy over heads and the energy metric, which together detect when prior context lowers surprise for an otherwise incoherent continuation.

If this is right

  • Surprisal values align with human acceptability judgments and eye-tracking measures on the same materials.
  • Specific attention heads show distinct entropy patterns under coherent versus incoherent conditions.
  • Ablating the identified heads produces measurable transfer effects between different illusion experiments.
  • The energy metric tracks discourse coherence in a way that operates across both monolingual and multilingual models.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If the same heads and energy patterns appear in non-Dutch models, the illusion mechanism may be architecture-level rather than language-specific.
  • Energy could be added as an auxiliary training signal to penalize low-coherence states during pretraining.
  • Long-context applications such as summarization may inherit the same distractor sensitivity observed here.

Load-bearing premise

The chosen texts built around 'again' and 'too' plus the ten specific Dutch models are representative enough to support broader claims about coherence illusions in language models.

What would settle it

A new set of illusion texts or models in which surprisal at the critical word does not drop when a matching distractor is present would falsify the claim that these illusions occur in Dutch LLMs.

Figures

Figures reproduced from arXiv: 2606.21203 by Ece Takmaz, Jakub Dotlacil, Li Kloostra, Nitin Kumar.

Figure 1
Figure 1. Figure 1: Surprisals for the ‘Again’ Study 1 for the [PITH_FULL_IMAGE:figures/full_fig_p005_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Surprisal results for the ‘Again’ Study 2 for [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Surprisal results for the ‘Again’ Study 1 for [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Surprisal results for the ‘Too’ Study for the [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Attention Entropy difference in ‘Again’ Study [PITH_FULL_IMAGE:figures/full_fig_p006_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: depicts the Energy 1 scores for the medium GPT-2 model for the ‘Again’ Study 1, see [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Energy 2 scores across layers for the medium Dutch GPT-2 model for the ‘Again’ Study 1 [PITH_FULL_IMAGE:figures/full_fig_p007_7.png] view at source ↗
Figure 9
Figure 9. Figure 9: Representations from GPT-2 medium for ‘Again’ Study 1. Difference from MAMA, when the critical word was seen. To examine whether it is possible to predict coherence conditions, we train linear probes (Be￾linkov, 2022; Ettinger et al., 2016; Giulianelli et al., 2018), consisting of a linear layer projecting from the model’s hidden dimensions to 4 classes. We use 80% of the stimulus sets in training and 20% … view at source ↗
Figure 8
Figure 8. Figure 8: Surprisal in the original vs. ablated model. [PITH_FULL_IMAGE:figures/full_fig_p008_8.png] view at source ↗
Figure 10
Figure 10. Figure 10: Surprisal results for the ‘Again’ Study 2 for [PITH_FULL_IMAGE:figures/full_fig_p012_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Surprisal results for the ‘Too’ Study for the [PITH_FULL_IMAGE:figures/full_fig_p012_11.png] view at source ↗
Figure 13
Figure 13. Figure 13: Attention Entropy difference in ‘Again’ Study 2 between RMIMA and RMIMI in the medium Dutch GPT-2 model’s heads across layers. 0 5 10 15 Head 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 Layer 0.4 0.2 0.0 0.2 0.4 Mean attention entropy difference (MM_M - MM_MM) [PITH_FULL_IMAGE:figures/full_fig_p013_13.png] view at source ↗
Figure 14
Figure 14. Figure 14: Attention Entropy difference in ‘Too’ Study [PITH_FULL_IMAGE:figures/full_fig_p013_14.png] view at source ↗
Figure 17
Figure 17. Figure 17: Energy 2 scores across layers for the large Dutch GPT-2 model for the ‘Again’ Study 1 [PITH_FULL_IMAGE:figures/full_fig_p014_17.png] view at source ↗
Figure 18
Figure 18. Figure 18: Surprisal after transferring head ablation to [PITH_FULL_IMAGE:figures/full_fig_p014_18.png] view at source ↗
Figure 19
Figure 19. Figure 19: Surprisal after transferring head ablation to [PITH_FULL_IMAGE:figures/full_fig_p014_19.png] view at source ↗
Figure 20
Figure 20. Figure 20: Representations from GPT-2 medium for ‘Again’ Study 1 at the trigger position [PITH_FULL_IMAGE:figures/full_fig_p014_20.png] view at source ↗
read the original abstract

Psycholinguistics studies show that human readers fall for coherence illusions: an incoherent discourse can seem coherent simply because a distractor matches what comes next. We investigate whether Dutch language models (6 monolingual and 4 multilingual) show the same behavior on texts that link back to earlier context with words such as 'again' and 'too'. First, we find that surprisal at the critical word tracks human acceptability judgments and eye-tracking data. Models are more surprised by incoherent continuations, but a matching distractor in the prior context reduces this surprisal. Second, attention entropy at the critical position identifies heads that behave differently under coherence vs. incoherence. We find that ablating these heads shows transfer effects across experiments, suggesting a shared mechanism. Third, we introduce energy from the associative-memory literature as a metric to quantify discourse coherence. Taken together, our results show that coherence illusions arise in Dutch LLMs, with entropy and energy exposing mechanisms that operate across settings.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The paper claims that Dutch LLMs (6 monolingual + 4 multilingual) exhibit human-like coherence illusions on discourses linked by words such as 'again' and 'too': surprisal at critical words tracks human acceptability judgments and eye-tracking data, with distractors reducing surprisal; attention entropy identifies heads whose ablation produces transfer effects across experiments; and a new energy metric from associative-memory literature quantifies discourse coherence. The results are presented as evidence that coherence illusions arise in Dutch LLMs with mechanisms operating across settings.

Significance. If the central results hold after addressing scope, the work would be significant for bridging psycholinguistics and mechanistic interpretability: it supplies concrete evidence that LLMs replicate a specific human discourse illusion, demonstrates ablation-based transfer as a test for shared mechanisms, and introduces the energy metric as a potentially reusable tool for quantifying coherence. These elements, if rigorously supported, would strengthen claims about cross-linguistic and cross-model applicability of illusion effects.

major comments (3)
  1. [Methods (model and stimulus selection)] Model and text selection (Methods section): The generalization that coherence illusions 'arise in Dutch LLMs' and that 'entropy and energy expos[e] mechanisms that operate across settings' is load-bearing on the 10 chosen models and the 'again'/'too' items being representative. The manuscript must supply explicit selection criteria, training-corpus diversity metrics, and text-sampling rationale; absent these, the evidence supports the phenomena only for this narrow set rather than Dutch LLMs in general.
  2. [Results (surprisal analysis)] Surprisal results (Results section): The claim that surprisal 'tracks human acceptability judgments and eye-tracking data' and is reduced by distractors requires reported item counts, statistical tests (e.g., mixed-effects models), effect sizes, and error bars. Without these, it is impossible to assess whether the distractor effect is reliable or item-specific, directly undermining the first main finding.
  3. [Results (attention entropy and ablation)] Ablation transfer (Results section on attention entropy): The transfer effects across experiments are presented as evidence for a shared mechanism, yet the manuscript must demonstrate that the identified heads are not overfit to the specific lexical triggers or model subset; a control ablation on non-entropy-selected heads or a cross-validation across model families would be needed to secure this inference.
minor comments (2)
  1. [Abstract] The abstract would be clearer if it briefly stated the number of experimental items, the precise definition of the energy metric, and the statistical approach used for the human-model comparisons.
  2. [Methods (energy metric)] Notation for the energy metric should be introduced with an explicit equation in the main text rather than only by reference to the associative-memory literature.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive feedback, which highlights important areas for strengthening the generalizability and statistical support of our claims. We respond to each major comment below and commit to revisions where the manuscript currently falls short of the requested standards.

read point-by-point responses
  1. Referee: Model and text selection (Methods section): The generalization that coherence illusions 'arise in Dutch LLMs' and that 'entropy and energy expos[e] mechanisms that operate across settings' is load-bearing on the 10 chosen models and the 'again'/'too' items being representative. The manuscript must supply explicit selection criteria, training-corpus diversity metrics, and text-sampling rationale; absent these, the evidence supports the phenomena only for this narrow set rather than Dutch LLMs in general.

    Authors: We agree that the Methods section requires explicit documentation to support the stated scope. In the revised manuscript we will add a dedicated subsection on model and stimulus selection. This will report the criteria applied (availability of Dutch monolingual models, parameter-size range, and multilingual comparators), available training-corpus statistics, and the rationale for the 'again'/'too' items drawn from prior psycholinguistic work on coherence illusions. These additions will make the limits of generalization transparent. revision: yes

  2. Referee: Surprisal results (Results section): The claim that surprisal 'tracks human acceptability judgments and eye-tracking data' and is reduced by distractors requires reported item counts, statistical tests (e.g., mixed-effects models), effect sizes, and error bars. Without these, it is impossible to assess whether the distractor effect is reliable or item-specific, directly undermining the first main finding.

    Authors: The referee correctly identifies that the current presentation relies on directional trends without the quantitative statistics needed for rigorous evaluation. We will revise the Results section to report the exact item counts per condition, mixed-effects models testing the distractor effect (with p-values), effect sizes, and error bars on the relevant plots. These additions will allow readers to judge reliability and item-specificity directly. revision: yes

  3. Referee: Ablation transfer (Results section on attention entropy): The transfer effects across experiments are presented as evidence for a shared mechanism, yet the manuscript must demonstrate that the identified heads are not overfit to the specific lexical triggers or model subset; a control ablation on non-entropy-selected heads or a cross-validation across model families would be needed to secure this inference.

    Authors: We accept that the current ablation results would be strengthened by explicit controls against overfitting. In revision we will add control ablations on heads not selected by the entropy criterion and will report transfer effects broken down by model family. These controls will be included to test the specificity of the identified heads. revision: yes

Circularity Check

0 steps flagged

No circularity: metrics defined independently of results

full rationale

The paper applies standard surprisal, defines attention entropy at critical positions, and introduces energy from the associative-memory literature as separate metrics. No equations, derivations, or fitted parameters are shown that reduce the target claims about coherence illusions to inputs by construction. No self-citations are invoked as load-bearing uniqueness theorems, and no ansatz or renaming of known results is presented as a derivation. The experimental findings rest on empirical measurements across models and texts rather than definitional equivalence.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract provides no explicit free parameters, axioms, or invented entities.

pith-pipeline@v0.9.1-grok · 5712 in / 977 out tokens · 17705 ms · 2026-06-26T14:39:41.167627+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

53 extracted references · 29 canonical work pages · 1 internal anchor

  1. [1]

    Aryaman Arora, Dan Jurafsky, and Christopher Potts. 2024. https://doi.org/10.18653/v1/2024.acl-long.785 C ausal G ym: Benchmarking causal interpretability methods on linguistic tasks . In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 14638--14663, Bangkok, Thailand. Association for C...

  2. [2]

    Yonatan Belinkov. 2022. https://doi.org/10.1162/coli_a_00422 Probing classifiers: Promises, shortcomings, and advances . Computational Linguistics, 48(1):207--219

  3. [3]

    Sid Black, Leo Gao, Phil Wang, Connor Leahy, and Stella Biderman. 2021. https://doi.org/10.5281/zenodo.5297715 GPT-Neo: Large Scale Autoregressive Language Modeling with Mesh-Tensorflow . If you use this software, please cite it using these metadata

  4. [4]

    Kathryn Bock and Carol A Miller. 1991. https://doi.org/10.1016/0010-0285(91)90003-7 Broken agreement . Cognitive Psychology, 23(1):45--93

  5. [5]

    Sasha Boguraev, Christopher Potts, and Kyle Mahowald. 2025. https://doi.org/10.18653/v1/2025.emnlp-main.1271 Causal interventions reveal shared structure across E nglish filler -- gap constructions . In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 25021--25042, Suzhou, China. Association for Computational L...

  6. [6]

    Lisa Bylinina, Silvana Abdi, Hylke Brouwer, Martine Elzinga, Shenza Gunput, Sem Huisman, Collin Krooneman, David Poot, Jelmer Top, and Cain Weideman. 2024. https://doi.org/10.57967/hf/3825 Dutch-CoLA (Revision 5a4196c)

  7. [7]

    John Dang, Shivalika Singh, Daniel D'souza, Arash Ahmadian, Alejandro Salamanca, Madeline Smith, Aidan Peppin, Sungjin Hong, Manoj Govindassamy, Terrence Zhao, Sandra Kublik, Meor Amer, Viraat Aryabumi, Jon Ander Campos, Yi-Chern Tan, Tom Kocmi, Florian Strub, Nathan Grinsztajn, Yannis Flet-Berliac, and 26 others. 2024. https://arxiv.org/abs/2412.04261 Ay...

  8. [8]

    Wietse de Vries and Malvina Nissim. 2021. https://doi.org/10.18653/v1/2021.findings-acl.74 As good as new. how to successfully recycle E nglish GPT -2 to make models for other languages . In Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021, pages 836--846, Online. Association for Computational Linguistics

  9. [9]

    Nima Dehmamy, Benjamin Hoover, Bishwajit Saha, Leo Kozachkov, Jean-Jacques Slotine, and Dmitry Krotov. 2026. Nrgpt: An energy-based alternative for gpt. Proceedings of ICLR . arXiv preprint arXiv:2512.16762

  10. [10]

    Nelson Elhage, Neel Nanda, Catherine Olsson, Tom Henighan, Nicholas Joseph, Ben Mann, Amanda Askell, Yuntao Bai, Anna Chen, Tom Conerly, Nova DasSarma, Dawn Drain, Deep Ganguli, Zac Hatfield-Dodds, Danny Hernandez, Andy Jones, Jackson Kernion, Liane Lovitt, Kamal Ndousse, and 6 others. 2021. A mathematical framework for transformer circuits. Transformer C...

  11. [11]

    Allyson Ettinger, Ahmed Elgohary, and Philip Resnik. 2016. https://doi.org/10.18653/v1/W16-2524 Probing for semantic evidence of composition by means of simple classification tasks . In Proceedings of the 1st Workshop on Evaluating Vector-Space Representations for NLP , pages 134--139, Berlin, Germany. Association for Computational Linguistics

  12. [12]

    Mario Giulianelli, Jack Harding, Florian Mohnert, Dieuwke Hupkes, and Willem Zuidema. 2018. https://doi.org/10.18653/v1/W18-5426 Under the hood: Using diagnostic classifiers to investigate and improve how language models track agreement information . In Proceedings of the 2018 EMNLP Workshop B lackbox NLP : Analyzing and Interpreting Neural Networks for N...

  13. [13]

    Aaron Grattafiori, Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Alex Vaughan, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, and 542 others. 2024. https://arxiv.org/abs/2407.21783 The llama 3...

  14. [14]

    John Hale. 2001. https://doi.org/10.3115/1073336.1073357 A probabilistic earley parser as a psycholinguistic model . In Proceedings of the Second Meeting of the North American Chapter of the Association for Computational Linguistics on Language Technologies, NAACL '01, page 1–8, USA. Association for Computational Linguistics

  15. [15]

    John J Hopfield. 1982. Neural networks and physical systems with emergent collective computational abilities. Proceedings of the national academy of sciences, 79(8):2554--2558

  16. [16]

    John J Hopfield. 1984. Neurons with graded response have collective computational properties like those of two-state neurons. Proceedings of the national academy of sciences, 81(10):3088--3092

  17. [17]

    Yibo Jiang, Goutham Rajendran, Pradeep Kumar Ravikumar, and Bryon Aragam. 2024. https://openreview.net/forum?id=WJ04ZX8txM Do LLM s dream of elephants (when told not to)? latent concept association and associative memory in transformers . In The Thirty-eighth Annual Conference on Neural Information Processing Systems

  18. [18]

    Jaap Jumelet, Leonie Weissweiler, Joakim Nivre, and Arianna Bisazza. 2026. https://doi.org/10.1162/tacl.a.600 M ulti BL i MP 1.0: A massively multilingual benchmark of linguistic minimal pairs . Transactions of the Association for Computational Linguistics, 14:193--216

  19. [19]

    Li Kloostra, Rick Nouwen, and Jakub Dotlačil. 2026. Coherence illusions and failed binding in discourse processing: interferences in the presupposition resolution of again. Under Review. Journal of XXX, XXX:XXX

  20. [20]

    Hopfield

    Dmitry Krotov and John J. Hopfield. 2016. https://proceedings.neurips.cc/paper/2016/file/eaae339c4d89fc102edd9dbdb6a28915-Paper.pdf Dense associative memory for pattern recognition . In Advances in Neural Information Processing Systems, volume 29. Curran Associates, Inc

  21. [21]

    Tatsuki Kuribayashi, Yohei Oseki, and Timothy Baldwin. 2024. https://doi.org/10.18653/v1/2024.findings-naacl.129 Psychometric predictive power of large language models . In Findings of the Association for Computational Linguistics: NAACL 2024, pages 1983--2005, Mexico City, Mexico. Association for Computational Linguistics

  22. [22]

    Tatsuki Kuribayashi, Yohei Oseki, Souhaib Ben Taieb, Kentaro Inui, and Timothy Baldwin. 2025. https://doi.org/10.1162/TACL.a.58 Large language models are human-like internally . Transactions of the Association for Computational Linguistics, 13:1743--1766

  23. [23]

    Roger Levy. 2008. https://doi.org/10.1016/j.cognition.2007.05.006 Expectation-based syntactic comprehension . Cognition, 106(3):1126--1177

  24. [24]

    Lewis and Shravan Vasishth

    Richard L. Lewis and Shravan Vasishth. 2005. https://doi.org/10.1207/s15516709cog0000\_25 An activation-based model of sentence processing as skilled memory retrieval . Cognitive Science, 29(3):375--419

  25. [25]

    Tal Linzen, Emmanuel Dupoux, and Yoav Goldberg. 2016. https://doi.org/10.1162/tacl_a_00115 Assessing the ability of LSTM s to learn syntax-sensitive dependencies . Transactions of the Association for Computational Linguistics, 4:521--535

  26. [26]

    Guerreiro, Ricardo Rei, Amin Farajian, Mateusz Klimaszewski, Duarte M

    Pedro Henrique Martins, João Alves, Patrick Fernandes, Nuno M. Guerreiro, Ricardo Rei, Amin Farajian, Mateusz Klimaszewski, Duarte M. Alves, José Pombal, Nicolas Boizard, Manuel Faysse, Pierre Colombo, François Yvon, Barry Haddow, José G. C. de Souza, Alexandra Birch, and André F. T. Martins. 2025 a . https://arxiv.org/abs/2506.04079 Eurollm-9b: Technical...

  27. [27]

    Guerreiro, Ricardo Rei, Duarte M

    Pedro Henrique Martins, Patrick Fernandes, João Alves, Nuno M. Guerreiro, Ricardo Rei, Duarte M. Alves, José Pombal, Amin Farajian, Manuel Faysse, Mateusz Klimaszewski, Pierre Colombo, Barry Haddow, José G.C. de Souza , Alexandra Birch, and André F.T. Martins. 2025 b . https://doi.org/10.1016/j.procs.2025.02.260 Eurollm: Multilingual language models for e...

  28. [28]

    Hosein Mohebbi, Willem Zuidema, Grzegorz Chrupa a, and Afra Alishahi. 2023. https://doi.org/10.18653/v1/2023.eacl-main.245 Quantifying context mixing in transformers . In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, pages 3378--3400, Dubrovnik, Croatia. Association for Computational Linguistics

  29. [29]

    Jingcheng Niu, Xingdi Yuan, Tong Wang, Hamidreza Saghir, and Amir H. Abdi. 2025. https://doi.org/10.18653/v1/2025.acl-long.791 Llama see, llama do: A mechanistic perspective on contextual entrainment and distraction in LLM s . In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 16218--1...

  30. [30]

    Byung-Doh Oh and William Schuler. 2022. https://doi.org/10.18653/v1/2022.emnlp-main.632 Entropy- and distance-based predictors from GPT -2 attention patterns predict reading times over and above GPT -2 surprisal . In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 9324--9334, Abu Dhabi, United Arab Emirates. A...

  31. [31]

    Byung-Doh Oh and William Schuler. 2023. https://doi.org/10.1162/tacl_a_00548 Why does surprisal from larger transformer-based language models provide a poorer fit to human reading times? Transactions of the Association for Computational Linguistics, 11:336--350

  32. [32]

    Catherine Olsson, Nelson Elhage, Neel Nanda, Nicholas Joseph, Nova DasSarma, Tom Henighan, Ben Mann, Amanda Askell, Yuntao Bai, Anna Chen, Tom Conerly, Dawn Drain, Deep Ganguli, Zac Hatfield-Dodds, Danny Hernandez, Scott Johnston, Andy Jones, Jackson Kernion, Liane Lovitt, and 7 others. 2022. In-context learning and induction heads. Transformer Circuits T...

  33. [33]

    Wagers, and Ellen F

    Colin Phillips, Matthew W. Wagers, and Ellen F. Lau. 2011. https://api.semanticscholar.org/CorpusID:40053259 5: Grammatical illusions and selective fallibility in real-time language comprehension

  34. [34]

    Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, Ilya Sutskever, and 1 others. 2019. Language models are unsupervised multitask learners. OpenAI blog, 1(8):9

  35. [35]

    a fl, Johannes Lehner, Philipp Seidl, Michael Widrich, Lukas Gruber, Markus Holzleitner, Thomas Adler, David Kreil, Michael K Kopp, G \

    Hubert Ramsauer, Bernhard Sch \"a fl, Johannes Lehner, Philipp Seidl, Michael Widrich, Lukas Gruber, Markus Holzleitner, Thomas Adler, David Kreil, Michael K Kopp, G \"u nter Klambauer, Johannes Brandstetter, and Sepp Hochreiter. 2021. https://openreview.net/forum?id=tL89RnzIiCd Hopfield networks is all you need . In International Conference on Learning R...

  36. [36]

    Soo Hyun Ryu and Richard Lewis. 2021. https://doi.org/10.18653/v1/2021.cmcl-1.6 Accounting for agreement phenomena in sentence comprehension with transformer language models: Effects of similarity-based interference on surprisal and attention . In Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics, pages 61--71, Online. Associ...

  37. [37]

    Soo Hyun Ryu and Richard L. Lewis. 2025. https://doi.org/10.1016/j.jml.2025.104670 Memory for prediction: A transformer-based theory of sentence processing . Journal of Memory and Language, 145:104670

  38. [38]

    Tijn Schmitz, Rick Nouwen, and Jakub Dotlačil. 2025. https://doi.org/10.1016/j.jml.2025.104637 Memory retrieval in discourse: Illusions of coherence during presupposition resolution . Journal of Memory and Language, 143:104637

  39. [39]

    Cory Shain, Clara Meister, Tiago Pimentel, Ryan Cotterell, and Roger Levy. 2024. https://doi.org/10.1073/pnas.2307876121 Large-scale evidence for logarithmic effects of word predictability on reading time . Proceedings of the National Academy of Sciences, 121(10):e2307876121

  40. [40]

    C. E. Shannon. 1948. https://doi.org/10.1002/j.1538-7305.1948.tb01338.x A mathematical theory of communication . Bell System Technical Journal, 27(3):379--423

  41. [41]

    Freda Shi, Xinyun Chen, Kanishka Misra, Nathan Scales, David Dohan, Ed Chi, Nathanael Sch\" a rli, and Denny Zhou. 2023. Large language models can be easily distracted by irrelevant context. In Proceedings of the 40th International Conference on Machine Learning, ICML'23. JMLR.org

  42. [42]

    Michelle Suijkerbuijk, Zo \"e Prins, Marianne de Heer Kloots, Willem Zuidema, and Stefan L. Frank. 2025. https://doi.org/10.1162/coli_a_00559 BL i MP - NL : A corpus of D utch minimal pairs and acceptability judgments for language model evaluation . Computational Linguistics, 51(4):1267--1301

  43. [43]

    Eleftheria Tsipidi, Samuel Kiegeland, Francesco Ignazio Re, Tianyang Xu, Mario Giulianelli, Karolina Stanczak, and Ryan Cotterell. 2026. https://arxiv.org/abs/2604.18712 Probing for reading times . Preprint, arXiv:2604.18712

  44. [44]

    Julie A Van Dyke and Richard L Lewis. 2003. https://doi.org/10.1016/S0749-596X(03)00081-0 Distinguishing effects of structure and decay on attachment and repair: A cue-based parsing account of recovery from misanalyzed ambiguities . Journal of Memory and Language, 49(3):285--316

  45. [45]

    Bram Vanroy. 2023. Language resources for Dutch large language modelling. arXiv preprint arXiv:2312.12852

  46. [46]

    Bram Vanroy. 2024. https://arxiv.org/abs/2412.04092 Geitje 7b ultra: A conversational model for dutch . Preprint, arXiv:2412.04092

  47. [47]

    Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, ukasz Kaiser, and Illia Polosukhin. 2017. https://proceedings.neurips.cc/paper_files/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf Attention is all you need . In Advances in Neural Information Processing Systems, volume 30. Curran Associates, Inc

  48. [48]

    Titus von der Malsburg and Sebastian Pad \'o . 2026. Diverging transformer predictions for human sentence processing: A comprehensive analysis of agreement attraction effects. arXiv preprint arXiv:2603.16574

  49. [49]

    Kevin Ro Wang, Alexandre Variengien, Arthur Conmy, Buck Shlegeris, and Jacob Steinhardt. 2023. https://openreview.net/forum?id=NpsVSN6o4ul Interpretability in the wild: a circuit for indirect object identification in GPT -2 small . In The Eleventh International Conference on Learning Representations

  50. [50]

    Ethan G Wilcox, Tiago Pimentel, Clara Meister, Ryan Cotterell, and Roger P Levy. 2023. Testing the predictions of surprisal theory in 11 languages. Transactions of the Association for Computational Linguistics, 11:1451--1470

  51. [51]

    An Yang, Anfeng Li, Baosong Yang, Beichen Zhang, Binyuan Hui, Bo Zheng, Bowen Yu, Chang Gao, Chengen Huang, Chenxu Lv, Chujie Zheng, Dayiheng Liu, Fan Zhou, Fei Huang, Feng Hu, Hao Ge, Haoran Wei, Huan Lin, Jialong Tang, and 41 others. 2025. https://arxiv.org/abs/2505.09388 Qwen3 technical report . Preprint, arXiv:2505.09388

  52. [52]

    Ori Yoran, Tomer Wolfson, Ori Ram, and Jonathan Berant. 2024. https://openreview.net/forum?id=ZS4m74kZpH Making retrieval-augmented language models robust to irrelevant context . In The Twelfth International Conference on Learning Representations

  53. [53]

    Xompero, Leonardo Ranaldi, Davide Venditti, Federico Ranaldi, Cristina Giannone, Andrea Favalli, and Raniero Romagnoli

    Fabio Massimo Zanzotto, Elena Sofia Ruzzetti, Giancarlo A. Xompero, Leonardo Ranaldi, Davide Venditti, Federico Ranaldi, Cristina Giannone, Andrea Favalli, and Raniero Romagnoli. 2025. https://doi.org/10.18653/v1/2025.findings-acl.785 Position paper: M e M o: Towards language models with associative memory mechanisms . In Findings of the Association for C...