What's in a Name? Morphological Shortcuts by LLMs in Pharmacology

Byron C. Wallace; Chantal Shaib; Junyi Jessy Li; Kaijie Mo; Kanishka Misra; Qing Yao; Ramez Kouzy; Thomas Yang; William Rudman

arxiv: 2606.05616 · v1 · pith:E2MAWBP6new · submitted 2026-06-04 · 💻 cs.CL

What's in a Name? Morphological Shortcuts by LLMs in Pharmacology

Kaijie Mo , Thomas Yang , Chantal Shaib , Qing Yao , William Rudman , Ramez Kouzy , Kanishka Misra , Byron C. Wallace

show 1 more author

Junyi Jessy Li

This is my paper

Pith reviewed 2026-06-28 01:47 UTC · model grok-4.3

classification 💻 cs.CL

keywords large language modelspharmacologymorphological shortcutsaffix heuristicsdrug semanticsactivation patchingfictitious names

0 comments

The pith

Large language models often determine drug meanings from affixes in their names rather than full context.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper examines how LLMs in pharmacology rely on morphological cues like affixes to infer drug properties. Using fictitious drug names constructed from real affixes, the authors demonstrate that these signals alone trigger class-level responses and plausible clinical content. They develop a framework to distinguish affix-driven, stem-driven, or whole-name semantics across hundreds of drugs. Results indicate frequent affix reliance that models do not disclose, along with errors in conflating similar drugs. Mechanistic probes locate the behavior in early to mid layers of the models.

Core claim

The central discovery is that LLMs induce drug meaning primarily through affix cues in pharmacology, rarely indicate this reliance explicitly, and sometimes incorrectly conflate properties among affix-sharing drugs. This is evidenced by behavioral experiments with fictitious drugs and confirmed through activation patching that localizes the behavior to early-mid layers.

What carries the argument

The framework for identifying whether drug semantics are driven by the affix, the stem, or the full drug name, which separates morphological shortcut effects from other influences.

If this is right

Models generate plausible clinical content for fictitious drugs based solely on affixes.
Affix signals elicit class-level pharmacological responses without full name context.
The reliance on affixes is not usually stated by the models.
Conflation of properties occurs among drugs sharing affixes.
Activation patching shows this behavior originates in early-mid layers.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Similar morphological shortcuts could affect LLM performance in other technical fields like chemistry.
Developers might need to audit training data for affix-pattern biases to reduce such reliance.
Clinicians using LLMs for drug queries should cross-check outputs against full name semantics.

Load-bearing premise

That responses to fictitious drug names built from real affixes isolate affix-driven semantics without confounding effects from other linguistic patterns or model training data overlaps.

What would settle it

A test where models are prompted with two fictitious drugs sharing an affix but assigned conflicting properties in the prompt, and checking if they still follow the affix cue or adapt to the new information.

Figures

Figures reproduced from arXiv: 2606.05616 by Byron C. Wallace, Chantal Shaib, Junyi Jessy Li, Kaijie Mo, Kanishka Misra, Qing Yao, Ramez Kouzy, Thomas Yang, William Rudman.

**Figure 1.** Figure 1: Example of morphology-driven inference. Humans may cautiously infer that dimicillin resembles an antibiotic due to the suffix “-cillin”, while LLMs may produce similarly confident continuations for both real and fictitious drugs. We systematically quantify this behavior at both the behavioral and mechanistic levels. shown in [PITH_FULL_IMAGE:figures/full_fig_p001_1.png] view at source ↗

**Figure 2.** Figure 2: Model responses across Real, Fake, and Nonce conditions in multiple-choice (top) and open-ended [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: The framework used to compute the Affix, Stem, and Holistic scores. (a) For each real drug condition [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗

**Figure 4.** Figure 4: Layer- and position-wise activation patching effects in [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗

**Figure 5.** Figure 5: Last-token activation patching effects in [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗

**Figure 6.** Figure 6: Model behavior under the Real-Fake and Fake [PITH_FULL_IMAGE:figures/full_fig_p012_6.png] view at source ↗

**Figure 7.** Figure 7: Results for the bare-question setting (“What [PITH_FULL_IMAGE:figures/full_fig_p012_7.png] view at source ↗

**Figure 8.** Figure 8: Model behavior under original prompting and CoT prompting across the Fake, Nonce, and Real conditions. [PITH_FULL_IMAGE:figures/full_fig_p013_8.png] view at source ↗

**Figure 9.** Figure 9: Per-affix generalization stability across nonce stem variations ( [PITH_FULL_IMAGE:figures/full_fig_p014_9.png] view at source ↗

**Figure 10.** Figure 10: Per-affix generalization stability across nonce stem variations ( [PITH_FULL_IMAGE:figures/full_fig_p014_10.png] view at source ↗

**Figure 11.** Figure 11: Signal type distribution as the No-signal threshold varies from 0 to 1 for the MC and OE tasks. The [PITH_FULL_IMAGE:figures/full_fig_p015_11.png] view at source ↗

**Figure 12.** Figure 12: Threshold sensitivity analysis across models and tasks. We vary the dominance margin threshold [PITH_FULL_IMAGE:figures/full_fig_p016_12.png] view at source ↗

**Figure 13.** Figure 13: RR→NR signal-type transitions across models and tasks. Each row is normalized independently and shows the percentage of RR drugs in each original signal category that transition into each NR signal category after stem perturbation. 0.2 0.0 0.2 0.4 stage 1 stage 2 stage 3 instruct holistic group RR NR NN pt-s1-0 pt-s1-1000 pt-s1-2000 pt-s1-5000 pt-s1-10000 pt-s1-20000 pt-s1-50000 pt-s1-100000 pt-s1-200000 … view at source ↗

**Figure 14.** Figure 14: Training dynamics of class-preference scores across checkpoints for holistic and affix-dependent drug [PITH_FULL_IMAGE:figures/full_fig_p017_14.png] view at source ↗

**Figure 15.** Figure 15: Layer- and position-wise activation patching effects for [PITH_FULL_IMAGE:figures/full_fig_p018_15.png] view at source ↗

**Figure 16.** Figure 16: Last-token RR→NN and NR→NN activation patching effects in Qwen2.5-7B-Instruct. For affix-class drugs, NR→NN closely matches RR→NN across early-to-middle layers, indicating that affix information alone reproduces most of the full real-drug effect. In contrast, holistic-class drugs show substantially larger RR→NN than NR→NN effects, indicating that affix information remains necessary but is insufficient on … view at source ↗

**Figure 17.** Figure 17: Layer-wise DAS intervention results on OLMo-3-7B-Instruct. KL reduction, top-1 accuracy, and definition-margin shifts peak in the early-middle layers, with the strongest and most consistent effects around layers 7–10. Metrics We evaluate DAS using three metrics: KL reduction, measuring alignment with the target distribution; Top-1 accuracy, measuring whether the patched prediction flips to the source pred… view at source ↗

**Figure 18.** Figure 18: Steering along the learned DAS directions produces bidirectional control over affix-driven behavior. [PITH_FULL_IMAGE:figures/full_fig_p020_18.png] view at source ↗

read the original abstract

The morphological form of a word can often give cues to its meaning, but purely relying on these mappings can lead to overgeneralization in high-stakes domains. In the medical domain, for instance, LLMs can confidently reason about fictitious drugs from their affixes alone (e.g., wugcillin) and generate plausible-looking clinical content. We present a behavioral and mechanistic study of LLM "affix heuristics" in pharmacology. Using fictitious drug names built from real affixes, we show that affix signals alone elicit class-level pharmacological responses. We introduce a framework for identifying whether a model's drug semantics are driven mainly by the affix, the stem, or the drug name as a whole. Applied across 653 drugs, our framework reveals that models often induce drug meaning primarily through affix cues, yet rarely explicitly indicate this reliance, and sometimes incorrectly conflate properties among affix-sharing drugs. Activation patching across models further localizes this behavior to early-mid layers. These findings show that morphological shortcuts pose a subtle but measurable risk to safety.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper shows LLMs default to affix cues for drug class reasoning on made-up names and localizes it via patching, but the stem controls look too loose to fully support the shortcut claim.

read the letter

The core finding is that models often pull pharmacological properties from real affixes in fictitious drug names, and activation patching puts this behavior in early-to-mid layers. They also give a framework that tries to separate affix-driven, stem-driven, or whole-name semantics across 653 examples.

What stands out is the attempt to make the morphological shortcut measurable and localizable rather than just noting it happens. Testing on constructed names like wugcillin and then patching is a reasonable way to probe where the model is getting its signal. That part is new enough to be worth seeing in print.

The soft spot is exactly the one the stress-test flags. Pairing real affixes with novel stems does not automatically guarantee the stem carries no residual training signal, n-gram overlap, or co-occurrence statistics. If those stems still trigger partial matches, the class-level outputs and the patching results could reflect broader morphological sensitivity instead of a clean affix heuristic. The abstract gives no numbers on how stems were vetted or what statistical checks were run, so it is hard to judge whether the isolation worked. The claim that models rarely flag their own reliance also needs tighter quantification than the abstract supplies.

This is for people who care about LLM reliability in medical or safety-critical text generation. The question it raises is real, even if the current evidence is preliminary. It should go to peer review so the name-construction details and controls can be checked properly.

Referee Report

2 major / 2 minor

Summary. The paper presents a behavioral and mechanistic investigation of morphological 'affix heuristics' in LLMs applied to pharmacology. Using fictitious drug names constructed from real affixes (e.g., 'wugcillin'), it demonstrates that affix signals alone can elicit class-level pharmacological responses. A framework is introduced to attribute drug semantics primarily to the affix, stem, or full name; when applied to 653 drugs, it finds that models predominantly rely on affix cues, rarely state this reliance explicitly, and sometimes conflate properties across affix-sharing drugs. Activation patching localizes the behavior to early-to-mid layers, with the conclusion that such shortcuts represent a measurable safety risk.

Significance. If the central isolation of affix effects holds, the work provides concrete evidence of a subtle but systematic failure mode in high-stakes medical reasoning, supported by scale (653 drugs) and mechanistic localization via activation patching. This strengthens the case for targeted interpretability interventions in domain-specific LLM applications and offers falsifiable predictions about layer-wise morphological sensitivity.

major comments (2)

[§3] §3 (Framework for attributing semantics): The claim that responses to fictitious names isolate affix-driven semantics rests on the assumption that novel stems carry no residual distributional signals, n-gram overlaps, or affix-stem co-occurrence statistics from pretraining. Without reported controls (e.g., stem perplexity matching, n-gram frequency audits, or ablation on stem-only baselines), the class-level pharmacological outputs and the 'primarily through affix cues' finding for the 653-drug corpus could reflect confounds rather than pure affix heuristics. This is load-bearing for both the behavioral results and the activation-patching localization.
[§4.2] §4.2 (Rarely explicitly indicate reliance): The finding that models rarely state affix reliance explicitly is derived from the same fictitious-name probes; if stem confounds are present, the 'rarely explicitly indicate' and 'incorrectly conflate properties' conclusions cannot be cleanly attributed to affix shortcuts versus broader morphological or lexical leakage.

minor comments (2)

[Abstract, §2] The abstract and §2 should include a brief statement of the statistical tests and validation procedure used to confirm that class-level responses exceed chance baselines.
[Figure 4] Figure captions for activation-patching results should explicitly state the number of models, layers probed, and the precise patching metric (e.g., logit difference or probability shift).

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive comments. The concerns about potential confounds in the fictitious-name probes are well-taken and directly relevant to the load-bearing claims. We respond point-by-point below and indicate where revisions will be made.

read point-by-point responses

Referee: [§3] §3 (Framework for attributing semantics): The claim that responses to fictitious names isolate affix-driven semantics rests on the assumption that novel stems carry no residual distributional signals, n-gram overlaps, or affix-stem co-occurrence statistics from pretraining. Without reported controls (e.g., stem perplexity matching, n-gram frequency audits, or ablation on stem-only baselines), the class-level pharmacological outputs and the 'primarily through affix cues' finding for the 653-drug corpus could reflect confounds rather than pure affix heuristics. This is load-bearing for both the behavioral results and the activation-patching localization.

Authors: We agree that the isolation of affix effects would be strengthened by explicit controls for residual stem signals. While the fictitious stems were constructed as novel non-words and the attribution framework already compares affix vs. stem contributions (showing affix dominance), we did not report stem perplexity or n-gram audits in the original manuscript. In revision we will add these controls for a representative sample of the fictitious names and include a stem-only baseline ablation; results will be reported in an expanded §3 with a new limitations paragraph. revision: partial
Referee: [§4.2] §4.2 (Rarely explicitly indicate reliance): The finding that models rarely state affix reliance explicitly is derived from the same fictitious-name probes; if stem confounds are present, the 'rarely explicitly indicate' and 'incorrectly conflate properties' conclusions cannot be cleanly attributed to affix shortcuts versus broader morphological or lexical leakage.

Authors: We concur that the §4.2 conclusions inherit the same potential confounds. The planned addition of stem-perplexity and baseline controls in §3 will therefore be cross-referenced in §4.2, and we will qualify the 'rarely explicitly indicate' and conflation claims accordingly while preserving the core observation that explicit statements of affix reliance remain infrequent even under the controlled probes. revision: partial

Circularity Check

0 steps flagged

No circularity: empirical framework with no derivations or self-referential reductions

full rationale

The paper describes an empirical behavioral and mechanistic study of LLM affix heuristics using constructed fictitious drug names and a new attribution framework applied to 653 drugs, followed by activation patching. No equations, fitted parameters, predictions derived from inputs, or derivation chains are present in the provided text. The central claims rest on experimental results rather than any self-definitional, fitted-input, or self-citation load-bearing steps. The framework for distinguishing affix/stem/whole-name contributions is a methodological tool, not a reduction to its own outputs. This is a standard non-circular empirical analysis.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the unstated assumption that fictitious names isolate affix effects; no free parameters or invented entities are mentioned.

axioms (1)

domain assumption Responses to fictitious drug names built from real affixes reflect affix-based heuristics rather than other factors
Core premise of the behavioral study described in the abstract.

pith-pipeline@v0.9.1-grok · 5738 in / 1099 out tokens · 33116 ms · 2026-06-28T01:47:00.499935+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

49 extracted references · 19 canonical work pages

[1]

Vibhor Agarwal, Yiqiao Jin, Mohit Chandra, Munmun De Choudhury, Srijan Kumar, and Nishanth Sastry. 2024. https://doi.org/10.48550/arXiv.2409.19492 Medhalu: Hallucinations in responses to healthcare queries by large language models . arXiv preprint arXiv:2409.19492

work page doi:10.48550/arxiv.2409.19492 2024
[5]

Nicola Dawson, Kathleen Rastle, and Jessie Ricketts. 2021. https://doi.org/10.1111/1467-9817.12338 Bridging form and meaning: support from derivational suffixes in word learning . Journal of Research in Reading, 44(1):27--50

work page doi:10.1111/1467-9817.12338 2021
[6]

Mengnan Du, Fengxiang He, Na Zou, Dacheng Tao, and Xia Hu. 2023. https://doi.org/10.1145/3596490 Shortcut learning of large language models in natural language understanding . Communications of the ACM, 67(1):110--120

work page doi:10.1145/3596490 2023
[7]

Atticus Geiger, Zhengxuan Wu, Christopher Potts, Thomas Icard, and Noah Goodman. 2024. https://proceedings.mlr.press/v236/geiger24a.html Finding alignments between interpretable causal variables and distributed neural representations . In Causal Learning and Reasoning, pages 160--187. PMLR

2024
[8]

Robert Geirhos, J \"o rn-Henrik Jacobsen, Claudio Michaelis, Richard Zemel, Wieland Brendel, Matthias Bethge, and Felix A Wichmann. 2020. https://doi.org/10.1038/s42256-020-00257-z Shortcut learning in deep neural networks . Nature Machine Intelligence, 2(11):665--673

work page doi:10.1038/s42256-020-00257-z 2020
[9]

Aaron Grattafiori, Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Alex Vaughan, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, and 542 others. 2024. https://arxiv.org/abs/2407.21783 The llama 3...

Pith/arXiv arXiv 2024
[11]

Kelli Henry, Brian Murray, Xingmeng Zhao, Kaitlin Blotske, Yanjun Gao, Brooke Smith, Khoa Le, Susan E Smith, Erin F Barreto, Seth Bauer, and 1 others. 2026. https://doi.org/10.64898/2026.01.12.26343930 Drug or pok \'e mon? large language model performance in identification of fabricated medications . medRxiv, pages 2026--01

work page doi:10.64898/2026.01.12.26343930 2026
[12]

Valentin Hofmann, Leonie Weissweiler, David R Mortensen, Hinrich Sch \"u tze, and Janet B Pierrehumbert. 2025. https://doi.org/10.1073/pnas.2423232122 Derivational morphology reveals analogical generalization in large language models . Proceedings of the National Academy of Sciences, 122(19):e2423232122

work page doi:10.1073/pnas.2423232122 2025
[13]

Yubin Kim, Hyewon Jeong, Shan Chen, Shuyue Stella Li, Chanwoo Park, Mingyu Lu, Kumail Alhamoud, Jimin Mun, Cristina Grau, Minseok Jung, Rodrigo Gameiro, Lizhou Fan, Eugene Park, Tristan Lin, Joonsik Yoon, Wonjin Yoon, Maarten Sap, Yulia Tsvetkov, Paul Liang, and 8 others. 2025. https://doi.org/10.48550/arXiv.2503.05777 Medical hallucinations in foundation...

work page doi:10.48550/arxiv.2503.05777 2025
[15]

Wallace, and Junyi Jessy Li

Kaijie Mo, Siddhartha Venkatayogi, Chantal Shaib, Ramez Kouzy, Wei Xu, Byron C. Wallace, and Junyi Jessy Li. 2026. https://arxiv.org/abs/2601.11886 Faithfulness vs. safety: Evaluating llm behavior under counterfactual medical evidence . Preprint, arXiv:2601.11886

Pith/arXiv arXiv 2026
[16]

Mahmud Omar, Vera Sorin, Jeremy D Collins, David Reich, Robert Freeman, Nicholas Gavin, Alexander Charney, Lisa Stump, Nicola Luigi Bragazzi, Girish N Nadkarni, and 1 others. 2025. https://doi.org/10.1038/s43856-025-01021-3 Multi-model assurance analysis showing large language models are highly vulnerable to adversarial hallucination attacks during clinic...

work page doi:10.1038/s43856-025-01021-3 2025
[17]

Nikoleta Pantelidou, Evelina Leivada, Raquel Montero, and Paolo Morosi. 2026. https://doi.org/10.1371/journal.pone.0343164 Community size rather than grammatical complexity better predicts large language model accuracy in a novel wug test . PLOS ONE, 21(3):e0343164

work page doi:10.1371/journal.pone.0343164 2026
[18]

Walter Quattrociocchi, Valerio Capraro, and Matjaž Perc. 2025. https://arxiv.org/abs/2512.19466 Epistemological fault lines between human and artificial intelligence . Preprint, arXiv:2512.19466

arXiv 2025
[19]

Marta Serafini, Sarah Cargnin, Alberto Massarotti, Gian Cesare Tron, Tracey Pirali, and Armando A Genazzani. 2021. https://pubs.acs.org/doi/10.1021/acs.jmedchem.1c00181 What’s in a name? drug nomenclature and medicinal chemistry trends using inn publications . Journal of Medicinal Chemistry, 64(8):4410--4429

work page doi:10.1021/acs.jmedchem.1c00181 2021
[20]

Olga Solaja and Davide Crepaldi. 2024. https://doi.org/10.1098/rsos.230094 The role of morphology in novel word learning: a registered report . Royal Society Open Science, 11(6):230094

work page doi:10.1098/rsos.230094 2024
[21]

Jakke Tamminen, Matthew H Davis, and Kathleen Rastle. 2015. https://doi.org/10.1016/j.cogpsych.2015.03.003 From specific examples to general knowledge in language learning . Cognitive psychology, 79:1--39

work page doi:10.1016/j.cogpsych.2015.03.003 2015
[22]

Marion Weller-Di Marco and Alexander Fraser. 2024. https://aclanthology.org/2024.lrec-main.90/ Analyzing the understanding of morphologically complex words in large language models . In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 1009--1020, Torino, Italia...

2024
[23]

World Health Organization . 2017. Guidance on the use of international nonproprietary names (inns) for pharmaceutical substances. https://www.who.int/publications/m/item/guidance-on-the-use-of-inns. Accessed: 2026-05-04

2017
[24]

Zhengxuan Wu, Aryaman Arora, Zheng Wang, Atticus Geiger, Dan Jurafsky, Christopher D Manning, and Christopher Potts. 2024. https://doi.org/10.52202/079017-2041 Reft: Representation finetuning for language models . Advances in Neural Information Processing Systems, 37:63908--63962

work page doi:10.52202/079017-2041 2024
[25]

An Yang, Baosong Yang, Beichen Zhang, Binyuan Hui, Bo Zheng, Bowen Yu, Chengyuan Li, Dayiheng Liu, Fei Huang, Haoran Wei, Huan Lin, Jian Yang, Jianhong Tu, Jianwei Zhang, Jianxin Yang, Jiaxi Yang, Jingren Zhou, Junyang Lin, Kai Dang, and 23 others. 2025. https://arxiv.org/abs/2412.15115 Qwen2.5 technical report . Preprint, arXiv:2412.15115

Pith/arXiv arXiv 2025
[27]

Cognitive psychology , volume=

From specific examples to general knowledge in language learning , author=. Cognitive psychology , volume=. 2015 , publisher=

2015
[28]

Causal Learning and Reasoning , pages=

Finding alignments between interpretable causal variables and distributed neural representations , author=. Causal Learning and Reasoning , pages=. 2024 , organization=

2024
[29]

Proceedings of the National Academy of Sciences , volume=

Derivational morphology reveals analogical generalization in large language models , author=. Proceedings of the National Academy of Sciences , volume=. 2025 , publisher=

2025
[30]

and Koller, Alexander , title =

Bender, Emily M. and Koller, Alexander. Climbing towards NLU : On Meaning, Form, and Understanding in the Age of Data. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020. doi:10.18653/v1/2020.acl-main.463

work page doi:10.18653/v1/2020.acl-main.463 2020
[31]

Krakauer , title =

Melanie Mitchell and David C. Krakauer , title =. Proceedings of the National Academy of Sciences , volume =. 2023 , doi =. https://www.pnas.org/doi/pdf/10.1073/pnas.2215907120 , abstract =

work page doi:10.1073/pnas.2215907120 2023
[32]

Journal of Medicinal Chemistry , volume=

What’s in a name? Drug nomenclature and medicinal chemistry trends using INN publications , author=. Journal of Medicinal Chemistry , volume=. 2021 , url=

2021
[33]

Nature Machine Intelligence , volume=

Shortcut learning in deep neural networks , author=. Nature Machine Intelligence , volume=. 2020 , url=

2020
[34]

Communications of the ACM , volume=

Shortcut learning of large language models in natural language understanding , author=. Communications of the ACM , volume=. 2023 , publisher=

2023
[35]

Do LLM s Overcome Shortcut Learning? An Evaluation of Shortcut Challenges in Large Language Models

Yuan, Yu and Zhao, Lili and Zhang, Kai and Zheng, Guangting and Liu, Qi. Do LLM s Overcome Shortcut Learning? An Evaluation of Shortcut Challenges in Large Language Models. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. 2024. doi:10.18653/v1/2024.emnlp-main.679

work page doi:10.18653/v1/2024.emnlp-main.679 2024
[36]

2025 , eprint=

Epistemological Fault Lines Between Human and Artificial Intelligence , author=. 2025 , eprint=

2025
[37]

Analyzing the Understanding of Morphologically Complex Words in Large Language Models

Weller-Di Marco, Marion and Fraser, Alexander. Analyzing the Understanding of Morphologically Complex Words in Large Language Models. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). 2024

2024
[38]

177--188

Anh, Dang and Raviv, Limor and Galke, Lukas. Morphology Matters: Probing the Cross-linguistic Morphological Generalization Abilities of Large Language Models through a Wug Test. Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics. 2024. doi:10.18653/v1/2024.cmcl-1.15 , pages = "177--188", abstract = "We develop a multilingual v...

work page doi:10.18653/v1/2024.cmcl-1.15 2024
[39]

Advances in Neural Information Processing Systems , volume=

Reft: Representation finetuning for language models , author=. Advances in Neural Information Processing Systems , volume=. 2024 , doi=

2024
[40]

Guidance on the Use of International Nonproprietary Names (INNs) for Pharmaceutical Substances , year=
[41]

PLOS ONE , volume=

Community size rather than grammatical complexity better predicts Large Language Model accuracy in a novel Wug Test , author=. PLOS ONE , volume=. 2026 , doi=

2026
[42]

arXiv preprint arXiv:2503.05777 , year=

Medical Hallucinations in Foundation Models and Their Impact on Healthcare , author =. arXiv preprint arXiv:2503.05777 , year=. 2503.05777 , archivePrefix=

arXiv
[43]

arXiv preprint arXiv:2409.19492 , year=

MedHalu: Hallucinations in Responses to Healthcare Queries by Large Language Models , author =. arXiv preprint arXiv:2409.19492 , year=. 2409.19492 , archivePrefix=

arXiv
[44]

Drug or Pok

Henry, Kelli and Murray, Brian and Zhao, Xingmeng and Blotske, Kaitlin and Gao, Yanjun and Smith, Brooke and Le, Khoa and Smith, Susan E and Barreto, Erin F and Bauer, Seth and others , journal=. Drug or Pok. 2026 , publisher=

2026
[45]

2026 , journal=

Rethinking Medical LLM Hallucinations: A System-Level Survey , author =. 2026 , journal=

2026
[46]

and Rust, Paul and Pearcy, Pauline and Nasir, Khurram and Mossialos, Elias , title =

van Kessel, Robin and Anderson, Michael and McMillan, Brian and Matthews, Marc R. and Rust, Paul and Pearcy, Pauline and Nasir, Khurram and Mossialos, Elias , title =. BMJ Health Care Informatics , year =
[47]

and Pierrehumbert, Janet B

Needle, Jeremy M. and Pierrehumbert, Janet B. and Hay, Jennifer B. , title =. Morphological Diversity and Linguistic Cognition , chapter =. 2022 , month = may, doi =

2022
[48]

Safety: Evaluating LLM Behavior Under Counterfactual Medical Evidence , author =

Faithfulness vs. Safety: Evaluating LLM Behavior Under Counterfactual Medical Evidence , author =. 2026 , eprint=

2026
[49]

Communications Medicine , volume=

Multi-model assurance analysis showing large language models are highly vulnerable to adversarial hallucination attacks during clinical decision support , author=. Communications Medicine , volume=. 2025 , publisher=

2025
[50]

Royal Society Open Science , volume=

The role of morphology in novel word learning: a registered report , author=. Royal Society Open Science , volume=. 2024 , url=

2024
[51]

Journal of Research in Reading , volume=

Bridging form and meaning: support from derivational suffixes in word learning , author=. Journal of Research in Reading , volume=. 2021 , publisher=

2021
[52]

Proceedings of the 62nd

Groeneveld, Dirk and Beltagy, Iz and Walsh, Evan and Bhagia, Akshita and Kinney, Rodney and Tafjord, Oyvind and Jha, Ananya and Ivison, Hamish and Magnusson, Ian and Wang, Yizhong and Arora, Shane and Atkinson, David and Authur, Russell and Chandu, Khyathi and Cohan, Arman and Dumas, Jennifer and Elazar, Yanai and Gu, Yuling and Hessel, Jack and Khot, Tus...

work page doi:10.18653/v1/2024.acl-long.841 2024
[53]

2025 , eprint=

Qwen2.5 Technical Report , author=. 2025 , eprint=

2025
[54]

Towards Medical Complex Reasoning with LLM s through Medical Verifiable Problems

Chen, Junying and Cai, Zhenyang and Ji, Ke and Wang, Xidong and Liu, Wanlong and Wang, Rongsheng and Wang, Benyou. Towards Medical Complex Reasoning with LLM s through Medical Verifiable Problems. Findings of the Association for Computational Linguistics: ACL 2025. 2025. doi:10.18653/v1/2025.findings-acl.751

work page doi:10.18653/v1/2025.findings-acl.751 2025
[55]

2024 , eprint=

The Llama 3 Herd of Models , author=. 2024 , eprint=

2024

[1] [1]

Vibhor Agarwal, Yiqiao Jin, Mohit Chandra, Munmun De Choudhury, Srijan Kumar, and Nishanth Sastry. 2024. https://doi.org/10.48550/arXiv.2409.19492 Medhalu: Hallucinations in responses to healthcare queries by large language models . arXiv preprint arXiv:2409.19492

work page doi:10.48550/arxiv.2409.19492 2024

[2] [5]

Nicola Dawson, Kathleen Rastle, and Jessie Ricketts. 2021. https://doi.org/10.1111/1467-9817.12338 Bridging form and meaning: support from derivational suffixes in word learning . Journal of Research in Reading, 44(1):27--50

work page doi:10.1111/1467-9817.12338 2021

[3] [6]

Mengnan Du, Fengxiang He, Na Zou, Dacheng Tao, and Xia Hu. 2023. https://doi.org/10.1145/3596490 Shortcut learning of large language models in natural language understanding . Communications of the ACM, 67(1):110--120

work page doi:10.1145/3596490 2023

[4] [7]

Atticus Geiger, Zhengxuan Wu, Christopher Potts, Thomas Icard, and Noah Goodman. 2024. https://proceedings.mlr.press/v236/geiger24a.html Finding alignments between interpretable causal variables and distributed neural representations . In Causal Learning and Reasoning, pages 160--187. PMLR

2024

[5] [8]

Robert Geirhos, J \"o rn-Henrik Jacobsen, Claudio Michaelis, Richard Zemel, Wieland Brendel, Matthias Bethge, and Felix A Wichmann. 2020. https://doi.org/10.1038/s42256-020-00257-z Shortcut learning in deep neural networks . Nature Machine Intelligence, 2(11):665--673

work page doi:10.1038/s42256-020-00257-z 2020

[6] [9]

Aaron Grattafiori, Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Alex Vaughan, Amy Yang, Angela Fan, Anirudh Goyal, Anthony Hartshorn, Aobo Yang, Archi Mitra, Archie Sravankumar, Artem Korenev, Arthur Hinsvark, and 542 others. 2024. https://arxiv.org/abs/2407.21783 The llama 3...

Pith/arXiv arXiv 2024

[7] [11]

Kelli Henry, Brian Murray, Xingmeng Zhao, Kaitlin Blotske, Yanjun Gao, Brooke Smith, Khoa Le, Susan E Smith, Erin F Barreto, Seth Bauer, and 1 others. 2026. https://doi.org/10.64898/2026.01.12.26343930 Drug or pok \'e mon? large language model performance in identification of fabricated medications . medRxiv, pages 2026--01

work page doi:10.64898/2026.01.12.26343930 2026

[8] [12]

Valentin Hofmann, Leonie Weissweiler, David R Mortensen, Hinrich Sch \"u tze, and Janet B Pierrehumbert. 2025. https://doi.org/10.1073/pnas.2423232122 Derivational morphology reveals analogical generalization in large language models . Proceedings of the National Academy of Sciences, 122(19):e2423232122

work page doi:10.1073/pnas.2423232122 2025

[9] [13]

Yubin Kim, Hyewon Jeong, Shan Chen, Shuyue Stella Li, Chanwoo Park, Mingyu Lu, Kumail Alhamoud, Jimin Mun, Cristina Grau, Minseok Jung, Rodrigo Gameiro, Lizhou Fan, Eugene Park, Tristan Lin, Joonsik Yoon, Wonjin Yoon, Maarten Sap, Yulia Tsvetkov, Paul Liang, and 8 others. 2025. https://doi.org/10.48550/arXiv.2503.05777 Medical hallucinations in foundation...

work page doi:10.48550/arxiv.2503.05777 2025

[10] [15]

Wallace, and Junyi Jessy Li

Kaijie Mo, Siddhartha Venkatayogi, Chantal Shaib, Ramez Kouzy, Wei Xu, Byron C. Wallace, and Junyi Jessy Li. 2026. https://arxiv.org/abs/2601.11886 Faithfulness vs. safety: Evaluating llm behavior under counterfactual medical evidence . Preprint, arXiv:2601.11886

Pith/arXiv arXiv 2026

[11] [16]

Mahmud Omar, Vera Sorin, Jeremy D Collins, David Reich, Robert Freeman, Nicholas Gavin, Alexander Charney, Lisa Stump, Nicola Luigi Bragazzi, Girish N Nadkarni, and 1 others. 2025. https://doi.org/10.1038/s43856-025-01021-3 Multi-model assurance analysis showing large language models are highly vulnerable to adversarial hallucination attacks during clinic...

work page doi:10.1038/s43856-025-01021-3 2025

[12] [17]

Nikoleta Pantelidou, Evelina Leivada, Raquel Montero, and Paolo Morosi. 2026. https://doi.org/10.1371/journal.pone.0343164 Community size rather than grammatical complexity better predicts large language model accuracy in a novel wug test . PLOS ONE, 21(3):e0343164

work page doi:10.1371/journal.pone.0343164 2026

[13] [18]

Walter Quattrociocchi, Valerio Capraro, and Matjaž Perc. 2025. https://arxiv.org/abs/2512.19466 Epistemological fault lines between human and artificial intelligence . Preprint, arXiv:2512.19466

arXiv 2025

[14] [19]

Marta Serafini, Sarah Cargnin, Alberto Massarotti, Gian Cesare Tron, Tracey Pirali, and Armando A Genazzani. 2021. https://pubs.acs.org/doi/10.1021/acs.jmedchem.1c00181 What’s in a name? drug nomenclature and medicinal chemistry trends using inn publications . Journal of Medicinal Chemistry, 64(8):4410--4429

work page doi:10.1021/acs.jmedchem.1c00181 2021

[15] [20]

Olga Solaja and Davide Crepaldi. 2024. https://doi.org/10.1098/rsos.230094 The role of morphology in novel word learning: a registered report . Royal Society Open Science, 11(6):230094

work page doi:10.1098/rsos.230094 2024

[16] [21]

Jakke Tamminen, Matthew H Davis, and Kathleen Rastle. 2015. https://doi.org/10.1016/j.cogpsych.2015.03.003 From specific examples to general knowledge in language learning . Cognitive psychology, 79:1--39

work page doi:10.1016/j.cogpsych.2015.03.003 2015

[17] [22]

Marion Weller-Di Marco and Alexander Fraser. 2024. https://aclanthology.org/2024.lrec-main.90/ Analyzing the understanding of morphologically complex words in large language models . In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 1009--1020, Torino, Italia...

2024

[18] [23]

World Health Organization . 2017. Guidance on the use of international nonproprietary names (inns) for pharmaceutical substances. https://www.who.int/publications/m/item/guidance-on-the-use-of-inns. Accessed: 2026-05-04

2017

[19] [24]

Zhengxuan Wu, Aryaman Arora, Zheng Wang, Atticus Geiger, Dan Jurafsky, Christopher D Manning, and Christopher Potts. 2024. https://doi.org/10.52202/079017-2041 Reft: Representation finetuning for language models . Advances in Neural Information Processing Systems, 37:63908--63962

work page doi:10.52202/079017-2041 2024

[20] [25]

An Yang, Baosong Yang, Beichen Zhang, Binyuan Hui, Bo Zheng, Bowen Yu, Chengyuan Li, Dayiheng Liu, Fei Huang, Haoran Wei, Huan Lin, Jian Yang, Jianhong Tu, Jianwei Zhang, Jianxin Yang, Jiaxi Yang, Jingren Zhou, Junyang Lin, Kai Dang, and 23 others. 2025. https://arxiv.org/abs/2412.15115 Qwen2.5 technical report . Preprint, arXiv:2412.15115

Pith/arXiv arXiv 2025

[21] [27]

Cognitive psychology , volume=

From specific examples to general knowledge in language learning , author=. Cognitive psychology , volume=. 2015 , publisher=

2015

[22] [28]

Causal Learning and Reasoning , pages=

Finding alignments between interpretable causal variables and distributed neural representations , author=. Causal Learning and Reasoning , pages=. 2024 , organization=

2024

[23] [29]

Proceedings of the National Academy of Sciences , volume=

Derivational morphology reveals analogical generalization in large language models , author=. Proceedings of the National Academy of Sciences , volume=. 2025 , publisher=

2025

[24] [30]

and Koller, Alexander , title =

Bender, Emily M. and Koller, Alexander. Climbing towards NLU : On Meaning, Form, and Understanding in the Age of Data. Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020. doi:10.18653/v1/2020.acl-main.463

work page doi:10.18653/v1/2020.acl-main.463 2020

[25] [31]

Krakauer , title =

Melanie Mitchell and David C. Krakauer , title =. Proceedings of the National Academy of Sciences , volume =. 2023 , doi =. https://www.pnas.org/doi/pdf/10.1073/pnas.2215907120 , abstract =

work page doi:10.1073/pnas.2215907120 2023

[26] [32]

Journal of Medicinal Chemistry , volume=

What’s in a name? Drug nomenclature and medicinal chemistry trends using INN publications , author=. Journal of Medicinal Chemistry , volume=. 2021 , url=

2021

[27] [33]

Nature Machine Intelligence , volume=

Shortcut learning in deep neural networks , author=. Nature Machine Intelligence , volume=. 2020 , url=

2020

[28] [34]

Communications of the ACM , volume=

Shortcut learning of large language models in natural language understanding , author=. Communications of the ACM , volume=. 2023 , publisher=

2023

[29] [35]

Do LLM s Overcome Shortcut Learning? An Evaluation of Shortcut Challenges in Large Language Models

Yuan, Yu and Zhao, Lili and Zhang, Kai and Zheng, Guangting and Liu, Qi. Do LLM s Overcome Shortcut Learning? An Evaluation of Shortcut Challenges in Large Language Models. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. 2024. doi:10.18653/v1/2024.emnlp-main.679

work page doi:10.18653/v1/2024.emnlp-main.679 2024

[30] [36]

2025 , eprint=

Epistemological Fault Lines Between Human and Artificial Intelligence , author=. 2025 , eprint=

2025

[31] [37]

Analyzing the Understanding of Morphologically Complex Words in Large Language Models

Weller-Di Marco, Marion and Fraser, Alexander. Analyzing the Understanding of Morphologically Complex Words in Large Language Models. Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). 2024

2024

[32] [38]

177--188

Anh, Dang and Raviv, Limor and Galke, Lukas. Morphology Matters: Probing the Cross-linguistic Morphological Generalization Abilities of Large Language Models through a Wug Test. Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics. 2024. doi:10.18653/v1/2024.cmcl-1.15 , pages = "177--188", abstract = "We develop a multilingual v...

work page doi:10.18653/v1/2024.cmcl-1.15 2024

[33] [39]

Advances in Neural Information Processing Systems , volume=

Reft: Representation finetuning for language models , author=. Advances in Neural Information Processing Systems , volume=. 2024 , doi=

2024

[34] [40]

Guidance on the Use of International Nonproprietary Names (INNs) for Pharmaceutical Substances , year=

[35] [41]

PLOS ONE , volume=

Community size rather than grammatical complexity better predicts Large Language Model accuracy in a novel Wug Test , author=. PLOS ONE , volume=. 2026 , doi=

2026

[36] [42]

arXiv preprint arXiv:2503.05777 , year=

Medical Hallucinations in Foundation Models and Their Impact on Healthcare , author =. arXiv preprint arXiv:2503.05777 , year=. 2503.05777 , archivePrefix=

arXiv

[37] [43]

arXiv preprint arXiv:2409.19492 , year=

MedHalu: Hallucinations in Responses to Healthcare Queries by Large Language Models , author =. arXiv preprint arXiv:2409.19492 , year=. 2409.19492 , archivePrefix=

arXiv

[38] [44]

Drug or Pok

Henry, Kelli and Murray, Brian and Zhao, Xingmeng and Blotske, Kaitlin and Gao, Yanjun and Smith, Brooke and Le, Khoa and Smith, Susan E and Barreto, Erin F and Bauer, Seth and others , journal=. Drug or Pok. 2026 , publisher=

2026

[39] [45]

2026 , journal=

Rethinking Medical LLM Hallucinations: A System-Level Survey , author =. 2026 , journal=

2026

[40] [46]

and Rust, Paul and Pearcy, Pauline and Nasir, Khurram and Mossialos, Elias , title =

van Kessel, Robin and Anderson, Michael and McMillan, Brian and Matthews, Marc R. and Rust, Paul and Pearcy, Pauline and Nasir, Khurram and Mossialos, Elias , title =. BMJ Health Care Informatics , year =

[41] [47]

and Pierrehumbert, Janet B

Needle, Jeremy M. and Pierrehumbert, Janet B. and Hay, Jennifer B. , title =. Morphological Diversity and Linguistic Cognition , chapter =. 2022 , month = may, doi =

2022

[42] [48]

Safety: Evaluating LLM Behavior Under Counterfactual Medical Evidence , author =

Faithfulness vs. Safety: Evaluating LLM Behavior Under Counterfactual Medical Evidence , author =. 2026 , eprint=

2026

[43] [49]

Communications Medicine , volume=

Multi-model assurance analysis showing large language models are highly vulnerable to adversarial hallucination attacks during clinical decision support , author=. Communications Medicine , volume=. 2025 , publisher=

2025

[44] [50]

Royal Society Open Science , volume=

The role of morphology in novel word learning: a registered report , author=. Royal Society Open Science , volume=. 2024 , url=

2024

[45] [51]

Journal of Research in Reading , volume=

Bridging form and meaning: support from derivational suffixes in word learning , author=. Journal of Research in Reading , volume=. 2021 , publisher=

2021

[46] [52]

Proceedings of the 62nd

Groeneveld, Dirk and Beltagy, Iz and Walsh, Evan and Bhagia, Akshita and Kinney, Rodney and Tafjord, Oyvind and Jha, Ananya and Ivison, Hamish and Magnusson, Ian and Wang, Yizhong and Arora, Shane and Atkinson, David and Authur, Russell and Chandu, Khyathi and Cohan, Arman and Dumas, Jennifer and Elazar, Yanai and Gu, Yuling and Hessel, Jack and Khot, Tus...

work page doi:10.18653/v1/2024.acl-long.841 2024

[47] [53]

2025 , eprint=

Qwen2.5 Technical Report , author=. 2025 , eprint=

2025

[48] [54]

Towards Medical Complex Reasoning with LLM s through Medical Verifiable Problems

Chen, Junying and Cai, Zhenyang and Ji, Ke and Wang, Xidong and Liu, Wanlong and Wang, Rongsheng and Wang, Benyou. Towards Medical Complex Reasoning with LLM s through Medical Verifiable Problems. Findings of the Association for Computational Linguistics: ACL 2025. 2025. doi:10.18653/v1/2025.findings-acl.751

work page doi:10.18653/v1/2025.findings-acl.751 2025

[49] [55]

2024 , eprint=

The Llama 3 Herd of Models , author=. 2024 , eprint=

2024