pith. sign in

arxiv: 2606.00570 · v1 · pith:SBX7WVRBnew · submitted 2026-05-30 · 💻 cs.CL · cs.AI

Revisiting Parameter-Based Knowledge Editing in Large Language Models: Theoretical Limits and Empirical Evidence

Pith reviewed 2026-06-28 19:01 UTC · model grok-4.3

classification 💻 cs.CL cs.AI
keywords knowledge editinglarge language modelsparameter editingretrieval augmentationreasoning collapsedimensional collapsemodel capabilities
0
0 comments X

The pith

Localized parameter edits in LLMs propagate to cause global reasoning collapse, while a simple retrieval baseline consistently outperforms them.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that parameter-based knowledge editing, which modifies internal weights locally, has fundamental theoretical limits that lead to interference across the model. It introduces the dimensional collapse hypothesis to show how these edits travel along fragile directions in representation space and produce widespread capability damage. Empirical tests across varying edit counts, knowledge complexity, and evaluation settings confirm that all tested editing methods impair core LLM functions such as reasoning and coherence. In direct comparison, an unchanged retrieval-based approach maintains stronger performance under every condition examined. The central takeaway is that future editing work must treat preservation of base capabilities as a primary requirement rather than an afterthought.

Core claim

Localized parameter edits propagate along fragile directions in representation space, inducing global interference that collapses reasoning performance; this effect is observed consistently across methods and settings, whereas retrieval-based access avoids the damage entirely.

What carries the argument

Dimensional Collapse Hypothesis, which models how localized weight changes spread along low-dimensional fragile directions to produce global capability loss.

If this is right

  • Any method that changes weights directly will face the same interference risk once edit count or complexity rises.
  • Retrieval mechanisms that leave parameters untouched can serve as a practical upper bound for capability preservation.
  • Evaluation protocols must include broad capability checks beyond the edited facts themselves.
  • Research priority should shift toward hybrid or non-parametric approaches that avoid representation-space collapse.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If the collapse mechanism is general, then scaling model size alone will not remove the interference problem without changes to editing strategy.
  • Retrieval baselines could be extended with lightweight adapters that never touch the original weights.
  • The same fragility directions may explain why fine-tuning on narrow tasks sometimes erodes broader abilities.

Load-bearing premise

The dimensional collapse hypothesis accurately describes the mechanism by which local edits create global interference.

What would settle it

An experiment in which multiple parameter edits are applied and the model shows no measurable drop in reasoning accuracy or coherence on held-out tasks while still incorporating the new facts.

Figures

Figures reproduced from arXiv: 2606.00570 by Aixin Sun, Futing Wang, Guoxiu He, Wanying Ren, Xin Song.

Figure 1
Figure 1. Figure 1: Training and inference workflows for four types of knowledge editing methods. The training process is shown above the green line, while the inference stage is shown below. (a) and (b) perform knowledge editing by modifying the LLM parameters. However, as world knowledge continuously evolves, some information encoded in LLMs inevitably becomes outdated or inaccurate (Mousavi et al., 2024; Ji et al., 2023). … view at source ↗
Figure 2
Figure 2. Figure 2: Distribution of Rk values in layer 30 of Llama-3.1-8B￾Instruct after performing 1000 sequential edits using MEMIT. 4.3. Cumulative Effects in Sequential Editing We now extend the analysis to sequential editing, where the model undergoes a series of updates: W(0) → W(1) → · · · → W(T) . Let h (t) denote the hidden representation after the t-th edit. The total representation change after T edits relative to … view at source ↗
Figure 3
Figure 3. Figure 3: Performance changes of knowledge editing methods during sequential editing of Llama-3.1-8B-Instruct on the ZsRE dataset. The x-axis represents the number of edits: 1, 10, 100, and ‘All’ for the full dataset. decoding with semantic consistency judged by Qwen2.5- 72B-Instruct across four dimensions (Zhang et al., 2024b). A token-level locality check confirms the semantic results (Appendix C.5). Full implemen… view at source ↗
Figure 4
Figure 4. Figure 4: Performance comparison between AlphaEdit and SCR on sequential editing tasks using the ELKEN dataset. RQ4: How do different knowledge editing methods com￾pare in terms of time efficiency? Beyond correctness and robustness, latency and efficiency are equally crucial for practical deployment. Effective edit￾ing methods should minimize both editing overhead and inference latency. To systematically evaluate th… view at source ↗
Figure 6
Figure 6. Figure 6: Directional relative change rate Rk across editing methods, model architectures, and steps. 30 [PITH_FULL_IMAGE:figures/full_fig_p030_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Performance changes of knowledge editing methods during sequential editing of Llama-2-7B-Chat on the ZsRE dataset. The x-axis represents the number of edits: 1, 10, 100, and the full dataset. 38 [PITH_FULL_IMAGE:figures/full_fig_p038_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Performance changes of knowledge editing methods during sequential editing of Mistral-7B-Instruct-v0.1 on the ZsRE dataset. The x-axis represents the number of edits: 1, 10, 100, and the full dataset. 39 [PITH_FULL_IMAGE:figures/full_fig_p039_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Editing time (in second), and inference latency relative to the base model. 41 [PITH_FULL_IMAGE:figures/full_fig_p041_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: SCR efficiency and performance metrics for sequential editing. Visual representation of the SCR’s inference time and editing stability across various memory sizes (100 and 1000) and retrieval windows (top-1, top-3, top-5, and top-10). 51 [PITH_FULL_IMAGE:figures/full_fig_p051_10.png] view at source ↗
read the original abstract

Parameter-based knowledge editing updates the internal knowledge of large language models (LLMs) via localized weight modifications and has attracted significant attention. However, most existing methods overlook fundamental theoretical limitations and are rarely evaluated under realistic, practice-oriented settings. In this paper, we first present a theoretical analysis based on the dimensional Collapse Hypothesis, explaining how localized parameter edits can propagate along fragile directions in the representation space, inducing global interference and ultimately causing reasoning collapse. Building on this insight, we conduct a comprehensive empirical evaluation by systematically varying knowledge complexity, number of edits, evaluation dimensions, and baseline methods. Our results show that parameter-based editing methods consistently damage core LLM capabilities. In contrast, a simple retrieval-based baseline achieves consistently stronger performance than all parameter-editing methods across all evaluated conditions. These findings highlight that preserving the fundamental capabilities of LLMs after knowledge editing should be a central concern for future research.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper claims that parameter-based knowledge editing in LLMs is subject to fundamental theoretical limits, which it explains via the Dimensional Collapse Hypothesis: localized weight updates propagate along fragile directions in representation space, causing global interference and reasoning collapse. It supports this with a comprehensive empirical evaluation that varies knowledge complexity, edit count, evaluation dimensions, and baselines, finding that all parameter-editing methods degrade core LLM capabilities while a simple retrieval-based baseline consistently outperforms them.

Significance. If the empirical findings hold under scrutiny, the work would usefully shift attention in the knowledge-editing literature toward capability preservation and retrieval alternatives. The systematic sweep across multiple axes is a clear strength and provides falsifiable, practice-oriented evidence. However, the explanatory mechanism (Dimensional Collapse Hypothesis) is presented without direct validation, reducing the paper's ability to move beyond post-hoc interpretation.

major comments (2)
  1. [theoretical analysis] Theoretical analysis section (as described in the abstract): the Dimensional Collapse Hypothesis is invoked to explain how localized edits induce global interference, yet the manuscript reports no direct measurements (effective rank, singular-value spectra, or directional fragility metrics) of representation space before versus after edits. Without such tests the hypothesis remains an unverified interpretive lens rather than a validated causal account, leaving open alternative explanations such as generic optimization instability.
  2. [empirical evaluation] Empirical evaluation (abstract and results): while the central claim that parameter edits damage capabilities rests on the sweep, the abstract provides no quantitative results, error bars, dataset sizes, or model scales. If these details are absent or insufficiently reported in the full text, the strength of the cross-condition superiority claim cannot be assessed.
minor comments (1)
  1. Clarify whether the retrieval baseline is purely non-parametric or involves any learned components, as this affects the interpretation of 'parameter-based' versus 'retrieval-based' distinctions.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address each major comment below and indicate planned revisions.

read point-by-point responses
  1. Referee: [theoretical analysis] Theoretical analysis section (as described in the abstract): the Dimensional Collapse Hypothesis is invoked to explain how localized edits induce global interference, yet the manuscript reports no direct measurements (effective rank, singular-value spectra, or directional fragility metrics) of representation space before versus after edits. Without such tests the hypothesis remains an unverified interpretive lens rather than a validated causal account, leaving open alternative explanations such as generic optimization instability.

    Authors: The Dimensional Collapse Hypothesis is derived from a theoretical analysis of how localized weight updates interact with the geometry of representation spaces. The manuscript supports it via the observed pattern of global interference across diverse edit conditions. We agree that direct measurements would strengthen the causal claim over post-hoc interpretation. In revision we will add experiments reporting effective rank, singular-value spectra, and directional fragility metrics before versus after edits. revision: yes

  2. Referee: [empirical evaluation] Empirical evaluation (abstract and results): while the central claim that parameter edits damage capabilities rests on the sweep, the abstract provides no quantitative results, error bars, dataset sizes, or model scales. If these details are absent or insufficiently reported in the full text, the strength of the cross-condition superiority claim cannot be assessed.

    Authors: The full manuscript reports all quantitative results (performance deltas with standard deviations), dataset sizes, model scales, and evaluation protocols in the results section and appendices. To make the strength of the claims immediately visible, we will revise the abstract to include representative quantitative outcomes and error-bar information. revision: partial

Circularity Check

0 steps flagged

No significant circularity; empirical claims rest on new measurements independent of the invoked hypothesis

full rationale

The paper's central results are empirical comparisons of parameter-editing methods versus a retrieval baseline across varied conditions of knowledge complexity, edit count, and evaluation dimensions. These outcomes are generated from direct experiments and do not reduce to any fitted parameter, self-defined quantity, or self-citation chain. The Dimensional Collapse Hypothesis is cited only as the basis for a theoretical explanation of observed interference; it is not used to derive or force any reported performance numbers, nor does the paper present predictions that are then verified by construction from the same hypothesis. No self-definitional, fitted-input, or uniqueness-imported steps appear in the provided derivation chain. The hypothesis functions as post-hoc interpretive framing rather than a load-bearing premise that collapses the empirical findings back into the inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the dimensional collapse hypothesis as an explanatory mechanism and on the assumption that the chosen evaluation dimensions and baselines adequately represent practice-oriented settings; no free parameters or invented entities are introduced in the abstract.

axioms (1)
  • domain assumption Dimensional Collapse Hypothesis
    Invoked to explain propagation of localized edits into global interference and reasoning collapse.

pith-pipeline@v0.9.1-grok · 5690 in / 1133 out tokens · 23974 ms · 2026-06-28T19:01:10.450637+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

125 extracted references · 40 canonical work pages · 12 internal anchors

  1. [1]

    A review on language models as knowledge bases

    AlKhamissi, B., Li, M., Celikyilmaz, A., Diab, M., and Ghazvininejad, M. A review on language models as knowledge bases. arXiv preprint arXiv:2204.06031, 2022

  2. [2]

    and Li, Y

    Allen-Zhu, Z. and Li, Y. Physics of language models: Part 3.3, knowledge capacity scaling laws. arXiv preprint arXiv:2404.05405, 2024

  3. [3]

    Self-rag: Learning to retrieve, generate, and critique through self-reflection

    Asai, A., Wu, Z., Wang, Y., Sil, A., and Hajishirzi, H. Self-rag: Learning to retrieve, generate, and critique through self-reflection. In The Twelfth International Conference on Learning Representations

  4. [4]

    D., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A., et al

    Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J. D., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A., et al. Language models are few-shot learners. Advances in neural information processing systems, 33: 0 1877--1901, 2020

  5. [5]

    Lifelong knowledge editing for llms with retrieval-augmented continuous prompt learning

    Chen, Q., Zhang, T., He, X., Li, D., Wang, C., Huang, L., et al. Lifelong knowledge editing for llms with retrieval-augmented continuous prompt learning. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp.\ 13565--13580, 2024

  6. [6]

    Uniedit: A unified knowledge editing benchmark for large language models

    Chen, Q., Wang, D., Zhang, T., Yan, Z., You, C., Wang, C., and He, X. Uniedit: A unified knowledge editing benchmark for large language models. In The Thirty-ninth Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track, 2025. URL https://openreview.net/forum?id=eESbQ5lWiP

  7. [7]

    Towards understanding the mixture-of-experts layer in deep learning

    Chen, Z., Deng, Y., Wu, Y., Gu, Q., and Li, Y. Towards understanding the mixture-of-experts layer in deep learning. In Proceedings of the 36th International Conference on Neural Information Processing Systems, pp.\ 23049--23062, 2022

  8. [8]

    Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

    Clark, P., Cowhey, I., Etzioni, O., Khot, T., Sabharwal, A., Schoenick, C., and Tafjord, O. Think you have solved question answering? try arc, the ai2 reasoning challenge. arXiv:1803.05457v1, 2018

  9. [9]

    Evaluating the ripple effects of knowledge editing in language models

    Cohen, R., Biran, E., Yoran, O., Globerson, A., and Geva, M. Evaluating the ripple effects of knowledge editing in language models. Transactions of the Association for Computational Linguistics, 12: 0 283--298, 2024

  10. [10]

    Knowledge neurons in pretrained transformers

    Dai, D., Dong, L., Hao, Y., Sui, Z., Chang, B., and Wei, F. Knowledge neurons in pretrained transformers. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 8493--8502, 2022

  11. [11]

    Eamet: Robust massive model editing via embedding alignment optimization

    Dai, Y., Ji, Z., Li, Z., and Wang, S. Eamet: Robust massive model editing via embedding alignment optimization. arXiv preprint arXiv:2505.11876, 2025

  12. [12]

    Larimar: Large language models with episodic memory control

    Das, P., Chaudhury, S., Nelson, E., Melnyk, I., Swaminathan, S., Dai, S., Lozano, A., Kollias, G., Chenthamarakshan, V., Navratil, J., et al. Larimar: Large language models with episodic memory control. In Forty-first International Conference on Machine Learning

  13. [13]

    Editing factual knowledge in language models

    De Cao, N., Aziz, W., and Titov, I. Editing factual knowledge in language models. In EMNLP 2021-2021 Conference on Empirical Methods in Natural Language Processing, Proceedings, pp.\ 6491--6506, 2021

  14. [14]

    Parameter-efficient fine-tuning of large-scale pre-trained language models

    Ding, N., Qin, Y., Yang, G., Wei, F., Yang, Z., Su, Y., Hu, S., Chen, Y., Chan, C.-M., Chen, W., et al. Parameter-efficient fine-tuning of large-scale pre-trained language models. Nature Machine Intelligence, 5 0 (3): 0 220--235, 2023

  15. [15]

    Strong model collapse

    Dohmatob, E., Feng, Y., and Kempe, J. Strong model collapse. arXiv preprint arXiv:2410.04840, 2024

  16. [16]

    Calibrating factual knowledge in pretrained language models

    Dong, Q., Dai, D., Song, Y., Xu, J., Sui, Z., and Li, L. Calibrating factual knowledge in pretrained language models. In Findings of the Association for Computational Linguistics: EMNLP 2022, pp.\ 5937--5947, 2022

  17. [17]

    A survey on in-context learning

    Dong, Q., Li, L., Dai, D., Zheng, C., Ma, J., Li, R., Xia, H., Xu, J., Wu, Z., Chang, B., et al. A survey on in-context learning. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp.\ 1107--1128, 2024

  18. [18]

    Alphaedit: Null-space constrained model editing for language models

    Fang, J., Jiang, H., Wang, K., Ma, Y., Shi, J., Wang, X., He, X., and Chua, T.-S. Alphaedit: Null-space constrained model editing for language models. In The Thirteenth International Conference on Learning Representations, 2025 a . URL https://openreview.net/forum?id=HvSytvg3Jh

  19. [19]

    Cknowedit: A new chinese knowledge editing dataset for linguistics, facts, and logic error correction in llms

    Fang, J., Lu, T., Yao, Y., Jiang, Z., Xu, X., Chen, H., and Zhang, N. Cknowedit: A new chinese knowledge editing dataset for linguistics, facts, and logic error correction in llms. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 8789--8807, 2025 b

  20. [20]

    Dissecting recall of factual associations in auto-regressive language models

    Geva, M., Bastings, J., Filippova, K., and Globerson, A. Dissecting recall of factual associations in auto-regressive language models. In The 2023 Conference on Empirical Methods in Natural Language Processing

  21. [21]

    Transformer feed-forward layers are key-value memories

    Geva, M., Schuster, R., Berant, J., and Levy, O. Transformer feed-forward layers are key-value memories. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pp.\ 5484--5495, 2021

  22. [22]

    The Llama 3 Herd of Models

    Grattafiori, A., Dubey, A., Jauhri, A., Pandey, A., Kadian, A., Al-Dahle, A., Letman, A., Mathur, A., Schelten, A., Vaughan, A., et al. The llama 3 herd of models. arXiv preprint arXiv:2407.21783, 2024

  23. [23]

    Model editing harms general abilities of large language models: Regularization to the rescue

    Gu, J.-C., Xu, H.-X., Ma, J.-Y., Lu, P., Ling, Z.-H., Chang, K.-W., and Peng, N. Model editing harms general abilities of large language models: Regularization to the rescue. In Al-Onaizan, Y., Bansal, M., and Chen, Y.-N. (eds.), Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp.\ 16801--16819, Miami, Florida, USA,...

  24. [24]

    Model editing harms general abilities of large language models: Regularization to the rescue

    Gu, J.-C., Xu, H.-X., Ma, J.-Y., Lu, P., Ling, Z.-H., Chang, K.-W., and Peng, N. Model editing harms general abilities of large language models: Regularization to the rescue. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp.\ 16801--16819. Association for Computational Linguistics, 2024 b

  25. [25]

    DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

    Guo, D., Yang, D., Zhang, H., Song, J., Zhang, R., Xu, R., Zhu, Q., Ma, S., Wang, P., Bi, X., et al. Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning. arXiv preprint arXiv:2501.12948, 2025

  26. [26]

    Model editing at scale leads to gradual and catastrophic forgetting

    Gupta, A., Rao, A., and Anumanchipalli, G. Model editing at scale leads to gradual and catastrophic forgetting. In Ku, L.-W., Martins, A., and Srikumar, V. (eds.), Findings of the Association for Computational Linguistics: ACL 2024, pp.\ 15202--15232, Bangkok, Thailand, August 2024 a . Association for Computational Linguistics. doi:10.18653/v1/2024.findin...

  27. [27]

    Model editing at scale leads to gradual and catastrophic forgetting

    Gupta, A., Rao, A., and Anumanchipalli, G. Model editing at scale leads to gradual and catastrophic forgetting. In Findings of the Association for Computational Linguistics ACL 2024, pp.\ 15202--15232, 2024 b

  28. [28]

    flex tape can’t fix that

    Halevy, K., Sotnikova, A., Alkhamissi, B., Montariol, S., and Bosselut, A. “flex tape can’t fix that”: Bias and misinformation in edited language models. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp.\ 8690--8707, 2024

  29. [29]

    Aging with grace: Lifelong model editing with discrete key-value adaptors

    Hartvigsen, T., Sankaranarayanan, S., Palangi, H., Kim, Y., and Ghassemi, M. Aging with grace: Lifelong model editing with discrete key-value adaptors. Advances in Neural Information Processing Systems, 36, 2024

  30. [30]

    Olympiadbench: A challenging benchmark for promoting agi with olympiad-level bilingual multimodal scientific problems

    He, C., Luo, R., Bai, Y., Hu, S., Thai, Z., Shen, J., Hu, J., Han, X., Huang, Y., Zhang, Y., et al. Olympiadbench: A challenging benchmark for promoting agi with olympiad-level bilingual multimodal scientific problems. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 3828--3850, 2024

  31. [31]

    Knowledge updating? no more model editing! just selective contextual reasoning

    He, G., Song, X., and Sun, A. Knowledge updating? no more model editing! just selective contextual reasoning. arXiv preprint arXiv:2503.05212, 2025

  32. [32]

    Measuring Mathematical Problem Solving With the MATH Dataset

    Hendrycks, D., Burns, C., Kadavath, S., Arora, A., Basart, S., Tang, E., Song, D., and Steinhardt, J. Measuring mathematical problem solving with the math dataset. arXiv preprint arXiv:2103.03874, 2021

  33. [33]

    Z., and Andreas, J

    Hernandez, E., Li, B. Z., and Andreas, J. Inspecting and editing knowledge representations in language models. arXiv preprint arXiv:2304.00740, 2023

  34. [34]

    K.-M., Lin, T.-H., Liao, C.-W., Fang, H.-C., Huang, C.-W., and Chen, Y.-N

    Hsueh, C.-H., Huang, P. K.-M., Lin, T.-H., Liao, C.-W., Fang, H.-C., Huang, C.-W., and Chen, Y.-N. Editing the mind of giants: An in-depth exploration of pitfalls of knowledge editing in large language models. arXiv preprint arXiv:2406.01436, 2024

  35. [35]

    Wilke: Wise-layer knowledge editor for lifelong knowledge editing

    Hu, C., Cao, P., Chen, Y., Liu, K., and Zhao, J. Wilke: Wise-layer knowledge editor for lifelong knowledge editing. In Findings of the Association for Computational Linguistics ACL 2024, pp.\ 3476--3503, 2024

  36. [36]

    J., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., Wang, L., Chen, W., et al

    Hu, E. J., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., Wang, L., Chen, W., et al. Lora: Low-rank adaptation of large language models. In International Conference on Learning Representations

  37. [37]

    Transformer-patcher: One mistake worth one neuron

    Huang, Z., Shen, Y., Zhang, X., Zhou, J., Rong, W., and Xiong, Z. Transformer-patcher: One mistake worth one neuron. In The Eleventh International Conference on Learning Representations, 2023

  38. [38]

    GPT-4o System Card

    Hurst, A., Lerer, A., Goucher, A. P., Perelman, A., Ramesh, A., Clark, A., Ostrow, A., Welihinda, A., Hayes, A., Radford, A., et al. Gpt-4o system card. arXiv preprint arXiv:2410.21276, 2024

  39. [39]

    Unsupervised dense information retrieval with contrastive learning

    Izacard, G., Caron, M., Hosseini, L., Riedel, S., Bojanowski, P., Joulin, A., and Grave, E. Unsupervised dense information retrieval with contrastive learning. Transactions on Machine Learning Research

  40. [40]

    J., Madotto, A., and Fung, P

    Ji, Z., Lee, N., Frieske, R., Yu, T., Su, D., Xu, Y., Ishii, E., Bang, Y. J., Madotto, A., and Fung, P. Survey of hallucination in natural language generation. ACM Computing Surveys, 55 0 (12): 0 1--38, 2023

  41. [41]

    Mistral 7B

    Jiang, A. Q., Sablayrolles, A., Mensch, A., Bamford, C., Chaplot, D. S., Casas, D. d. l., Bressand, F., Lengyel, G., Lample, G., Saulnier, L., et al. Mistral 7b. arXiv preprint arXiv:2310.06825, 2023

  42. [42]

    Anyedit: Edit any knowledge encoded in language models

    Jiang, H., Fang, J., Zhang, N., Wan, M., Ma, G., Wang, X., He, X., and Chua, T.-S. Anyedit: Edit any knowledge encoded in language models. In International Conference on Machine Learning, pp.\ 27510--27533. PMLR, 2025

  43. [43]

    Learning to edit: Aligning llms with knowledge editing

    Jiang, Y., Wang, Y., Wu, C., Zhong, W., Zeng, X., Gao, J., Li, L., Jiang, X., Shang, L., Tang, R., et al. Learning to edit: Aligning llms with knowledge editing. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 4689--4705, 2024

  44. [44]

    F., Araki, J., and Neubig, G

    Jiang, Z., Xu, F. F., Araki, J., and Neubig, G. How can we know what language models know? Transactions of the Association for Computational Linguistics, 8: 0 423--438, 2020

  45. [45]

    Evaluating open-domain question answering in the era of large language models

    Kamalloo, E., Dziri, N., Clarke, C., and Rafiei, D. Evaluating open-domain question answering in the era of large language models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 5591--5606, 2023

  46. [46]

    Reasoning abilities of large language models: In-depth analysis on the abstraction and reasoning corpus

    Lee, S., Sim, W., Shin, D., Seo, W., Park, J., Lee, S., Hwang, S., Kim, S., and Kim, S. Reasoning abilities of large language models: In-depth analysis on the abstraction and reasoning corpus. ACM Transactions on Intelligent Systems and Technology

  47. [47]

    Zero-shot relation extraction via reading comprehension

    Levy, O., Seo, M., Choi, E., and Zettlemoyer, L. Zero-shot relation extraction via reading comprehension. In 21st Conference on Computational Natural Language Learning, CoNLL 2017, pp.\ 333--342. Association for Computational Linguistics (ACL), 2017

  48. [48]

    u ttler, H., Lewis, M., Yih, W.-t., Rockt \

    Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., K \"u ttler, H., Lewis, M., Yih, W.-t., Rockt \"a schel, T., et al. Retrieval-augmented generation for knowledge-intensive nlp tasks. Advances in Neural Information Processing Systems, 33: 0 9459--9474, 2020

  49. [49]

    Solving quantitative reasoning problems with language models

    Lewkowycz, A., Andreassen, A., Dohan, D., Dyer, E., Michalewski, H., Ramasesh, V., Slone, A., Anil, C., Schlag, I., Gutman-Solo, T., et al. Solving quantitative reasoning problems with language models. Advances in Neural Information Processing Systems, 35: 0 3843--3857, 2022

  50. [50]

    Q., Shen, Z., et al

    Li, J., Beeching, E., Tunstall, L., Lipkin, B., Soletskyi, R., Huang, S., Rasul, K., Yu, L., Jiang, A. Q., Shen, Z., et al. Numinamath: The largest public dataset in ai4maths with 860k pairs of competition math problems and solutions. Hugging Face repository, 13: 0 9, 2024 a

  51. [51]

    and Chu, X

    Li, Q. and Chu, X. Adaedit: Advancing continuous knowledge editing for large language models. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 4127--4149, 2025

  52. [52]

    Consecutive model editing with batch alongside hook layers

    Li, S., Deng, Y., Cai, D., Lu, H., Chen, L., and Lam, W. Consecutive model editing with batch alongside hook layers. arXiv preprint arXiv:2403.05330, 2024 b

  53. [53]

    Pmet: Precise model editing in a transformer

    Li, X., Li, S., Song, S., Yang, J., Ma, J., and Yu, J. Pmet: Precise model editing in a transformer. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 38, pp.\ 18564--18572, 2024 c

  54. [54]

    E., Papailiopoulos, D., and Oymak, S

    Li, Y., Ildiz, M. E., Papailiopoulos, D., and Oymak, S. Transformers as algorithms: Generalization and stability in in-context learning. In International Conference on Machine Learning, pp.\ 19565--19594. PMLR, 2023

  55. [55]

    Let's verify step by step

    Lightman, H., Kosaraju, V., Burda, Y., Edwards, H., Baker, B., Lee, T., Leike, J., Schulman, J., Sutskever, I., and Cobbe, K. Let's verify step by step. In International Conference on Learning Representations, volume 2024, pp.\ 39578--39601, 2024

  56. [56]

    Edit less, achieve more: Dynamic sparse neuron masking for lifelong knowledge editing in llms

    Liu, J., Sun, J., Shen, S., Yang, C., and Wang, S. Edit less, achieve more: Dynamic sparse neuron masking for lifelong knowledge editing in llms. In The Thirty-ninth Annual Conference on Neural Information Processing Systems

  57. [57]

    Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing

    Liu, P., Yuan, W., Fu, J., Jiang, Z., Hayashi, H., and Neubig, G. Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing. ACM Computing Surveys, 55 0 (9): 0 1--35, 2023

  58. [58]

    X., Huan, J., Wang, H., et al

    Liu, T., Li, R., Qi, Y., Liu, H., Tang, X., Zheng, T., Yin, Q., Cheng, M. X., Huan, J., Wang, H., et al. Unlocking efficient, scalable, and continual knowledge editing with basis-level representation fine-tuning. arXiv preprint arXiv:2503.00306, 2025 a

  59. [59]

    W., and Lee, W

    Liu, W., Xu, H., Liu, B., Deng, Z., Wang, H., Wang, J., Li, R., Teh, Y. W., and Lee, W. S. Is model editing built on sand? revealing its illusory success and fragile foundation, 2025 b . URL https://arxiv.org/abs/2510.00625

  60. [60]

    An Empirical Study of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning

    Luo, Y., Yang, Z., Meng, F., Li, Y., Zhou, J., and Zhang, Y. An empirical study of catastrophic forgetting in large language models during continual fine-tuning. arXiv preprint arXiv:2308.08747, 2023

  61. [61]

    Interpreting key mechanisms of factual recall in transformer-based language models

    Lv, A., Zhang, K., Chen, Y., Wang, Y., Liu, L., Wen, J.-R., Xie, J., and Yan, R. Interpreting key mechanisms of factual recall in transformer-based language models. arXiv preprint arXiv:2403.19521, 2024 a

  62. [62]

    Full parameter fine-tuning for large language models with limited resources

    Lv, K., Yang, Y., Liu, T., Guo, Q., and Qiu, X. Full parameter fine-tuning for large language models with limited resources. In Ku, L.-W., Martins, A., and Srikumar, V. (eds.), Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 8187--8198, Bangkok, Thailand, August 2024 b . Association for...

  63. [63]

    K-edit: Language model editing with contextual knowledge awareness

    Markowitz, E., Ramakrishna, A., Mehrabi, N., Peris, C., Gupta, R., Chang, K.-W., and Galstyan, A. K-edit: Language model editing with contextual knowledge awareness. arXiv preprint arXiv:2502.10626, 2025

  64. [64]

    arXiv preprint arXiv:2309.13638 (2023)

    McCoy, R. T., Yao, S., Friedman, D., Hardy, M., and Griffiths, T. L. Embers of autoregression: Understanding large language models through the problem they are trained to solve. arXiv preprint arXiv:2309.13638, 2023

  65. [65]

    S., Andonian, A

    Meng, K., Sharma, A. S., Andonian, A. J., Belinkov, Y., and Bau, D. Mass-editing memory in a transformer. In The Eleventh International Conference on Learning Representations

  66. [66]

    Locating and editing factual associations in gpt

    Meng, K., Bau, D., Andonian, A., and Belinkov, Y. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems, 35: 0 17359--17372, 2022 a

  67. [67]

    Locating and editing factual associations in gpt

    Meng, K., Bau, D., Andonian, A., and Belinkov, Y. Locating and editing factual associations in gpt. In Proceedings of the 36th International Conference on Neural Information Processing Systems, pp.\ 17359--17372, 2022 b

  68. [68]

    Llama 3 model card

    Meta AI . Llama 3 model card. https://github.com/meta-llama/llama3/blob/main/MODEL_CARD.md, 2024. Accessed: 2025-04-21

  69. [69]

    Mitchell, E., Lin, C., Bosselut, A., Finn, C., and Manning, C. D. Fast model editing at scale. arXiv preprint arXiv:2110.11309, 2021

  70. [70]

    D., and Finn, C

    Mitchell, E., Lin, C., Bosselut, A., Manning, C. D., and Finn, C. Memory-based model editing at scale. In International Conference on Machine Learning, pp.\ 15817--15831. PMLR, 2022

  71. [71]

    M., Alghisi, S., and Riccardi, G

    Mousavi, S. M., Alghisi, S., and Riccardi, G. Is your llm outdated? benchmarking llms & alignment algorithms for time-sensitive knowledge. arXiv preprint arXiv:2404.08700, 2024

  72. [72]

    Bmike-53: Investigating cross-lingual knowledge editing with in-context learning

    Nie, E., Shao, B., Wang, M., Ding, Z., Schmid, H., and Sch \"u tze, H. Bmike-53: Investigating cross-lingual knowledge editing with in-context learning. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 16357--16374, 2025

  73. [73]

    Nishi, K., Ramesh, R., Okawa, M., Khona, M., Tanaka, H., and Lubana, E. S. Representation shattering in transformers: A synthetic study with knowledge editing. In Singh, A., Fazel, M., Hsu, D., Lacoste-Julien, S., Berkenkamp, F., Maharaj, T., Wagstaff, K., and Zhu, J. (eds.), Proceedings of the 42nd International Conference on Machine Learning, volume 267...

  74. [74]

    GPT-4 Technical Report

    OpenAI. Gpt-4 technical report. arXiv preprint arXiv:2303.08774, 2023

  75. [75]

    Training language models to follow instructions with human feedback

    Ouyang, L., Wu, J., Jiang, X., Almeida, D., Wainwright, C., Mishkin, P., Zhang, C., Agarwal, S., Slama, K., Ray, A., et al. Training language models to follow instructions with human feedback. Advances in neural information processing systems, 35: 0 27730--27744, 2022

  76. [76]

    Context-robust knowledge editing for language models

    Park, H., Choi, G., Kim, M., and Jo, Y. Context-robust knowledge editing for language models. In Findings of the Association for Computational Linguistics: ACL 2025, pp.\ 10360--10385, 2025

  77. [77]

    Event-level knowledge editing

    Peng, H., Wang, X., Li, C., Zeng, K., Duo, J., Cao, Y., Hou, L., and Li, J. Event-level knowledge editing. arXiv preprint arXiv:2402.13093, 2024

  78. [78]

    Petroni, F., Rockt \"a schel, T., Riedel, S., Lewis, P., Bakhtin, A., Wu, Y., and Miller, A. Language models as knowledge bases? In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp.\ 2463--2473, 2019

  79. [79]

    and Elhadad, M

    Pinter, Y. and Elhadad, M. Emptying the ocean with a spoon: Should we edit models? In Findings of the Association for Computational Linguistics: EMNLP 2023, pp.\ 15164--15172, 2023

  80. [80]

    S., Zhu, C., Li, D., Yu, F., Zaheer, M., Kumar, S., and Bhojanapalli, S

    Rawat, A. S., Zhu, C., Li, D., Yu, F., Zaheer, M., Kumar, S., and Bhojanapalli, S. Modifying memories in transformer models. In International Conference on Machine Learning (ICML), volume 2020, 2021

Showing first 80 references.