Revisiting Parameter-Based Knowledge Editing in Large Language Models: Theoretical Limits and Empirical Evidence

Aixin Sun; Futing Wang; Guoxiu He; Wanying Ren; Xin Song

arxiv: 2606.00570 · v1 · pith:SBX7WVRBnew · submitted 2026-05-30 · 💻 cs.CL · cs.AI

Revisiting Parameter-Based Knowledge Editing in Large Language Models: Theoretical Limits and Empirical Evidence

Wanying Ren , Xin Song , Futing Wang , Guoxiu He , Aixin Sun This is my paper

Pith reviewed 2026-06-28 19:01 UTC · model grok-4.3

classification 💻 cs.CL cs.AI

keywords knowledge editinglarge language modelsparameter editingretrieval augmentationreasoning collapsedimensional collapsemodel capabilities

0 comments

The pith

Localized parameter edits in LLMs propagate to cause global reasoning collapse, while a simple retrieval baseline consistently outperforms them.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that parameter-based knowledge editing, which modifies internal weights locally, has fundamental theoretical limits that lead to interference across the model. It introduces the dimensional collapse hypothesis to show how these edits travel along fragile directions in representation space and produce widespread capability damage. Empirical tests across varying edit counts, knowledge complexity, and evaluation settings confirm that all tested editing methods impair core LLM functions such as reasoning and coherence. In direct comparison, an unchanged retrieval-based approach maintains stronger performance under every condition examined. The central takeaway is that future editing work must treat preservation of base capabilities as a primary requirement rather than an afterthought.

Core claim

Localized parameter edits propagate along fragile directions in representation space, inducing global interference that collapses reasoning performance; this effect is observed consistently across methods and settings, whereas retrieval-based access avoids the damage entirely.

What carries the argument

Dimensional Collapse Hypothesis, which models how localized weight changes spread along low-dimensional fragile directions to produce global capability loss.

If this is right

Any method that changes weights directly will face the same interference risk once edit count or complexity rises.
Retrieval mechanisms that leave parameters untouched can serve as a practical upper bound for capability preservation.
Evaluation protocols must include broad capability checks beyond the edited facts themselves.
Research priority should shift toward hybrid or non-parametric approaches that avoid representation-space collapse.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If the collapse mechanism is general, then scaling model size alone will not remove the interference problem without changes to editing strategy.
Retrieval baselines could be extended with lightweight adapters that never touch the original weights.
The same fragility directions may explain why fine-tuning on narrow tasks sometimes erodes broader abilities.

Load-bearing premise

The dimensional collapse hypothesis accurately describes the mechanism by which local edits create global interference.

What would settle it

An experiment in which multiple parameter edits are applied and the model shows no measurable drop in reasoning accuracy or coherence on held-out tasks while still incorporating the new facts.

Figures

Figures reproduced from arXiv: 2606.00570 by Aixin Sun, Futing Wang, Guoxiu He, Wanying Ren, Xin Song.

**Figure 1.** Figure 1: Training and inference workflows for four types of knowledge editing methods. The training process is shown above the green line, while the inference stage is shown below. (a) and (b) perform knowledge editing by modifying the LLM parameters. However, as world knowledge continuously evolves, some information encoded in LLMs inevitably becomes outdated or inaccurate (Mousavi et al., 2024; Ji et al., 2023). … view at source ↗

**Figure 2.** Figure 2: Distribution of Rk values in layer 30 of Llama-3.1-8BInstruct after performing 1000 sequential edits using MEMIT. 4.3. Cumulative Effects in Sequential Editing We now extend the analysis to sequential editing, where the model undergoes a series of updates: W(0) → W(1) → · · · → W(T) . Let h (t) denote the hidden representation after the t-th edit. The total representation change after T edits relative to … view at source ↗

**Figure 3.** Figure 3: Performance changes of knowledge editing methods during sequential editing of Llama-3.1-8B-Instruct on the ZsRE dataset. The x-axis represents the number of edits: 1, 10, 100, and ‘All’ for the full dataset. decoding with semantic consistency judged by Qwen2.5- 72B-Instruct across four dimensions (Zhang et al., 2024b). A token-level locality check confirms the semantic results (Appendix C.5). Full implemen… view at source ↗

**Figure 4.** Figure 4: Performance comparison between AlphaEdit and SCR on sequential editing tasks using the ELKEN dataset. RQ4: How do different knowledge editing methods compare in terms of time efficiency? Beyond correctness and robustness, latency and efficiency are equally crucial for practical deployment. Effective editing methods should minimize both editing overhead and inference latency. To systematically evaluate th… view at source ↗

**Figure 6.** Figure 6: Directional relative change rate Rk across editing methods, model architectures, and steps. 30 [PITH_FULL_IMAGE:figures/full_fig_p030_6.png] view at source ↗

**Figure 7.** Figure 7: Performance changes of knowledge editing methods during sequential editing of Llama-2-7B-Chat on the ZsRE dataset. The x-axis represents the number of edits: 1, 10, 100, and the full dataset. 38 [PITH_FULL_IMAGE:figures/full_fig_p038_7.png] view at source ↗

**Figure 8.** Figure 8: Performance changes of knowledge editing methods during sequential editing of Mistral-7B-Instruct-v0.1 on the ZsRE dataset. The x-axis represents the number of edits: 1, 10, 100, and the full dataset. 39 [PITH_FULL_IMAGE:figures/full_fig_p039_8.png] view at source ↗

**Figure 9.** Figure 9: Editing time (in second), and inference latency relative to the base model. 41 [PITH_FULL_IMAGE:figures/full_fig_p041_9.png] view at source ↗

**Figure 10.** Figure 10: SCR efficiency and performance metrics for sequential editing. Visual representation of the SCR’s inference time and editing stability across various memory sizes (100 and 1000) and retrieval windows (top-1, top-3, top-5, and top-10). 51 [PITH_FULL_IMAGE:figures/full_fig_p051_10.png] view at source ↗

read the original abstract

Parameter-based knowledge editing updates the internal knowledge of large language models (LLMs) via localized weight modifications and has attracted significant attention. However, most existing methods overlook fundamental theoretical limitations and are rarely evaluated under realistic, practice-oriented settings. In this paper, we first present a theoretical analysis based on the dimensional Collapse Hypothesis, explaining how localized parameter edits can propagate along fragile directions in the representation space, inducing global interference and ultimately causing reasoning collapse. Building on this insight, we conduct a comprehensive empirical evaluation by systematically varying knowledge complexity, number of edits, evaluation dimensions, and baseline methods. Our results show that parameter-based editing methods consistently damage core LLM capabilities. In contrast, a simple retrieval-based baseline achieves consistently stronger performance than all parameter-editing methods across all evaluated conditions. These findings highlight that preserving the fundamental capabilities of LLMs after knowledge editing should be a central concern for future research.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Parameter editing harms capabilities while retrieval holds up in their tests, but the dimensional collapse hypothesis lacks direct evidence and could be post-hoc.

read the letter

The main takeaway is that this paper reports parameter-based editing methods damaging LLM core abilities across conditions, with a simple retrieval baseline outperforming them all. They frame the damage through a dimensional collapse hypothesis where local weight changes spread along weak directions in representation space.

What stands out as new is the combination of that hypothesis with a controlled sweep that varies knowledge complexity, edit volume, evaluation axes, and multiple editing baselines at once. Prior work tended to test these factors separately, so the joint comparison adds some clarity on when and how badly editing fails.

The empirical side is the stronger part. If the protocols are as described, the consistent gap versus retrieval is worth noting for anyone maintaining deployed models. The soft spot is the theory. The hypothesis explains the interference but the paper does not appear to measure representation properties like effective rank or singular-value spectra before and after edits. Without those checks, the observed drops could trace to ordinary fine-tuning instability, learning-rate choices, or task mismatch rather than the specific collapse mechanism.

This is aimed at researchers working on knowledge updating and LLM reliability. Anyone weighing editing against retrieval or hybrid approaches will get value from the comparison, even if they end up disagreeing with the causal story.

I would send it to peer review. The negative result on editing is relevant enough that referees should check the data and setup, though the authors will likely need to add direct tests of the hypothesis or qualify the explanation.

Referee Report

2 major / 1 minor

Summary. The paper claims that parameter-based knowledge editing in LLMs is subject to fundamental theoretical limits, which it explains via the Dimensional Collapse Hypothesis: localized weight updates propagate along fragile directions in representation space, causing global interference and reasoning collapse. It supports this with a comprehensive empirical evaluation that varies knowledge complexity, edit count, evaluation dimensions, and baselines, finding that all parameter-editing methods degrade core LLM capabilities while a simple retrieval-based baseline consistently outperforms them.

Significance. If the empirical findings hold under scrutiny, the work would usefully shift attention in the knowledge-editing literature toward capability preservation and retrieval alternatives. The systematic sweep across multiple axes is a clear strength and provides falsifiable, practice-oriented evidence. However, the explanatory mechanism (Dimensional Collapse Hypothesis) is presented without direct validation, reducing the paper's ability to move beyond post-hoc interpretation.

major comments (2)

[theoretical analysis] Theoretical analysis section (as described in the abstract): the Dimensional Collapse Hypothesis is invoked to explain how localized edits induce global interference, yet the manuscript reports no direct measurements (effective rank, singular-value spectra, or directional fragility metrics) of representation space before versus after edits. Without such tests the hypothesis remains an unverified interpretive lens rather than a validated causal account, leaving open alternative explanations such as generic optimization instability.
[empirical evaluation] Empirical evaluation (abstract and results): while the central claim that parameter edits damage capabilities rests on the sweep, the abstract provides no quantitative results, error bars, dataset sizes, or model scales. If these details are absent or insufficiently reported in the full text, the strength of the cross-condition superiority claim cannot be assessed.

minor comments (1)

Clarify whether the retrieval baseline is purely non-parametric or involves any learned components, as this affects the interpretation of 'parameter-based' versus 'retrieval-based' distinctions.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address each major comment below and indicate planned revisions.

read point-by-point responses

Referee: [theoretical analysis] Theoretical analysis section (as described in the abstract): the Dimensional Collapse Hypothesis is invoked to explain how localized edits induce global interference, yet the manuscript reports no direct measurements (effective rank, singular-value spectra, or directional fragility metrics) of representation space before versus after edits. Without such tests the hypothesis remains an unverified interpretive lens rather than a validated causal account, leaving open alternative explanations such as generic optimization instability.

Authors: The Dimensional Collapse Hypothesis is derived from a theoretical analysis of how localized weight updates interact with the geometry of representation spaces. The manuscript supports it via the observed pattern of global interference across diverse edit conditions. We agree that direct measurements would strengthen the causal claim over post-hoc interpretation. In revision we will add experiments reporting effective rank, singular-value spectra, and directional fragility metrics before versus after edits. revision: yes
Referee: [empirical evaluation] Empirical evaluation (abstract and results): while the central claim that parameter edits damage capabilities rests on the sweep, the abstract provides no quantitative results, error bars, dataset sizes, or model scales. If these details are absent or insufficiently reported in the full text, the strength of the cross-condition superiority claim cannot be assessed.

Authors: The full manuscript reports all quantitative results (performance deltas with standard deviations), dataset sizes, model scales, and evaluation protocols in the results section and appendices. To make the strength of the claims immediately visible, we will revise the abstract to include representative quantitative outcomes and error-bar information. revision: partial

Circularity Check

0 steps flagged

No significant circularity; empirical claims rest on new measurements independent of the invoked hypothesis

full rationale

The paper's central results are empirical comparisons of parameter-editing methods versus a retrieval baseline across varied conditions of knowledge complexity, edit count, and evaluation dimensions. These outcomes are generated from direct experiments and do not reduce to any fitted parameter, self-defined quantity, or self-citation chain. The Dimensional Collapse Hypothesis is cited only as the basis for a theoretical explanation of observed interference; it is not used to derive or force any reported performance numbers, nor does the paper present predictions that are then verified by construction from the same hypothesis. No self-definitional, fitted-input, or uniqueness-imported steps appear in the provided derivation chain. The hypothesis functions as post-hoc interpretive framing rather than a load-bearing premise that collapses the empirical findings back into the inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the dimensional collapse hypothesis as an explanatory mechanism and on the assumption that the chosen evaluation dimensions and baselines adequately represent practice-oriented settings; no free parameters or invented entities are introduced in the abstract.

axioms (1)

domain assumption Dimensional Collapse Hypothesis
Invoked to explain propagation of localized edits into global interference and reasoning collapse.

pith-pipeline@v0.9.1-grok · 5690 in / 1133 out tokens · 23974 ms · 2026-06-28T19:01:10.450637+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

125 extracted references · 40 canonical work pages · 12 internal anchors

[1]

A review on language models as knowledge bases

AlKhamissi, B., Li, M., Celikyilmaz, A., Diab, M., and Ghazvininejad, M. A review on language models as knowledge bases. arXiv preprint arXiv:2204.06031, 2022

work page arXiv 2022
[2]

and Li, Y

Allen-Zhu, Z. and Li, Y. Physics of language models: Part 3.3, knowledge capacity scaling laws. arXiv preprint arXiv:2404.05405, 2024

work page arXiv 2024
[3]

Self-rag: Learning to retrieve, generate, and critique through self-reflection

Asai, A., Wu, Z., Wang, Y., Sil, A., and Hajishirzi, H. Self-rag: Learning to retrieve, generate, and critique through self-reflection. In The Twelfth International Conference on Learning Representations
[4]

D., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A., et al

Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J. D., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A., et al. Language models are few-shot learners. Advances in neural information processing systems, 33: 0 1877--1901, 2020

1901
[5]

Lifelong knowledge editing for llms with retrieval-augmented continuous prompt learning

Chen, Q., Zhang, T., He, X., Li, D., Wang, C., Huang, L., et al. Lifelong knowledge editing for llms with retrieval-augmented continuous prompt learning. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp.\ 13565--13580, 2024

2024
[6]

Uniedit: A unified knowledge editing benchmark for large language models

Chen, Q., Wang, D., Zhang, T., Yan, Z., You, C., Wang, C., and He, X. Uniedit: A unified knowledge editing benchmark for large language models. In The Thirty-ninth Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track, 2025. URL https://openreview.net/forum?id=eESbQ5lWiP

2025
[7]

Towards understanding the mixture-of-experts layer in deep learning

Chen, Z., Deng, Y., Wu, Y., Gu, Q., and Li, Y. Towards understanding the mixture-of-experts layer in deep learning. In Proceedings of the 36th International Conference on Neural Information Processing Systems, pp.\ 23049--23062, 2022

2022
[8]

Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

Clark, P., Cowhey, I., Etzioni, O., Khot, T., Sabharwal, A., Schoenick, C., and Tafjord, O. Think you have solved question answering? try arc, the ai2 reasoning challenge. arXiv:1803.05457v1, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[9]

Evaluating the ripple effects of knowledge editing in language models

Cohen, R., Biran, E., Yoran, O., Globerson, A., and Geva, M. Evaluating the ripple effects of knowledge editing in language models. Transactions of the Association for Computational Linguistics, 12: 0 283--298, 2024

2024
[10]

Knowledge neurons in pretrained transformers

Dai, D., Dong, L., Hao, Y., Sui, Z., Chang, B., and Wei, F. Knowledge neurons in pretrained transformers. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 8493--8502, 2022

2022
[11]

Eamet: Robust massive model editing via embedding alignment optimization

Dai, Y., Ji, Z., Li, Z., and Wang, S. Eamet: Robust massive model editing via embedding alignment optimization. arXiv preprint arXiv:2505.11876, 2025

work page arXiv 2025
[12]

Larimar: Large language models with episodic memory control

Das, P., Chaudhury, S., Nelson, E., Melnyk, I., Swaminathan, S., Dai, S., Lozano, A., Kollias, G., Chenthamarakshan, V., Navratil, J., et al. Larimar: Large language models with episodic memory control. In Forty-first International Conference on Machine Learning
[13]

Editing factual knowledge in language models

De Cao, N., Aziz, W., and Titov, I. Editing factual knowledge in language models. In EMNLP 2021-2021 Conference on Empirical Methods in Natural Language Processing, Proceedings, pp.\ 6491--6506, 2021

2021
[14]

Parameter-efficient fine-tuning of large-scale pre-trained language models

Ding, N., Qin, Y., Yang, G., Wei, F., Yang, Z., Su, Y., Hu, S., Chen, Y., Chan, C.-M., Chen, W., et al. Parameter-efficient fine-tuning of large-scale pre-trained language models. Nature Machine Intelligence, 5 0 (3): 0 220--235, 2023

2023
[15]

Strong model collapse

Dohmatob, E., Feng, Y., and Kempe, J. Strong model collapse. arXiv preprint arXiv:2410.04840, 2024

work page arXiv 2024
[16]

Calibrating factual knowledge in pretrained language models

Dong, Q., Dai, D., Song, Y., Xu, J., Sui, Z., and Li, L. Calibrating factual knowledge in pretrained language models. In Findings of the Association for Computational Linguistics: EMNLP 2022, pp.\ 5937--5947, 2022

2022
[17]

A survey on in-context learning

Dong, Q., Li, L., Dai, D., Zheng, C., Ma, J., Li, R., Xia, H., Xu, J., Wu, Z., Chang, B., et al. A survey on in-context learning. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp.\ 1107--1128, 2024

2024
[18]

Alphaedit: Null-space constrained model editing for language models

Fang, J., Jiang, H., Wang, K., Ma, Y., Shi, J., Wang, X., He, X., and Chua, T.-S. Alphaedit: Null-space constrained model editing for language models. In The Thirteenth International Conference on Learning Representations, 2025 a . URL https://openreview.net/forum?id=HvSytvg3Jh

2025
[19]

Cknowedit: A new chinese knowledge editing dataset for linguistics, facts, and logic error correction in llms

Fang, J., Lu, T., Yao, Y., Jiang, Z., Xu, X., Chen, H., and Zhang, N. Cknowedit: A new chinese knowledge editing dataset for linguistics, facts, and logic error correction in llms. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 8789--8807, 2025 b

2025
[20]

Dissecting recall of factual associations in auto-regressive language models

Geva, M., Bastings, J., Filippova, K., and Globerson, A. Dissecting recall of factual associations in auto-regressive language models. In The 2023 Conference on Empirical Methods in Natural Language Processing

2023
[21]

Transformer feed-forward layers are key-value memories

Geva, M., Schuster, R., Berant, J., and Levy, O. Transformer feed-forward layers are key-value memories. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pp.\ 5484--5495, 2021

2021
[22]

The Llama 3 Herd of Models

Grattafiori, A., Dubey, A., Jauhri, A., Pandey, A., Kadian, A., Al-Dahle, A., Letman, A., Mathur, A., Schelten, A., Vaughan, A., et al. The llama 3 herd of models. arXiv preprint arXiv:2407.21783, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024
[23]

Model editing harms general abilities of large language models: Regularization to the rescue

Gu, J.-C., Xu, H.-X., Ma, J.-Y., Lu, P., Ling, Z.-H., Chang, K.-W., and Peng, N. Model editing harms general abilities of large language models: Regularization to the rescue. In Al-Onaizan, Y., Bansal, M., and Chen, Y.-N. (eds.), Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp.\ 16801--16819, Miami, Florida, USA,...

work page doi:10.18653/v1/2024.emnlp-main.934 2024
[24]

Model editing harms general abilities of large language models: Regularization to the rescue

Gu, J.-C., Xu, H.-X., Ma, J.-Y., Lu, P., Ling, Z.-H., Chang, K.-W., and Peng, N. Model editing harms general abilities of large language models: Regularization to the rescue. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp.\ 16801--16819. Association for Computational Linguistics, 2024 b

2024
[25]

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Guo, D., Yang, D., Zhang, H., Song, J., Zhang, R., Xu, R., Zhu, Q., Ma, S., Wang, P., Bi, X., et al. Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning. arXiv preprint arXiv:2501.12948, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025
[26]

Model editing at scale leads to gradual and catastrophic forgetting

Gupta, A., Rao, A., and Anumanchipalli, G. Model editing at scale leads to gradual and catastrophic forgetting. In Ku, L.-W., Martins, A., and Srikumar, V. (eds.), Findings of the Association for Computational Linguistics: ACL 2024, pp.\ 15202--15232, Bangkok, Thailand, August 2024 a . Association for Computational Linguistics. doi:10.18653/v1/2024.findin...

work page doi:10.18653/v1/2024.findings-acl.902 2024
[27]

Model editing at scale leads to gradual and catastrophic forgetting

Gupta, A., Rao, A., and Anumanchipalli, G. Model editing at scale leads to gradual and catastrophic forgetting. In Findings of the Association for Computational Linguistics ACL 2024, pp.\ 15202--15232, 2024 b

2024
[28]

flex tape can’t fix that

Halevy, K., Sotnikova, A., Alkhamissi, B., Montariol, S., and Bosselut, A. “flex tape can’t fix that”: Bias and misinformation in edited language models. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp.\ 8690--8707, 2024

2024
[29]

Aging with grace: Lifelong model editing with discrete key-value adaptors

Hartvigsen, T., Sankaranarayanan, S., Palangi, H., Kim, Y., and Ghassemi, M. Aging with grace: Lifelong model editing with discrete key-value adaptors. Advances in Neural Information Processing Systems, 36, 2024

2024
[30]

Olympiadbench: A challenging benchmark for promoting agi with olympiad-level bilingual multimodal scientific problems

He, C., Luo, R., Bai, Y., Hu, S., Thai, Z., Shen, J., Hu, J., Han, X., Huang, Y., Zhang, Y., et al. Olympiadbench: A challenging benchmark for promoting agi with olympiad-level bilingual multimodal scientific problems. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 3828--3850, 2024

2024
[31]

Knowledge updating? no more model editing! just selective contextual reasoning

He, G., Song, X., and Sun, A. Knowledge updating? no more model editing! just selective contextual reasoning. arXiv preprint arXiv:2503.05212, 2025

work page arXiv 2025
[32]

Measuring Mathematical Problem Solving With the MATH Dataset

Hendrycks, D., Burns, C., Kadavath, S., Arora, A., Basart, S., Tang, E., Song, D., and Steinhardt, J. Measuring mathematical problem solving with the math dataset. arXiv preprint arXiv:2103.03874, 2021

work page internal anchor Pith review Pith/arXiv arXiv 2021
[33]

Z., and Andreas, J

Hernandez, E., Li, B. Z., and Andreas, J. Inspecting and editing knowledge representations in language models. arXiv preprint arXiv:2304.00740, 2023

work page arXiv 2023
[34]

K.-M., Lin, T.-H., Liao, C.-W., Fang, H.-C., Huang, C.-W., and Chen, Y.-N

Hsueh, C.-H., Huang, P. K.-M., Lin, T.-H., Liao, C.-W., Fang, H.-C., Huang, C.-W., and Chen, Y.-N. Editing the mind of giants: An in-depth exploration of pitfalls of knowledge editing in large language models. arXiv preprint arXiv:2406.01436, 2024

work page arXiv 2024
[35]

Wilke: Wise-layer knowledge editor for lifelong knowledge editing

Hu, C., Cao, P., Chen, Y., Liu, K., and Zhao, J. Wilke: Wise-layer knowledge editor for lifelong knowledge editing. In Findings of the Association for Computational Linguistics ACL 2024, pp.\ 3476--3503, 2024

2024
[36]

J., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., Wang, L., Chen, W., et al

Hu, E. J., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., Wang, L., Chen, W., et al. Lora: Low-rank adaptation of large language models. In International Conference on Learning Representations
[37]

Transformer-patcher: One mistake worth one neuron

Huang, Z., Shen, Y., Zhang, X., Zhou, J., Rong, W., and Xiong, Z. Transformer-patcher: One mistake worth one neuron. In The Eleventh International Conference on Learning Representations, 2023

2023
[38]

GPT-4o System Card

Hurst, A., Lerer, A., Goucher, A. P., Perelman, A., Ramesh, A., Clark, A., Ostrow, A., Welihinda, A., Hayes, A., Radford, A., et al. Gpt-4o system card. arXiv preprint arXiv:2410.21276, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024
[39]

Unsupervised dense information retrieval with contrastive learning

Izacard, G., Caron, M., Hosseini, L., Riedel, S., Bojanowski, P., Joulin, A., and Grave, E. Unsupervised dense information retrieval with contrastive learning. Transactions on Machine Learning Research
[40]

J., Madotto, A., and Fung, P

Ji, Z., Lee, N., Frieske, R., Yu, T., Su, D., Xu, Y., Ishii, E., Bang, Y. J., Madotto, A., and Fung, P. Survey of hallucination in natural language generation. ACM Computing Surveys, 55 0 (12): 0 1--38, 2023

2023
[41]

Mistral 7B

Jiang, A. Q., Sablayrolles, A., Mensch, A., Bamford, C., Chaplot, D. S., Casas, D. d. l., Bressand, F., Lengyel, G., Lample, G., Saulnier, L., et al. Mistral 7b. arXiv preprint arXiv:2310.06825, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[42]

Anyedit: Edit any knowledge encoded in language models

Jiang, H., Fang, J., Zhang, N., Wan, M., Ma, G., Wang, X., He, X., and Chua, T.-S. Anyedit: Edit any knowledge encoded in language models. In International Conference on Machine Learning, pp.\ 27510--27533. PMLR, 2025

2025
[43]

Learning to edit: Aligning llms with knowledge editing

Jiang, Y., Wang, Y., Wu, C., Zhong, W., Zeng, X., Gao, J., Li, L., Jiang, X., Shang, L., Tang, R., et al. Learning to edit: Aligning llms with knowledge editing. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 4689--4705, 2024

2024
[44]

F., Araki, J., and Neubig, G

Jiang, Z., Xu, F. F., Araki, J., and Neubig, G. How can we know what language models know? Transactions of the Association for Computational Linguistics, 8: 0 423--438, 2020

2020
[45]

Evaluating open-domain question answering in the era of large language models

Kamalloo, E., Dziri, N., Clarke, C., and Rafiei, D. Evaluating open-domain question answering in the era of large language models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 5591--5606, 2023

2023
[46]

Reasoning abilities of large language models: In-depth analysis on the abstraction and reasoning corpus

Lee, S., Sim, W., Shin, D., Seo, W., Park, J., Lee, S., Hwang, S., Kim, S., and Kim, S. Reasoning abilities of large language models: In-depth analysis on the abstraction and reasoning corpus. ACM Transactions on Intelligent Systems and Technology
[47]

Zero-shot relation extraction via reading comprehension

Levy, O., Seo, M., Choi, E., and Zettlemoyer, L. Zero-shot relation extraction via reading comprehension. In 21st Conference on Computational Natural Language Learning, CoNLL 2017, pp.\ 333--342. Association for Computational Linguistics (ACL), 2017

2017
[48]

u ttler, H., Lewis, M., Yih, W.-t., Rockt \

Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., K \"u ttler, H., Lewis, M., Yih, W.-t., Rockt \"a schel, T., et al. Retrieval-augmented generation for knowledge-intensive nlp tasks. Advances in Neural Information Processing Systems, 33: 0 9459--9474, 2020

2020
[49]

Solving quantitative reasoning problems with language models

Lewkowycz, A., Andreassen, A., Dohan, D., Dyer, E., Michalewski, H., Ramasesh, V., Slone, A., Anil, C., Schlag, I., Gutman-Solo, T., et al. Solving quantitative reasoning problems with language models. Advances in Neural Information Processing Systems, 35: 0 3843--3857, 2022

2022
[50]

Q., Shen, Z., et al

Li, J., Beeching, E., Tunstall, L., Lipkin, B., Soletskyi, R., Huang, S., Rasul, K., Yu, L., Jiang, A. Q., Shen, Z., et al. Numinamath: The largest public dataset in ai4maths with 860k pairs of competition math problems and solutions. Hugging Face repository, 13: 0 9, 2024 a

2024
[51]

and Chu, X

Li, Q. and Chu, X. Adaedit: Advancing continuous knowledge editing for large language models. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 4127--4149, 2025

2025
[52]

Consecutive model editing with batch alongside hook layers

Li, S., Deng, Y., Cai, D., Lu, H., Chen, L., and Lam, W. Consecutive model editing with batch alongside hook layers. arXiv preprint arXiv:2403.05330, 2024 b

work page arXiv 2024
[53]

Pmet: Precise model editing in a transformer

Li, X., Li, S., Song, S., Yang, J., Ma, J., and Yu, J. Pmet: Precise model editing in a transformer. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 38, pp.\ 18564--18572, 2024 c

2024
[54]

E., Papailiopoulos, D., and Oymak, S

Li, Y., Ildiz, M. E., Papailiopoulos, D., and Oymak, S. Transformers as algorithms: Generalization and stability in in-context learning. In International Conference on Machine Learning, pp.\ 19565--19594. PMLR, 2023

2023
[55]

Let's verify step by step

Lightman, H., Kosaraju, V., Burda, Y., Edwards, H., Baker, B., Lee, T., Leike, J., Schulman, J., Sutskever, I., and Cobbe, K. Let's verify step by step. In International Conference on Learning Representations, volume 2024, pp.\ 39578--39601, 2024

2024
[56]

Edit less, achieve more: Dynamic sparse neuron masking for lifelong knowledge editing in llms

Liu, J., Sun, J., Shen, S., Yang, C., and Wang, S. Edit less, achieve more: Dynamic sparse neuron masking for lifelong knowledge editing in llms. In The Thirty-ninth Annual Conference on Neural Information Processing Systems
[57]

Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing

Liu, P., Yuan, W., Fu, J., Jiang, Z., Hayashi, H., and Neubig, G. Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing. ACM Computing Surveys, 55 0 (9): 0 1--35, 2023

2023
[58]

X., Huan, J., Wang, H., et al

Liu, T., Li, R., Qi, Y., Liu, H., Tang, X., Zheng, T., Yin, Q., Cheng, M. X., Huan, J., Wang, H., et al. Unlocking efficient, scalable, and continual knowledge editing with basis-level representation fine-tuning. arXiv preprint arXiv:2503.00306, 2025 a

work page arXiv 2025
[59]

W., and Lee, W

Liu, W., Xu, H., Liu, B., Deng, Z., Wang, H., Wang, J., Li, R., Teh, Y. W., and Lee, W. S. Is model editing built on sand? revealing its illusory success and fragile foundation, 2025 b . URL https://arxiv.org/abs/2510.00625

work page arXiv 2025
[60]

An Empirical Study of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning

Luo, Y., Yang, Z., Meng, F., Li, Y., Zhou, J., and Zhang, Y. An empirical study of catastrophic forgetting in large language models during continual fine-tuning. arXiv preprint arXiv:2308.08747, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[61]

Interpreting key mechanisms of factual recall in transformer-based language models

Lv, A., Zhang, K., Chen, Y., Wang, Y., Liu, L., Wen, J.-R., Xie, J., and Yan, R. Interpreting key mechanisms of factual recall in transformer-based language models. arXiv preprint arXiv:2403.19521, 2024 a

work page arXiv 2024
[62]

Full parameter fine-tuning for large language models with limited resources

Lv, K., Yang, Y., Liu, T., Guo, Q., and Qiu, X. Full parameter fine-tuning for large language models with limited resources. In Ku, L.-W., Martins, A., and Srikumar, V. (eds.), Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 8187--8198, Bangkok, Thailand, August 2024 b . Association for...

work page doi:10.18653/v1/2024.acl-long.445 2024
[63]

K-edit: Language model editing with contextual knowledge awareness

Markowitz, E., Ramakrishna, A., Mehrabi, N., Peris, C., Gupta, R., Chang, K.-W., and Galstyan, A. K-edit: Language model editing with contextual knowledge awareness. arXiv preprint arXiv:2502.10626, 2025

work page arXiv 2025
[64]

arXiv preprint arXiv:2309.13638 (2023)

McCoy, R. T., Yao, S., Friedman, D., Hardy, M., and Griffiths, T. L. Embers of autoregression: Understanding large language models through the problem they are trained to solve. arXiv preprint arXiv:2309.13638, 2023

work page arXiv 2023
[65]

S., Andonian, A

Meng, K., Sharma, A. S., Andonian, A. J., Belinkov, Y., and Bau, D. Mass-editing memory in a transformer. In The Eleventh International Conference on Learning Representations
[66]

Locating and editing factual associations in gpt

Meng, K., Bau, D., Andonian, A., and Belinkov, Y. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems, 35: 0 17359--17372, 2022 a

2022
[67]

Locating and editing factual associations in gpt

Meng, K., Bau, D., Andonian, A., and Belinkov, Y. Locating and editing factual associations in gpt. In Proceedings of the 36th International Conference on Neural Information Processing Systems, pp.\ 17359--17372, 2022 b

2022
[68]

Llama 3 model card

Meta AI . Llama 3 model card. https://github.com/meta-llama/llama3/blob/main/MODEL_CARD.md, 2024. Accessed: 2025-04-21

2024
[69]

Mitchell, E., Lin, C., Bosselut, A., Finn, C., and Manning, C. D. Fast model editing at scale. arXiv preprint arXiv:2110.11309, 2021

work page arXiv 2021
[70]

D., and Finn, C

Mitchell, E., Lin, C., Bosselut, A., Manning, C. D., and Finn, C. Memory-based model editing at scale. In International Conference on Machine Learning, pp.\ 15817--15831. PMLR, 2022

2022
[71]

M., Alghisi, S., and Riccardi, G

Mousavi, S. M., Alghisi, S., and Riccardi, G. Is your llm outdated? benchmarking llms & alignment algorithms for time-sensitive knowledge. arXiv preprint arXiv:2404.08700, 2024

work page arXiv 2024
[72]

Bmike-53: Investigating cross-lingual knowledge editing with in-context learning

Nie, E., Shao, B., Wang, M., Ding, Z., Schmid, H., and Sch \"u tze, H. Bmike-53: Investigating cross-lingual knowledge editing with in-context learning. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 16357--16374, 2025

2025
[73]

Nishi, K., Ramesh, R., Okawa, M., Khona, M., Tanaka, H., and Lubana, E. S. Representation shattering in transformers: A synthetic study with knowledge editing. In Singh, A., Fazel, M., Hsu, D., Lacoste-Julien, S., Berkenkamp, F., Maharaj, T., Wagstaff, K., and Zhu, J. (eds.), Proceedings of the 42nd International Conference on Machine Learning, volume 267...

2025
[74]

GPT-4 Technical Report

OpenAI. Gpt-4 technical report. arXiv preprint arXiv:2303.08774, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023
[75]

Training language models to follow instructions with human feedback

Ouyang, L., Wu, J., Jiang, X., Almeida, D., Wainwright, C., Mishkin, P., Zhang, C., Agarwal, S., Slama, K., Ray, A., et al. Training language models to follow instructions with human feedback. Advances in neural information processing systems, 35: 0 27730--27744, 2022

2022
[76]

Context-robust knowledge editing for language models

Park, H., Choi, G., Kim, M., and Jo, Y. Context-robust knowledge editing for language models. In Findings of the Association for Computational Linguistics: ACL 2025, pp.\ 10360--10385, 2025

2025
[77]

Event-level knowledge editing

Peng, H., Wang, X., Li, C., Zeng, K., Duo, J., Cao, Y., Hou, L., and Li, J. Event-level knowledge editing. arXiv preprint arXiv:2402.13093, 2024

work page arXiv 2024
[78]

Petroni, F., Rockt \"a schel, T., Riedel, S., Lewis, P., Bakhtin, A., Wu, Y., and Miller, A. Language models as knowledge bases? In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp.\ 2463--2473, 2019

2019
[79]

and Elhadad, M

Pinter, Y. and Elhadad, M. Emptying the ocean with a spoon: Should we edit models? In Findings of the Association for Computational Linguistics: EMNLP 2023, pp.\ 15164--15172, 2023

2023
[80]

S., Zhu, C., Li, D., Yu, F., Zaheer, M., Kumar, S., and Bhojanapalli, S

Rawat, A. S., Zhu, C., Li, D., Yu, F., Zaheer, M., Kumar, S., and Bhojanapalli, S. Modifying memories in transformer models. In International Conference on Machine Learning (ICML), volume 2020, 2021

2020

Showing first 80 references.

[1] [1]

A review on language models as knowledge bases

AlKhamissi, B., Li, M., Celikyilmaz, A., Diab, M., and Ghazvininejad, M. A review on language models as knowledge bases. arXiv preprint arXiv:2204.06031, 2022

work page arXiv 2022

[2] [2]

and Li, Y

Allen-Zhu, Z. and Li, Y. Physics of language models: Part 3.3, knowledge capacity scaling laws. arXiv preprint arXiv:2404.05405, 2024

work page arXiv 2024

[3] [3]

Self-rag: Learning to retrieve, generate, and critique through self-reflection

Asai, A., Wu, Z., Wang, Y., Sil, A., and Hajishirzi, H. Self-rag: Learning to retrieve, generate, and critique through self-reflection. In The Twelfth International Conference on Learning Representations

[4] [4]

D., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A., et al

Brown, T., Mann, B., Ryder, N., Subbiah, M., Kaplan, J. D., Dhariwal, P., Neelakantan, A., Shyam, P., Sastry, G., Askell, A., et al. Language models are few-shot learners. Advances in neural information processing systems, 33: 0 1877--1901, 2020

1901

[5] [5]

Lifelong knowledge editing for llms with retrieval-augmented continuous prompt learning

Chen, Q., Zhang, T., He, X., Li, D., Wang, C., Huang, L., et al. Lifelong knowledge editing for llms with retrieval-augmented continuous prompt learning. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp.\ 13565--13580, 2024

2024

[6] [6]

Uniedit: A unified knowledge editing benchmark for large language models

Chen, Q., Wang, D., Zhang, T., Yan, Z., You, C., Wang, C., and He, X. Uniedit: A unified knowledge editing benchmark for large language models. In The Thirty-ninth Annual Conference on Neural Information Processing Systems Datasets and Benchmarks Track, 2025. URL https://openreview.net/forum?id=eESbQ5lWiP

2025

[7] [7]

Towards understanding the mixture-of-experts layer in deep learning

Chen, Z., Deng, Y., Wu, Y., Gu, Q., and Li, Y. Towards understanding the mixture-of-experts layer in deep learning. In Proceedings of the 36th International Conference on Neural Information Processing Systems, pp.\ 23049--23062, 2022

2022

[8] [8]

Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge

Clark, P., Cowhey, I., Etzioni, O., Khot, T., Sabharwal, A., Schoenick, C., and Tafjord, O. Think you have solved question answering? try arc, the ai2 reasoning challenge. arXiv:1803.05457v1, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[9] [9]

Evaluating the ripple effects of knowledge editing in language models

Cohen, R., Biran, E., Yoran, O., Globerson, A., and Geva, M. Evaluating the ripple effects of knowledge editing in language models. Transactions of the Association for Computational Linguistics, 12: 0 283--298, 2024

2024

[10] [10]

Knowledge neurons in pretrained transformers

Dai, D., Dong, L., Hao, Y., Sui, Z., Chang, B., and Wei, F. Knowledge neurons in pretrained transformers. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 8493--8502, 2022

2022

[11] [11]

Eamet: Robust massive model editing via embedding alignment optimization

Dai, Y., Ji, Z., Li, Z., and Wang, S. Eamet: Robust massive model editing via embedding alignment optimization. arXiv preprint arXiv:2505.11876, 2025

work page arXiv 2025

[12] [12]

Larimar: Large language models with episodic memory control

Das, P., Chaudhury, S., Nelson, E., Melnyk, I., Swaminathan, S., Dai, S., Lozano, A., Kollias, G., Chenthamarakshan, V., Navratil, J., et al. Larimar: Large language models with episodic memory control. In Forty-first International Conference on Machine Learning

[13] [13]

Editing factual knowledge in language models

De Cao, N., Aziz, W., and Titov, I. Editing factual knowledge in language models. In EMNLP 2021-2021 Conference on Empirical Methods in Natural Language Processing, Proceedings, pp.\ 6491--6506, 2021

2021

[14] [14]

Parameter-efficient fine-tuning of large-scale pre-trained language models

Ding, N., Qin, Y., Yang, G., Wei, F., Yang, Z., Su, Y., Hu, S., Chen, Y., Chan, C.-M., Chen, W., et al. Parameter-efficient fine-tuning of large-scale pre-trained language models. Nature Machine Intelligence, 5 0 (3): 0 220--235, 2023

2023

[15] [15]

Strong model collapse

Dohmatob, E., Feng, Y., and Kempe, J. Strong model collapse. arXiv preprint arXiv:2410.04840, 2024

work page arXiv 2024

[16] [16]

Calibrating factual knowledge in pretrained language models

Dong, Q., Dai, D., Song, Y., Xu, J., Sui, Z., and Li, L. Calibrating factual knowledge in pretrained language models. In Findings of the Association for Computational Linguistics: EMNLP 2022, pp.\ 5937--5947, 2022

2022

[17] [17]

A survey on in-context learning

Dong, Q., Li, L., Dai, D., Zheng, C., Ma, J., Li, R., Xia, H., Xu, J., Wu, Z., Chang, B., et al. A survey on in-context learning. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp.\ 1107--1128, 2024

2024

[18] [18]

Alphaedit: Null-space constrained model editing for language models

Fang, J., Jiang, H., Wang, K., Ma, Y., Shi, J., Wang, X., He, X., and Chua, T.-S. Alphaedit: Null-space constrained model editing for language models. In The Thirteenth International Conference on Learning Representations, 2025 a . URL https://openreview.net/forum?id=HvSytvg3Jh

2025

[19] [19]

Cknowedit: A new chinese knowledge editing dataset for linguistics, facts, and logic error correction in llms

Fang, J., Lu, T., Yao, Y., Jiang, Z., Xu, X., Chen, H., and Zhang, N. Cknowedit: A new chinese knowledge editing dataset for linguistics, facts, and logic error correction in llms. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 8789--8807, 2025 b

2025

[20] [20]

Dissecting recall of factual associations in auto-regressive language models

Geva, M., Bastings, J., Filippova, K., and Globerson, A. Dissecting recall of factual associations in auto-regressive language models. In The 2023 Conference on Empirical Methods in Natural Language Processing

2023

[21] [21]

Transformer feed-forward layers are key-value memories

Geva, M., Schuster, R., Berant, J., and Levy, O. Transformer feed-forward layers are key-value memories. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pp.\ 5484--5495, 2021

2021

[22] [22]

The Llama 3 Herd of Models

Grattafiori, A., Dubey, A., Jauhri, A., Pandey, A., Kadian, A., Al-Dahle, A., Letman, A., Mathur, A., Schelten, A., Vaughan, A., et al. The llama 3 herd of models. arXiv preprint arXiv:2407.21783, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024

[23] [23]

Model editing harms general abilities of large language models: Regularization to the rescue

Gu, J.-C., Xu, H.-X., Ma, J.-Y., Lu, P., Ling, Z.-H., Chang, K.-W., and Peng, N. Model editing harms general abilities of large language models: Regularization to the rescue. In Al-Onaizan, Y., Bansal, M., and Chen, Y.-N. (eds.), Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp.\ 16801--16819, Miami, Florida, USA,...

work page doi:10.18653/v1/2024.emnlp-main.934 2024

[24] [24]

Model editing harms general abilities of large language models: Regularization to the rescue

Gu, J.-C., Xu, H.-X., Ma, J.-Y., Lu, P., Ling, Z.-H., Chang, K.-W., and Peng, N. Model editing harms general abilities of large language models: Regularization to the rescue. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp.\ 16801--16819. Association for Computational Linguistics, 2024 b

2024

[25] [25]

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Guo, D., Yang, D., Zhang, H., Song, J., Zhang, R., Xu, R., Zhu, Q., Ma, S., Wang, P., Bi, X., et al. Deepseek-r1: Incentivizing reasoning capability in llms via reinforcement learning. arXiv preprint arXiv:2501.12948, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025

[26] [26]

Model editing at scale leads to gradual and catastrophic forgetting

Gupta, A., Rao, A., and Anumanchipalli, G. Model editing at scale leads to gradual and catastrophic forgetting. In Ku, L.-W., Martins, A., and Srikumar, V. (eds.), Findings of the Association for Computational Linguistics: ACL 2024, pp.\ 15202--15232, Bangkok, Thailand, August 2024 a . Association for Computational Linguistics. doi:10.18653/v1/2024.findin...

work page doi:10.18653/v1/2024.findings-acl.902 2024

[27] [27]

Model editing at scale leads to gradual and catastrophic forgetting

Gupta, A., Rao, A., and Anumanchipalli, G. Model editing at scale leads to gradual and catastrophic forgetting. In Findings of the Association for Computational Linguistics ACL 2024, pp.\ 15202--15232, 2024 b

2024

[28] [28]

flex tape can’t fix that

Halevy, K., Sotnikova, A., Alkhamissi, B., Montariol, S., and Bosselut, A. “flex tape can’t fix that”: Bias and misinformation in edited language models. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pp.\ 8690--8707, 2024

2024

[29] [29]

Aging with grace: Lifelong model editing with discrete key-value adaptors

Hartvigsen, T., Sankaranarayanan, S., Palangi, H., Kim, Y., and Ghassemi, M. Aging with grace: Lifelong model editing with discrete key-value adaptors. Advances in Neural Information Processing Systems, 36, 2024

2024

[30] [30]

Olympiadbench: A challenging benchmark for promoting agi with olympiad-level bilingual multimodal scientific problems

He, C., Luo, R., Bai, Y., Hu, S., Thai, Z., Shen, J., Hu, J., Han, X., Huang, Y., Zhang, Y., et al. Olympiadbench: A challenging benchmark for promoting agi with olympiad-level bilingual multimodal scientific problems. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 3828--3850, 2024

2024

[31] [31]

Knowledge updating? no more model editing! just selective contextual reasoning

He, G., Song, X., and Sun, A. Knowledge updating? no more model editing! just selective contextual reasoning. arXiv preprint arXiv:2503.05212, 2025

work page arXiv 2025

[32] [32]

Measuring Mathematical Problem Solving With the MATH Dataset

Hendrycks, D., Burns, C., Kadavath, S., Arora, A., Basart, S., Tang, E., Song, D., and Steinhardt, J. Measuring mathematical problem solving with the math dataset. arXiv preprint arXiv:2103.03874, 2021

work page internal anchor Pith review Pith/arXiv arXiv 2021

[33] [33]

Z., and Andreas, J

Hernandez, E., Li, B. Z., and Andreas, J. Inspecting and editing knowledge representations in language models. arXiv preprint arXiv:2304.00740, 2023

work page arXiv 2023

[34] [34]

K.-M., Lin, T.-H., Liao, C.-W., Fang, H.-C., Huang, C.-W., and Chen, Y.-N

Hsueh, C.-H., Huang, P. K.-M., Lin, T.-H., Liao, C.-W., Fang, H.-C., Huang, C.-W., and Chen, Y.-N. Editing the mind of giants: An in-depth exploration of pitfalls of knowledge editing in large language models. arXiv preprint arXiv:2406.01436, 2024

work page arXiv 2024

[35] [35]

Wilke: Wise-layer knowledge editor for lifelong knowledge editing

Hu, C., Cao, P., Chen, Y., Liu, K., and Zhao, J. Wilke: Wise-layer knowledge editor for lifelong knowledge editing. In Findings of the Association for Computational Linguistics ACL 2024, pp.\ 3476--3503, 2024

2024

[36] [36]

J., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., Wang, L., Chen, W., et al

Hu, E. J., Wallis, P., Allen-Zhu, Z., Li, Y., Wang, S., Wang, L., Chen, W., et al. Lora: Low-rank adaptation of large language models. In International Conference on Learning Representations

[37] [37]

Transformer-patcher: One mistake worth one neuron

Huang, Z., Shen, Y., Zhang, X., Zhou, J., Rong, W., and Xiong, Z. Transformer-patcher: One mistake worth one neuron. In The Eleventh International Conference on Learning Representations, 2023

2023

[38] [38]

GPT-4o System Card

Hurst, A., Lerer, A., Goucher, A. P., Perelman, A., Ramesh, A., Clark, A., Ostrow, A., Welihinda, A., Hayes, A., Radford, A., et al. Gpt-4o system card. arXiv preprint arXiv:2410.21276, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024

[39] [39]

Unsupervised dense information retrieval with contrastive learning

Izacard, G., Caron, M., Hosseini, L., Riedel, S., Bojanowski, P., Joulin, A., and Grave, E. Unsupervised dense information retrieval with contrastive learning. Transactions on Machine Learning Research

[40] [40]

J., Madotto, A., and Fung, P

Ji, Z., Lee, N., Frieske, R., Yu, T., Su, D., Xu, Y., Ishii, E., Bang, Y. J., Madotto, A., and Fung, P. Survey of hallucination in natural language generation. ACM Computing Surveys, 55 0 (12): 0 1--38, 2023

2023

[41] [41]

Mistral 7B

Jiang, A. Q., Sablayrolles, A., Mensch, A., Bamford, C., Chaplot, D. S., Casas, D. d. l., Bressand, F., Lengyel, G., Lample, G., Saulnier, L., et al. Mistral 7b. arXiv preprint arXiv:2310.06825, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[42] [42]

Anyedit: Edit any knowledge encoded in language models

Jiang, H., Fang, J., Zhang, N., Wan, M., Ma, G., Wang, X., He, X., and Chua, T.-S. Anyedit: Edit any knowledge encoded in language models. In International Conference on Machine Learning, pp.\ 27510--27533. PMLR, 2025

2025

[43] [43]

Learning to edit: Aligning llms with knowledge editing

Jiang, Y., Wang, Y., Wu, C., Zhong, W., Zeng, X., Gao, J., Li, L., Jiang, X., Shang, L., Tang, R., et al. Learning to edit: Aligning llms with knowledge editing. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 4689--4705, 2024

2024

[44] [44]

F., Araki, J., and Neubig, G

Jiang, Z., Xu, F. F., Araki, J., and Neubig, G. How can we know what language models know? Transactions of the Association for Computational Linguistics, 8: 0 423--438, 2020

2020

[45] [45]

Evaluating open-domain question answering in the era of large language models

Kamalloo, E., Dziri, N., Clarke, C., and Rafiei, D. Evaluating open-domain question answering in the era of large language models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 5591--5606, 2023

2023

[46] [46]

Reasoning abilities of large language models: In-depth analysis on the abstraction and reasoning corpus

Lee, S., Sim, W., Shin, D., Seo, W., Park, J., Lee, S., Hwang, S., Kim, S., and Kim, S. Reasoning abilities of large language models: In-depth analysis on the abstraction and reasoning corpus. ACM Transactions on Intelligent Systems and Technology

[47] [47]

Zero-shot relation extraction via reading comprehension

Levy, O., Seo, M., Choi, E., and Zettlemoyer, L. Zero-shot relation extraction via reading comprehension. In 21st Conference on Computational Natural Language Learning, CoNLL 2017, pp.\ 333--342. Association for Computational Linguistics (ACL), 2017

2017

[48] [48]

u ttler, H., Lewis, M., Yih, W.-t., Rockt \

Lewis, P., Perez, E., Piktus, A., Petroni, F., Karpukhin, V., Goyal, N., K \"u ttler, H., Lewis, M., Yih, W.-t., Rockt \"a schel, T., et al. Retrieval-augmented generation for knowledge-intensive nlp tasks. Advances in Neural Information Processing Systems, 33: 0 9459--9474, 2020

2020

[49] [49]

Solving quantitative reasoning problems with language models

Lewkowycz, A., Andreassen, A., Dohan, D., Dyer, E., Michalewski, H., Ramasesh, V., Slone, A., Anil, C., Schlag, I., Gutman-Solo, T., et al. Solving quantitative reasoning problems with language models. Advances in Neural Information Processing Systems, 35: 0 3843--3857, 2022

2022

[50] [50]

Q., Shen, Z., et al

Li, J., Beeching, E., Tunstall, L., Lipkin, B., Soletskyi, R., Huang, S., Rasul, K., Yu, L., Jiang, A. Q., Shen, Z., et al. Numinamath: The largest public dataset in ai4maths with 860k pairs of competition math problems and solutions. Hugging Face repository, 13: 0 9, 2024 a

2024

[51] [51]

and Chu, X

Li, Q. and Chu, X. Adaedit: Advancing continuous knowledge editing for large language models. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 4127--4149, 2025

2025

[52] [52]

Consecutive model editing with batch alongside hook layers

Li, S., Deng, Y., Cai, D., Lu, H., Chen, L., and Lam, W. Consecutive model editing with batch alongside hook layers. arXiv preprint arXiv:2403.05330, 2024 b

work page arXiv 2024

[53] [53]

Pmet: Precise model editing in a transformer

Li, X., Li, S., Song, S., Yang, J., Ma, J., and Yu, J. Pmet: Precise model editing in a transformer. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 38, pp.\ 18564--18572, 2024 c

2024

[54] [54]

E., Papailiopoulos, D., and Oymak, S

Li, Y., Ildiz, M. E., Papailiopoulos, D., and Oymak, S. Transformers as algorithms: Generalization and stability in in-context learning. In International Conference on Machine Learning, pp.\ 19565--19594. PMLR, 2023

2023

[55] [55]

Let's verify step by step

Lightman, H., Kosaraju, V., Burda, Y., Edwards, H., Baker, B., Lee, T., Leike, J., Schulman, J., Sutskever, I., and Cobbe, K. Let's verify step by step. In International Conference on Learning Representations, volume 2024, pp.\ 39578--39601, 2024

2024

[56] [56]

Edit less, achieve more: Dynamic sparse neuron masking for lifelong knowledge editing in llms

Liu, J., Sun, J., Shen, S., Yang, C., and Wang, S. Edit less, achieve more: Dynamic sparse neuron masking for lifelong knowledge editing in llms. In The Thirty-ninth Annual Conference on Neural Information Processing Systems

[57] [57]

Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing

Liu, P., Yuan, W., Fu, J., Jiang, Z., Hayashi, H., and Neubig, G. Pre-train, prompt, and predict: A systematic survey of prompting methods in natural language processing. ACM Computing Surveys, 55 0 (9): 0 1--35, 2023

2023

[58] [58]

X., Huan, J., Wang, H., et al

Liu, T., Li, R., Qi, Y., Liu, H., Tang, X., Zheng, T., Yin, Q., Cheng, M. X., Huan, J., Wang, H., et al. Unlocking efficient, scalable, and continual knowledge editing with basis-level representation fine-tuning. arXiv preprint arXiv:2503.00306, 2025 a

work page arXiv 2025

[59] [59]

W., and Lee, W

Liu, W., Xu, H., Liu, B., Deng, Z., Wang, H., Wang, J., Li, R., Teh, Y. W., and Lee, W. S. Is model editing built on sand? revealing its illusory success and fragile foundation, 2025 b . URL https://arxiv.org/abs/2510.00625

work page arXiv 2025

[60] [60]

An Empirical Study of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning

Luo, Y., Yang, Z., Meng, F., Li, Y., Zhou, J., and Zhang, Y. An empirical study of catastrophic forgetting in large language models during continual fine-tuning. arXiv preprint arXiv:2308.08747, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[61] [61]

Interpreting key mechanisms of factual recall in transformer-based language models

Lv, A., Zhang, K., Chen, Y., Wang, Y., Liu, L., Wen, J.-R., Xie, J., and Yan, R. Interpreting key mechanisms of factual recall in transformer-based language models. arXiv preprint arXiv:2403.19521, 2024 a

work page arXiv 2024

[62] [62]

Full parameter fine-tuning for large language models with limited resources

Lv, K., Yang, Y., Liu, T., Guo, Q., and Qiu, X. Full parameter fine-tuning for large language models with limited resources. In Ku, L.-W., Martins, A., and Srikumar, V. (eds.), Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 8187--8198, Bangkok, Thailand, August 2024 b . Association for...

work page doi:10.18653/v1/2024.acl-long.445 2024

[63] [63]

K-edit: Language model editing with contextual knowledge awareness

Markowitz, E., Ramakrishna, A., Mehrabi, N., Peris, C., Gupta, R., Chang, K.-W., and Galstyan, A. K-edit: Language model editing with contextual knowledge awareness. arXiv preprint arXiv:2502.10626, 2025

work page arXiv 2025

[64] [64]

arXiv preprint arXiv:2309.13638 (2023)

McCoy, R. T., Yao, S., Friedman, D., Hardy, M., and Griffiths, T. L. Embers of autoregression: Understanding large language models through the problem they are trained to solve. arXiv preprint arXiv:2309.13638, 2023

work page arXiv 2023

[65] [65]

S., Andonian, A

Meng, K., Sharma, A. S., Andonian, A. J., Belinkov, Y., and Bau, D. Mass-editing memory in a transformer. In The Eleventh International Conference on Learning Representations

[66] [66]

Locating and editing factual associations in gpt

Meng, K., Bau, D., Andonian, A., and Belinkov, Y. Locating and editing factual associations in gpt. Advances in Neural Information Processing Systems, 35: 0 17359--17372, 2022 a

2022

[67] [67]

Locating and editing factual associations in gpt

Meng, K., Bau, D., Andonian, A., and Belinkov, Y. Locating and editing factual associations in gpt. In Proceedings of the 36th International Conference on Neural Information Processing Systems, pp.\ 17359--17372, 2022 b

2022

[68] [68]

Llama 3 model card

Meta AI . Llama 3 model card. https://github.com/meta-llama/llama3/blob/main/MODEL_CARD.md, 2024. Accessed: 2025-04-21

2024

[69] [69]

Mitchell, E., Lin, C., Bosselut, A., Finn, C., and Manning, C. D. Fast model editing at scale. arXiv preprint arXiv:2110.11309, 2021

work page arXiv 2021

[70] [70]

D., and Finn, C

Mitchell, E., Lin, C., Bosselut, A., Manning, C. D., and Finn, C. Memory-based model editing at scale. In International Conference on Machine Learning, pp.\ 15817--15831. PMLR, 2022

2022

[71] [71]

M., Alghisi, S., and Riccardi, G

Mousavi, S. M., Alghisi, S., and Riccardi, G. Is your llm outdated? benchmarking llms & alignment algorithms for time-sensitive knowledge. arXiv preprint arXiv:2404.08700, 2024

work page arXiv 2024

[72] [72]

Bmike-53: Investigating cross-lingual knowledge editing with in-context learning

Nie, E., Shao, B., Wang, M., Ding, Z., Schmid, H., and Sch \"u tze, H. Bmike-53: Investigating cross-lingual knowledge editing with in-context learning. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp.\ 16357--16374, 2025

2025

[73] [73]

Nishi, K., Ramesh, R., Okawa, M., Khona, M., Tanaka, H., and Lubana, E. S. Representation shattering in transformers: A synthetic study with knowledge editing. In Singh, A., Fazel, M., Hsu, D., Lacoste-Julien, S., Berkenkamp, F., Maharaj, T., Wagstaff, K., and Zhu, J. (eds.), Proceedings of the 42nd International Conference on Machine Learning, volume 267...

2025

[74] [74]

GPT-4 Technical Report

OpenAI. Gpt-4 technical report. arXiv preprint arXiv:2303.08774, 2023

work page internal anchor Pith review Pith/arXiv arXiv 2023

[75] [75]

Training language models to follow instructions with human feedback

Ouyang, L., Wu, J., Jiang, X., Almeida, D., Wainwright, C., Mishkin, P., Zhang, C., Agarwal, S., Slama, K., Ray, A., et al. Training language models to follow instructions with human feedback. Advances in neural information processing systems, 35: 0 27730--27744, 2022

2022

[76] [76]

Context-robust knowledge editing for language models

Park, H., Choi, G., Kim, M., and Jo, Y. Context-robust knowledge editing for language models. In Findings of the Association for Computational Linguistics: ACL 2025, pp.\ 10360--10385, 2025

2025

[77] [77]

Event-level knowledge editing

Peng, H., Wang, X., Li, C., Zeng, K., Duo, J., Cao, Y., Hou, L., and Li, J. Event-level knowledge editing. arXiv preprint arXiv:2402.13093, 2024

work page arXiv 2024

[78] [78]

Petroni, F., Rockt \"a schel, T., Riedel, S., Lewis, P., Bakhtin, A., Wu, Y., and Miller, A. Language models as knowledge bases? In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pp.\ 2463--2473, 2019

2019

[79] [79]

and Elhadad, M

Pinter, Y. and Elhadad, M. Emptying the ocean with a spoon: Should we edit models? In Findings of the Association for Computational Linguistics: EMNLP 2023, pp.\ 15164--15172, 2023

2023

[80] [80]

S., Zhu, C., Li, D., Yu, F., Zaheer, M., Kumar, S., and Bhojanapalli, S

Rawat, A. S., Zhu, C., Li, D., Yu, F., Zaheer, M., Kumar, S., and Bhojanapalli, S. Modifying memories in transformer models. In International Conference on Machine Learning (ICML), volume 2020, 2021

2020