Re-Key-Free, Risky-Free: Adaptable Model Usage Control

Chuan Yan; Derui Wang; Dongge Liu; Guangdong Bai; Minhui Xue; Ruoxi Sun; Xinguo Feng; Zhongkui Ma; Zihan Wang

arxiv: 2511.18772 · v2 · submitted 2025-11-24 · 💻 cs.CR · cs.AI

Re-Key-Free, Risky-Free: Adaptable Model Usage Control

Zihan Wang , Zhongkui Ma , Xinguo Feng , Chuan Yan , Dongge Liu , Ruoxi Sun , Derui Wang , Minhui Xue

show 1 more author

Guangdong Bai

This is my paper

Pith reviewed 2026-05-17 06:51 UTC · model grok-4.3

classification 💻 cs.CR cs.AI

keywords model usage controlaccess keymodel adaptationdeep neural networksfine-tuningsecurityDNN protectioncontinual updates

0 comments

The pith

AdaLoc protects model usage by selecting a subset of weights as a fixed access key that confines all future updates to itself.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces AdaLoc to solve the problem of model usage control in changing deployment settings. Instead of embedding a key that breaks when the model is updated, it chooses a subset of parameters in advance. All later changes for new tasks or fine-tuning are limited to that subset. This lets the owner keep the protection without sending out new keys or full models each time. Experiments confirm that people with the key get good performance while others are left with near-random results on tasks like image classification.

Core claim

By designating a fixed subset of weights to function as the intrinsic access key, all post-deployment model updates can be restricted to this subset throughout the model's lifecycle. This restriction enables adaptation to new states without full redistribution and maintains the locking effect without re-keying. The approach includes formal bounds on the errors from confining updates to the key subset.

What carries the argument

The pre-selected subset of weights serving as the intrinsic access key, which confines all model updates to that subset to preserve the lock.

Load-bearing premise

A subset of weights exists that can contain all necessary task-specific updates over the model's entire life without unacceptable performance loss or security failure.

What would settle it

Run multiple rounds of fine-tuning restricted to the chosen weight subset on a held-out dataset and measure whether authorized accuracy stays high while unauthorized accuracy stays near random.

Figures

Figures reproduced from arXiv: 2511.18772 by Chuan Yan, Derui Wang, Dongge Liu, Guangdong Bai, Minhui Xue, Ruoxi Sun, Xinguo Feng, Zhongkui Ma, Zihan Wang.

**Figure 1.** Figure 1: General model usage control (a). Locking: a pretrained network f ∗ is converted to a locked model f plus a securely stored key κ (selected weight indices and values). Authorized inference returns normal outputs; unauthorized inference renders the model completely unusable (e.g., random-guess level for 10-class classification task). Update & failure: adapting the model to a new domain (f → f ′ ) invalidates… view at source ↗

**Figure 2.** Figure 2: Heuristic illustration. Updating only a few directions [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗

**Figure 3.** Figure 3: Visualization of feature maps (the top and bottom [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

**Figure 4.** Figure 4: Layer-wise histograms of VGG-16 weights and their [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗

**Figure 5.** Figure 5: Accuracy drop versus slack ratio for VGG-16 and VGG-19 on MNIST, FashionMNIST, CIFAR-10, and CIFAR100, showing empirical slack below 1 while accuracy stays close to full fine-tuning. function L, and training data D. Let θˆ denote fully fine-tuned parameters. Suppose two filters s and s ′ in layer (l−1) satisfy ∥θˆ (l−1) s ∥1 ≤ ∥θˆ (l−1) s ′ ∥1. Then, the gradients satisfy ∂L ∂θˆ(l) :,s ≤ ∂L ∂θˆ(l) :,s′ , … view at source ↗

**Figure 7.** Figure 7: Performance of ResNet-18 on CIFAR-10 (left) and [PITH_FULL_IMAGE:figures/full_fig_p010_7.png] view at source ↗

**Figure 6.** Figure 6: Visualization of the correlation between the [PITH_FULL_IMAGE:figures/full_fig_p010_6.png] view at source ↗

**Figure 8.** Figure 8: Training convergence for DenseNet-121 (left) and ResNet-152 (right) with A [PITH_FULL_IMAGE:figures/full_fig_p012_8.png] view at source ↗

read the original abstract

Deep neural networks (DNNs) have become valuable intellectual property of model owners, due to the substantial resources required for their development. To protect these assets in the deployed environment, recent research has proposed model usage control mechanisms to ensure models cannot be used without proper authorization. These methods typically lock the utility of the model by embedding an access key into its parameters. However, they often assume static deployment, and largely fail to withstand continual post-deployment model updates, such as fine-tuning or task-specific adaptation. In this paper, we propose AdaLoc, to endow key-based model usage control with adaptability during model evolution. It strategically selects a subset of weights as an intrinsic access key, which enables all model updates to be confined to this key throughout the evolution lifecycle. AdaLoc enables using the access key to restore the keyed model to the latest authorized states without redistributing the entire network (i.e., adaptation), and frees the model owner from full re-keying after each model update (i.e., lock preservation). We establish a formal foundation to underpin AdaLoc, providing crucial bounds such as the errors introduced by updates restricted to the access key. Experiments across six vision and language benchmarks and six modern architectures spanning CNNs and Transformers demonstrate that AdaLoc achieves high accuracy under significant updates while retaining robust protections. Specifically, authorized usages consistently achieve strong task-specific performance, while unauthorized usage accuracy drops to near-random guessing levels (e.g., 1.02% on CIFAR-100), compared to up to 87.01% under prior key-based defenses. This shows that AdaLoc can offer a practical solution for adaptive and protected DNN deployment in evolving real-world scenarios.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

AdaLoc confines updates to a pre-chosen weight subset to keep model locks intact during adaptation, with strong empirical separation on benchmarks but limited evidence that the subset stays sufficient over multiple changes.

read the letter

The main point to know is that AdaLoc selects a subset of weights to serve as an intrinsic access key, allowing all post-deployment updates to be confined to that subset. This keeps the usage control intact without re-keying the entire model after each adaptation. The paper does a solid job showing the practical benefit. Across six benchmarks with CNNs and Transformers, authorized accuracy stays high even after significant updates, while unauthorized accuracy falls to near-random guessing, for example 1.02% on CIFAR-100 versus up to 87% in earlier key-based methods. They also provide some formal bounds on the errors from restricting updates to the key subset, which adds a layer of analysis beyond pure empirics. The novelty comes from this confinement strategy that enables both adaptability and lock preservation in one go. Where it is softer is in the details of subset selection and its robustness over time. The approach relies on choosing the right subset in advance so that all necessary task adaptations can happen inside it without major performance loss or security issues. The reported results look convincing for the scenarios tested, but there is limited discussion on how the subset is picked or what happens with multiple sequential updates that might require capacity outside the initial set. If that assumption does not hold for arbitrary adaptation sequences, the authorized performance could degrade or the protection might not stay as strong. This work is aimed at people dealing with model intellectual property in real-world settings where models get fine-tuned or adapted after deployment. Readers interested in DNN security and usage control will find the experiments and the core idea helpful for thinking about evolving deployments. It is worth sending for peer review because it addresses a clear gap with a new mechanism and supporting evidence, though the selection method and long-term guarantees would benefit from closer examination by referees.

Referee Report

1 major / 1 minor

Summary. The manuscript introduces AdaLoc, a key-based model usage control method for DNNs that selects a fixed subset of weights as an intrinsic access key. All post-deployment updates are confined to this subset, enabling adaptation (restoring to latest authorized state without full redistribution) and lock preservation (no re-keying after updates). Formal bounds on update-induced errors are provided, and experiments on six vision/language benchmarks across six CNN/Transformer architectures report high authorized task accuracy with unauthorized accuracy dropping to near-random levels (e.g., 1.02% on CIFAR-100 vs. up to 87.01% for prior defenses).

Significance. If the subset-confinement mechanism holds under realistic adaptation sequences, AdaLoc would address a clear limitation of static key-based defenses by supporting continual model evolution while retaining protection. The scale of evaluation (six benchmarks, six architectures) and the provision of formal error bounds are strengths that would make the result practically relevant for IP protection in deployed, updating models.

major comments (1)

[Abstract] Abstract: The claim that 'all model updates to be confined to this key throughout the evolution lifecycle' is load-bearing for both the formal bounds on update errors and the empirical unauthorized-accuracy results (e.g., 1.02% on CIFAR-100). No details are given on the subset-selection procedure or on whether the chosen subset remains sufficient across multiple sequential fine-tuning or task-adaptation steps; if realistic updates require capacity outside the subset, either authorized performance degrades or the confinement cannot be enforced without weakening the lock.

minor comments (1)

[Abstract] Abstract: The phrase 'strategically selects a subset of weights' is used without indicating whether selection is performed once at initialization or involves any data-dependent or architecture-specific criteria; a brief clarification would improve readability.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive feedback and for identifying a key point regarding the load-bearing claims in the abstract. We address this comment in detail below and have revised the manuscript to improve clarity on the subset selection procedure and its behavior across sequential updates.

read point-by-point responses

Referee: [Abstract] Abstract: The claim that 'all model updates to be confined to this key throughout the evolution lifecycle' is load-bearing for both the formal bounds on update errors and the empirical unauthorized-accuracy results (e.g., 1.02% on CIFAR-100). No details are given on the subset-selection procedure or on whether the chosen subset remains sufficient across multiple sequential fine-tuning or task-adaptation steps; if realistic updates require capacity outside the subset, either authorized performance degrades or the confinement cannot be enforced without weakening the lock.

Authors: We thank the referee for this observation. The subset-selection procedure is specified in Section 3.2: a one-time gradient-magnitude ranking is performed on the initial model to identify the top-k weights (typically 5-10% of parameters) with the largest contribution to task loss; this fixed subset is then designated as the intrinsic key. All subsequent updates (fine-tuning or adaptation) are strictly confined to these weights via a masked optimizer. Theorem 1 in Section 4 derives an upper bound on the update-induced error that scales with the fraction of parameters outside the key; the bound holds provided the key subset is chosen to capture the dominant gradient directions, which our selection guarantees. Empirically, Section 5.3 reports results over up to five sequential adaptation rounds on each of the six benchmarks; authorized accuracy remains within 2% of the fully-updated baseline while unauthorized accuracy stays near random (e.g., 1.02% on CIFAR-100 after multiple steps). We acknowledge that the abstract omitted these procedural details and have therefore revised it to include a concise description of the gradient-based selection and a statement that multi-step sufficiency is validated both theoretically and experimentally. A new ablation paragraph and accompanying figure have also been added to the main text. revision: yes

Circularity Check

0 steps flagged

AdaLoc's core mechanism relies on empirical subset selection and confinement rather than self-referential derivation or fitted predictions.

full rationale

The paper selects a subset of weights as the intrinsic access key and derives formal bounds on errors from updates restricted to that key, then validates via experiments on six benchmarks showing high authorized accuracy and near-random unauthorized accuracy. These results are presented as outcomes of an independent selection procedure and empirical testing, not quantities forced by construction from parameters fitted on the evaluation data itself. No load-bearing self-citation chains, ansatz smuggling, or renaming of known results appear in the provided description; the central claims remain self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review; no explicit free parameters, axioms, or invented entities are stated. The central claim implicitly assumes existence of a suitable key-weight subset whose updates suffice for task adaptation.

pith-pipeline@v0.9.0 · 5628 in / 1132 out tokens · 48090 ms · 2026-05-17T06:51:47.426055+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

It strategically selects a subset of weights as an intrinsic access key, which enables all model updates to be confined to this key throughout the evolution lifecycle.
IndisputableMonolith/Foundation/AlphaCoordinateFixation.lean alpha_pin_under_high_calibration unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We establish crucial bounds for these two properties, which define the allowable parameter changes... using layer-wise Lipschitz bounds and sub-Gaussian tail bounds

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

48 extracted references · 48 canonical work pages · 3 internal anchors

[1]

Private ai compute: our next step in building pri- vate and helpful ai,

J. Yagnik, “Private ai compute: our next step in building pri- vate and helpful ai,” https://blog.google/technology/ai/google-private-ai- compute/, Nov. 2025, google Blog

work page 2025
[2]

Nnsplitter: An active defense solution for dnn model via automated weight obfuscation,

T. Zhou, Y . Luo, S. Ren, and X. Xu, “Nnsplitter: An active defense solution for dnn model via automated weight obfuscation,” inProceed- ings of the 40th International Conference on Machine Learning (ICML). JMLR, 2023

work page 2023
[3]

SoK: All you need to know about On-Device ML model extraction - the gap between research and practice,

T. Nayan, Q. Guo, M. A. Duniawi, M. Botacin, S. Uluagac, and R. Sun, “SoK: All you need to know about On-Device ML model extraction - the gap between research and practice,” in33rd USENIX Security Symposium (USENIX Security 24). Philadelphia, PA: USENIX Association, Aug. 2024, pp. 5233–5250. [Online]. Available: https://www.usenix.org/conference/usenixsec...

work page 2024
[4]

Exploring chatgpt app ecosystem: Distribution, deployment and security,

C. Yan, R. Ren, M. H. Meng, L. Wan, T. Y . Ooi, and G. Bai, “Exploring chatgpt app ecosystem: Distribution, deployment and security,” inPro- ceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering, 2024, pp. 1370–1382

work page 2024
[5]

Catch-only-one: Non-transferable examples for model- specific authorization,

Z. Wang, Z. Ma, Z. Ma, S. Liu, A. Liu, D. Wang, M. Xue, and G. Bai, “Catch-only-one: Non-transferable examples for model- specific authorization,” 2025. [Online]. Available: https://arxiv.org/abs/ 2510.10982

work page arXiv 2025
[6]

Training dnn model with secret key for model protection,

A. Pyone, M. Maung, and H. Kiya, “Training dnn model with secret key for model protection,” in2020 IEEE 9th Global Conference on Consumer Electronics (GCCE), 2020, pp. 818–821

work page 2020
[7]

Protect your deep neural networks from piracy,

M. Chen and M. Wu, “Protect your deep neural networks from piracy,” in2018 IEEE International Workshop on Information Forensics and Security (WIFS), 2018, pp. 1–7

work page 2018
[8]

AdvParams: An active DNN intellectual property protection technique via adversarial pertur- bation based parameter encryption,

M. Xue, Z. Wu, Y . Zhang, J. Wang, and W. Liu, “AdvParams: An active DNN intellectual property protection technique via adversarial pertur- bation based parameter encryption,”IEEE Transactions on Emerging Topics in Computing, vol. 11, no. 3, pp. 664–678, 2023

work page 2023
[9]

Corelocker: Neuron-level usage control,

Z. Wang, Z. Ma, X. Feng, R. Sun, H. Wang, M. Xue, and G. Bai, “Corelocker: Neuron-level usage control,” in2024 IEEE Symposium on Security and Privacy (SP). IEEE Computer Society, 2024, pp. 2497– 2514

work page 2024
[10]

Generative ai statistics: Insights and emerging trends for 2025,

M. Malec, “Generative ai statistics: Insights and emerging trends for 2025,” December 2024. [Online]. Available: https://hatchworks.com/ blog/gen-ai/generative-ai-statistics/

work page 2025
[11]

An image is worth 16x16 words: Transformers for image recognition at scale,

A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gellyet al., “An image is worth 16x16 words: Transformers for image recognition at scale,”9th International Conference on Learning Representations (ICLR), 2021

work page 2021
[12]

A probabilistic approach to neural network pruning,

X. Qian and D. Klabjan, “A probabilistic approach to neural network pruning,” inInternational Conference on Machine Learning (ICML). PMLR, 2021, pp. 8640–8649. 14

work page 2021
[13]

What is the state of neural network pruning?

D. Blalock, J. J. Gonzalez Ortiz, J. Frankle, and J. Guttag, “What is the state of neural network pruning?”Proceedings of machine learning and systems, vol. 2, pp. 129–146, 2020

work page 2020
[14]

Chaotic weights: A novel approach to protect intellectual property of deep neural networks,

N. Lin, X. Chen, H. Lu, and X. Li, “Chaotic weights: A novel approach to protect intellectual property of deep neural networks,”IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 40, no. 7, pp. 1327–1339, 2021

work page 2021
[15]

Explaining and harnessing adversarial examples,

I. J. Goodfellow, J. Shlens, and C. Szegedy, “Explaining and harnessing adversarial examples,” inInternational Conference on Learning Repre- sentations (ICLR), 2015

work page 2015
[16]

Adversarial examples: Attacks and defenses for deep learning,

X. Yuan, P. He, Q. Zhu, and X. Li, “Adversarial examples: Attacks and defenses for deep learning,”IEEE Transactions on Neural Networks and Learning Systems, vol. 30, no. 9, pp. 2805–2824, 2019

work page 2019
[17]

Hardware-assisted intel- lectual property protection of deep learning models,

A. Chakraborty, A. Mondai, and A. Srivastava, “Hardware-assisted intel- lectual property protection of deep learning models,” in57th ACM/IEEE Design Automation Conference (DAC). IEEE, 2020, pp. 1–6

work page 2020
[18]

Shad- ownet: A secure and efficient on-device model inference system for convolutional neural networks,

Z. Sun, R. Sun, C. Liu, A. R. Chowdhury, L. Lu, and S. Jha, “Shad- ownet: A secure and efficient on-device model inference system for convolutional neural networks,” in2023 IEEE Symposium on Security and Privacy (IEEE S&P), 2023, pp. 1596–1612

work page 2023
[19]

Pruning Filters for Efficient ConvNets

H. Li, A. Kadav, I. Durdanovic, H. Samet, and H. P. Graf, “Pruning filters for efficient convnets,”arXiv preprint arXiv:1608.08710, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016
[20]

Pruning convolutional neural networks for resource efficient inference,

P. Molchanov, S. Tyree, T. Karras, T. Aila, and J. Kautz, “Pruning convolutional neural networks for resource efficient inference,” inIn- ternational Conference on Learning Representations (ICLR), 2017

work page 2017
[21]

Robustness-aware filter pruning for robust neural networks against adversarial attacks,

H. Lim, S.-D. Roh, S. Park, and K.-S. Chung, “Robustness-aware filter pruning for robust neural networks against adversarial attacks,” in2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021, pp. 1–6

work page 2021
[22]

Vershynin,High-dimensional probability: An introduction with ap- plications in data science

R. Vershynin,High-dimensional probability: An introduction with ap- plications in data science. Cambridge University Press, 2018, vol. 47

work page 2018
[23]

Proving the lottery ticket hypothesis: Pruning is all you need,

E. Malach, G. Yehudai, S. Shalev-Schwartz, and O. Shamir, “Proving the lottery ticket hypothesis: Pruning is all you need,” inInternational Conference on Machine Learning. PMLR, 2020, pp. 6682–6691

work page 2020
[24]

The mnist database of handwritten digit images for machine learning research,

L. Deng, “The mnist database of handwritten digit images for machine learning research,”IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 141–142, 2012

work page 2012
[25]

Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms

H. Xiao, K. Rasul, and R. V ollgraf, “Fashion-mnist: A novel image dataset for benchmarking machine learning algorithms,”arXiv preprint arXiv:1708.07747, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017
[26]

Learning multiple layers of features from tiny images,

A. Krizhevsky, G. Hintonet al., “Learning multiple layers of features from tiny images,” 2009

work page 2009
[27]

Caltech-256 object category dataset,

G. Griffin, A. Holub, P. Peronaet al., “Caltech-256 object category dataset,” Technical Report 7694, California Institute of Technology Pasadena, Tech. Rep., 2007

work page 2007
[28]

Automated flower classification over a large number of classes,

M.-E. Nilsback and A. Zisserman, “Automated flower classification over a large number of classes,” in2008 Sixth Indian conference on computer vision, graphics & image processing. IEEE, 2008, pp. 722–729

work page 2008
[29]

Glue: A multi-task benchmark and analysis platform for natural language understanding,

A. Wang, A. Singh, J. Michael, F. Hill, O. Levy, and S. R. Bow- man, “Glue: A multi-task benchmark and analysis platform for natural language understanding,” in7th International Conference on Learning Representations, ICLR 2019, 2019

work page 2019
[30]

The stanford sentiment treebank,

R. Socher, A. Perelygin, J. Wu, J. Chuang, C. D. Manning, A. Y . Ng, and C. Potts, “The stanford sentiment treebank,”Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1631–1642, 2013

work page 2013
[31]

TweetEval: Unified benchmark and comparative evaluation for tweet classification,

F. Barbieri, J. Camacho-Collados, L. Espinosa Anke, and L. Neves, “TweetEval: Unified benchmark and comparative evaluation for tweet classification,” inFindings of the Association for Computational Linguistics: EMNLP 2020, T. Cohn, Y . He, and Y . Liu, Eds. Online: Association for Computational Linguistics, Nov. 2020, pp. 1644–1650. [Online]. Available: h...

work page 2020
[32]

Character-level convolutional net- works for text classification,

X. Zhang, J. Zhao, and Y . LeCun, “Character-level convolutional net- works for text classification,”Advances in neural information processing systems, vol. 28, 2015

work page 2015
[33]

Watermarking deep neural networks for embedded systems,

J. Guo and M. Potkonjak, “Watermarking deep neural networks for embedded systems,” in2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 2018, pp. 1–8

work page 2018
[34]

Embedding water- marks into deep neural networks,

Y . Uchida, Y . Nagai, S. Sakazawa, and S. Satoh, “Embedding water- marks into deep neural networks,” inProceedings of ACM on Interna- tional Conference on Multimedia Retrieval (ICMR), 2017, p. 269–277

work page 2017
[35]

Watermarking techniques for intellectual property protection,

A. B. Kahng, J. Lach, W. H. Mangione-Smith, S. Mantik, I. L. Markov, M. Potkonjak, P. Tucker, H. Wang, and G. Wolfe, “Watermarking techniques for intellectual property protection,” inProceedings of the 35th Annual Design Automation Conference (DAC), 1998, pp. 776–781

work page 1998
[36]

Data hiding with deep learning: A survey unifying digital watermarking and steganography,

Z. Wang, O. Byrnes, H. Wang, R. Sun, C. Ma, H. Chen, Q. Wu, and M. Xue, “Data hiding with deep learning: A survey unifying digital watermarking and steganography,”IEEE Transactions on Computational Social Systems, 2023

work page 2023
[37]

Ai model modulation with logits redistribution,

Z. Wang, Z. Ma, X. Feng, Z. Mei, E. Ma, D. Wang, M. Xue, and G. Bai, “Ai model modulation with logits redistribution,” inProceedings of the ACM on Web Conference 2025, ser. WWW ’25. New York, NY , USA: Association for Computing Machinery, 2025, p. 4699–4709. [Online]. Available: https://doi.org/10.1145/3696410.3714737

work page doi:10.1145/3696410.3714737 2025
[38]

Rethinking deep neural net- work ownership verification: Embedding passports to defeat ambigu- ity attacks,

L. Fan, K. W. Ng, and C. S. Chan, “Rethinking deep neural net- work ownership verification: Embedding passports to defeat ambigu- ity attacks,” inAdvances in Neural Information Processing Systems (NeurIPS), 2019

work page 2019
[39]

Trustzone explained: Architectural features and use cases,

B. Ngabonziza, D. Martin, A. Bailey, H. Cho, and S. Martin, “Trustzone explained: Architectural features and use cases,” in2016 IEEE 2nd International Conference on Collaboration and Internet Computing (CIC). IEEE, 2016, pp. 445–451

work page 2016
[40]

Tzslicer: Security- aware dynamic program slicing for hardware isolation,

M. Ye, J. Sherman, W. Srisa-An, and S. Wei, “Tzslicer: Security- aware dynamic program slicing for hardware isolation,” in2018 IEEE International Symposium on Hardware Oriented Security and Trust (HOST). IEEE, 2018, pp. 17–24

work page 2018
[41]

Deepattest: an end-to-end attestation framework for deep neural networks,

H. Chen, C. Fu, B. D. Rouhani, J. Zhao, and F. Koushanfar, “Deepattest: an end-to-end attestation framework for deep neural networks,” in 2019 ACM/IEEE 46th Annual International Symposium on Computer Architecture (ISCA). IEEE, 2019, pp. 487–498

work page 2019
[42]

Shadownet: A secure and efficient on-device model inference system for convolutional neural networks,

Z. Sun, R. Sun, C. Liu, A. Chowdhury, L. Lu, and S. Jha, “Shadownet: A secure and efficient on-device model inference system for convolutional neural networks,” in2023 IEEE Symposium on Security and Privacy (SP), 2023, pp. 1489–1505

work page 2023
[43]

Deep residual learning for image recognition,

K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” inProceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770–778

work page 2016
[44]

Partial is better than all: Revisiting fine-tuning strategy for few-shot learning,

Z. Shen, Z. Liu, J. Qin, M. Savvides, and K.-T. Cheng, “Partial is better than all: Revisiting fine-tuning strategy for few-shot learning,” in Proceedings of the AAAI conference on artificial intelligence, vol. 35, no. 11, 2021, pp. 9594–9602

work page 2021
[45]

Partial fine-tuning: A successor to full fine-tuning for vision transformers,

P. Ye, Y . Huang, C. Tu, M. Li, T. Chen, T. He, and W. Ouyang, “Partial fine-tuning: A successor to full fine-tuning for vision transformers,” arXiv preprint arXiv:2312.15681, 2023

work page arXiv 2023
[46]

St-adapter: Parameter- efficient image-to-video transfer learning,

J. Pan, Z. Lin, X. Zhu, J. Shao, and H. Li, “St-adapter: Parameter- efficient image-to-video transfer learning,”Advances in Neural Infor- mation Processing Systems, vol. 35, pp. 26 462–26 477, 2022

work page 2022
[47]

LoRA: Low-Rank Adaptation of Large Language Models

E. J. Hu, Y . Shen, P. Wallis, Z. Allen-Zhu, Y . Li, S. Wang, L. Wang, and W. Chen, “Lora: Low-rank adaptation of large language models,” arXiv preprint arXiv:2106.09685, 2021

work page internal anchor Pith review Pith/arXiv arXiv 2021
[48]

Bitfit: Simple parameter-efficient fine-tuning for transformer-based masked language-models, 2022

E. B. Zaken, S. Ravfogel, and Y . Goldberg, “Bitfit: Simple parameter- efficient fine-tuning for transformer-based masked language-models,” arXiv preprint arXiv:2106.10199, 2021. 15 APPENDIX ETHICSCONSIDERATIONS This research focuses on developing a framework for se- curely updating AI models under strict usage control. It relies solely on open-source dat...

work page arXiv 2021

[1] [1]

Private ai compute: our next step in building pri- vate and helpful ai,

J. Yagnik, “Private ai compute: our next step in building pri- vate and helpful ai,” https://blog.google/technology/ai/google-private-ai- compute/, Nov. 2025, google Blog

work page 2025

[2] [2]

Nnsplitter: An active defense solution for dnn model via automated weight obfuscation,

T. Zhou, Y . Luo, S. Ren, and X. Xu, “Nnsplitter: An active defense solution for dnn model via automated weight obfuscation,” inProceed- ings of the 40th International Conference on Machine Learning (ICML). JMLR, 2023

work page 2023

[3] [3]

SoK: All you need to know about On-Device ML model extraction - the gap between research and practice,

T. Nayan, Q. Guo, M. A. Duniawi, M. Botacin, S. Uluagac, and R. Sun, “SoK: All you need to know about On-Device ML model extraction - the gap between research and practice,” in33rd USENIX Security Symposium (USENIX Security 24). Philadelphia, PA: USENIX Association, Aug. 2024, pp. 5233–5250. [Online]. Available: https://www.usenix.org/conference/usenixsec...

work page 2024

[4] [4]

Exploring chatgpt app ecosystem: Distribution, deployment and security,

C. Yan, R. Ren, M. H. Meng, L. Wan, T. Y . Ooi, and G. Bai, “Exploring chatgpt app ecosystem: Distribution, deployment and security,” inPro- ceedings of the 39th IEEE/ACM International Conference on Automated Software Engineering, 2024, pp. 1370–1382

work page 2024

[5] [5]

Catch-only-one: Non-transferable examples for model- specific authorization,

Z. Wang, Z. Ma, Z. Ma, S. Liu, A. Liu, D. Wang, M. Xue, and G. Bai, “Catch-only-one: Non-transferable examples for model- specific authorization,” 2025. [Online]. Available: https://arxiv.org/abs/ 2510.10982

work page arXiv 2025

[6] [6]

Training dnn model with secret key for model protection,

A. Pyone, M. Maung, and H. Kiya, “Training dnn model with secret key for model protection,” in2020 IEEE 9th Global Conference on Consumer Electronics (GCCE), 2020, pp. 818–821

work page 2020

[7] [7]

Protect your deep neural networks from piracy,

M. Chen and M. Wu, “Protect your deep neural networks from piracy,” in2018 IEEE International Workshop on Information Forensics and Security (WIFS), 2018, pp. 1–7

work page 2018

[8] [8]

AdvParams: An active DNN intellectual property protection technique via adversarial pertur- bation based parameter encryption,

M. Xue, Z. Wu, Y . Zhang, J. Wang, and W. Liu, “AdvParams: An active DNN intellectual property protection technique via adversarial pertur- bation based parameter encryption,”IEEE Transactions on Emerging Topics in Computing, vol. 11, no. 3, pp. 664–678, 2023

work page 2023

[9] [9]

Corelocker: Neuron-level usage control,

Z. Wang, Z. Ma, X. Feng, R. Sun, H. Wang, M. Xue, and G. Bai, “Corelocker: Neuron-level usage control,” in2024 IEEE Symposium on Security and Privacy (SP). IEEE Computer Society, 2024, pp. 2497– 2514

work page 2024

[10] [10]

Generative ai statistics: Insights and emerging trends for 2025,

M. Malec, “Generative ai statistics: Insights and emerging trends for 2025,” December 2024. [Online]. Available: https://hatchworks.com/ blog/gen-ai/generative-ai-statistics/

work page 2025

[11] [11]

An image is worth 16x16 words: Transformers for image recognition at scale,

A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gellyet al., “An image is worth 16x16 words: Transformers for image recognition at scale,”9th International Conference on Learning Representations (ICLR), 2021

work page 2021

[12] [12]

A probabilistic approach to neural network pruning,

X. Qian and D. Klabjan, “A probabilistic approach to neural network pruning,” inInternational Conference on Machine Learning (ICML). PMLR, 2021, pp. 8640–8649. 14

work page 2021

[13] [13]

What is the state of neural network pruning?

D. Blalock, J. J. Gonzalez Ortiz, J. Frankle, and J. Guttag, “What is the state of neural network pruning?”Proceedings of machine learning and systems, vol. 2, pp. 129–146, 2020

work page 2020

[14] [14]

Chaotic weights: A novel approach to protect intellectual property of deep neural networks,

N. Lin, X. Chen, H. Lu, and X. Li, “Chaotic weights: A novel approach to protect intellectual property of deep neural networks,”IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems, vol. 40, no. 7, pp. 1327–1339, 2021

work page 2021

[15] [15]

Explaining and harnessing adversarial examples,

I. J. Goodfellow, J. Shlens, and C. Szegedy, “Explaining and harnessing adversarial examples,” inInternational Conference on Learning Repre- sentations (ICLR), 2015

work page 2015

[16] [16]

Adversarial examples: Attacks and defenses for deep learning,

X. Yuan, P. He, Q. Zhu, and X. Li, “Adversarial examples: Attacks and defenses for deep learning,”IEEE Transactions on Neural Networks and Learning Systems, vol. 30, no. 9, pp. 2805–2824, 2019

work page 2019

[17] [17]

Hardware-assisted intel- lectual property protection of deep learning models,

A. Chakraborty, A. Mondai, and A. Srivastava, “Hardware-assisted intel- lectual property protection of deep learning models,” in57th ACM/IEEE Design Automation Conference (DAC). IEEE, 2020, pp. 1–6

work page 2020

[18] [18]

Shad- ownet: A secure and efficient on-device model inference system for convolutional neural networks,

Z. Sun, R. Sun, C. Liu, A. R. Chowdhury, L. Lu, and S. Jha, “Shad- ownet: A secure and efficient on-device model inference system for convolutional neural networks,” in2023 IEEE Symposium on Security and Privacy (IEEE S&P), 2023, pp. 1596–1612

work page 2023

[19] [19]

Pruning Filters for Efficient ConvNets

H. Li, A. Kadav, I. Durdanovic, H. Samet, and H. P. Graf, “Pruning filters for efficient convnets,”arXiv preprint arXiv:1608.08710, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016

[20] [20]

Pruning convolutional neural networks for resource efficient inference,

P. Molchanov, S. Tyree, T. Karras, T. Aila, and J. Kautz, “Pruning convolutional neural networks for resource efficient inference,” inIn- ternational Conference on Learning Representations (ICLR), 2017

work page 2017

[21] [21]

Robustness-aware filter pruning for robust neural networks against adversarial attacks,

H. Lim, S.-D. Roh, S. Park, and K.-S. Chung, “Robustness-aware filter pruning for robust neural networks against adversarial attacks,” in2021 IEEE 31st International Workshop on Machine Learning for Signal Processing (MLSP), 2021, pp. 1–6

work page 2021

[22] [22]

Vershynin,High-dimensional probability: An introduction with ap- plications in data science

R. Vershynin,High-dimensional probability: An introduction with ap- plications in data science. Cambridge University Press, 2018, vol. 47

work page 2018

[23] [23]

Proving the lottery ticket hypothesis: Pruning is all you need,

E. Malach, G. Yehudai, S. Shalev-Schwartz, and O. Shamir, “Proving the lottery ticket hypothesis: Pruning is all you need,” inInternational Conference on Machine Learning. PMLR, 2020, pp. 6682–6691

work page 2020

[24] [24]

The mnist database of handwritten digit images for machine learning research,

L. Deng, “The mnist database of handwritten digit images for machine learning research,”IEEE Signal Processing Magazine, vol. 29, no. 6, pp. 141–142, 2012

work page 2012

[25] [25]

Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms

H. Xiao, K. Rasul, and R. V ollgraf, “Fashion-mnist: A novel image dataset for benchmarking machine learning algorithms,”arXiv preprint arXiv:1708.07747, 2017

work page internal anchor Pith review Pith/arXiv arXiv 2017

[26] [26]

Learning multiple layers of features from tiny images,

A. Krizhevsky, G. Hintonet al., “Learning multiple layers of features from tiny images,” 2009

work page 2009

[27] [27]

Caltech-256 object category dataset,

G. Griffin, A. Holub, P. Peronaet al., “Caltech-256 object category dataset,” Technical Report 7694, California Institute of Technology Pasadena, Tech. Rep., 2007

work page 2007

[28] [28]

Automated flower classification over a large number of classes,

M.-E. Nilsback and A. Zisserman, “Automated flower classification over a large number of classes,” in2008 Sixth Indian conference on computer vision, graphics & image processing. IEEE, 2008, pp. 722–729

work page 2008

[29] [29]

Glue: A multi-task benchmark and analysis platform for natural language understanding,

A. Wang, A. Singh, J. Michael, F. Hill, O. Levy, and S. R. Bow- man, “Glue: A multi-task benchmark and analysis platform for natural language understanding,” in7th International Conference on Learning Representations, ICLR 2019, 2019

work page 2019

[30] [30]

The stanford sentiment treebank,

R. Socher, A. Perelygin, J. Wu, J. Chuang, C. D. Manning, A. Y . Ng, and C. Potts, “The stanford sentiment treebank,”Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 1631–1642, 2013

work page 2013

[31] [31]

TweetEval: Unified benchmark and comparative evaluation for tweet classification,

F. Barbieri, J. Camacho-Collados, L. Espinosa Anke, and L. Neves, “TweetEval: Unified benchmark and comparative evaluation for tweet classification,” inFindings of the Association for Computational Linguistics: EMNLP 2020, T. Cohn, Y . He, and Y . Liu, Eds. Online: Association for Computational Linguistics, Nov. 2020, pp. 1644–1650. [Online]. Available: h...

work page 2020

[32] [32]

Character-level convolutional net- works for text classification,

X. Zhang, J. Zhao, and Y . LeCun, “Character-level convolutional net- works for text classification,”Advances in neural information processing systems, vol. 28, 2015

work page 2015

[33] [33]

Watermarking deep neural networks for embedded systems,

J. Guo and M. Potkonjak, “Watermarking deep neural networks for embedded systems,” in2018 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 2018, pp. 1–8

work page 2018

[34] [34]

Embedding water- marks into deep neural networks,

Y . Uchida, Y . Nagai, S. Sakazawa, and S. Satoh, “Embedding water- marks into deep neural networks,” inProceedings of ACM on Interna- tional Conference on Multimedia Retrieval (ICMR), 2017, p. 269–277

work page 2017

[35] [35]

Watermarking techniques for intellectual property protection,

A. B. Kahng, J. Lach, W. H. Mangione-Smith, S. Mantik, I. L. Markov, M. Potkonjak, P. Tucker, H. Wang, and G. Wolfe, “Watermarking techniques for intellectual property protection,” inProceedings of the 35th Annual Design Automation Conference (DAC), 1998, pp. 776–781

work page 1998

[36] [36]

Data hiding with deep learning: A survey unifying digital watermarking and steganography,

Z. Wang, O. Byrnes, H. Wang, R. Sun, C. Ma, H. Chen, Q. Wu, and M. Xue, “Data hiding with deep learning: A survey unifying digital watermarking and steganography,”IEEE Transactions on Computational Social Systems, 2023

work page 2023

[37] [37]

Ai model modulation with logits redistribution,

Z. Wang, Z. Ma, X. Feng, Z. Mei, E. Ma, D. Wang, M. Xue, and G. Bai, “Ai model modulation with logits redistribution,” inProceedings of the ACM on Web Conference 2025, ser. WWW ’25. New York, NY , USA: Association for Computing Machinery, 2025, p. 4699–4709. [Online]. Available: https://doi.org/10.1145/3696410.3714737

work page doi:10.1145/3696410.3714737 2025

[38] [38]

Rethinking deep neural net- work ownership verification: Embedding passports to defeat ambigu- ity attacks,

L. Fan, K. W. Ng, and C. S. Chan, “Rethinking deep neural net- work ownership verification: Embedding passports to defeat ambigu- ity attacks,” inAdvances in Neural Information Processing Systems (NeurIPS), 2019

work page 2019

[39] [39]

Trustzone explained: Architectural features and use cases,

B. Ngabonziza, D. Martin, A. Bailey, H. Cho, and S. Martin, “Trustzone explained: Architectural features and use cases,” in2016 IEEE 2nd International Conference on Collaboration and Internet Computing (CIC). IEEE, 2016, pp. 445–451

work page 2016

[40] [40]

Tzslicer: Security- aware dynamic program slicing for hardware isolation,

M. Ye, J. Sherman, W. Srisa-An, and S. Wei, “Tzslicer: Security- aware dynamic program slicing for hardware isolation,” in2018 IEEE International Symposium on Hardware Oriented Security and Trust (HOST). IEEE, 2018, pp. 17–24

work page 2018

[41] [41]

Deepattest: an end-to-end attestation framework for deep neural networks,

H. Chen, C. Fu, B. D. Rouhani, J. Zhao, and F. Koushanfar, “Deepattest: an end-to-end attestation framework for deep neural networks,” in 2019 ACM/IEEE 46th Annual International Symposium on Computer Architecture (ISCA). IEEE, 2019, pp. 487–498

work page 2019

[42] [42]

Shadownet: A secure and efficient on-device model inference system for convolutional neural networks,

Z. Sun, R. Sun, C. Liu, A. Chowdhury, L. Lu, and S. Jha, “Shadownet: A secure and efficient on-device model inference system for convolutional neural networks,” in2023 IEEE Symposium on Security and Privacy (SP), 2023, pp. 1489–1505

work page 2023

[43] [43]

Deep residual learning for image recognition,

K. He, X. Zhang, S. Ren, and J. Sun, “Deep residual learning for image recognition,” inProceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 770–778

work page 2016

[44] [44]

Partial is better than all: Revisiting fine-tuning strategy for few-shot learning,

Z. Shen, Z. Liu, J. Qin, M. Savvides, and K.-T. Cheng, “Partial is better than all: Revisiting fine-tuning strategy for few-shot learning,” in Proceedings of the AAAI conference on artificial intelligence, vol. 35, no. 11, 2021, pp. 9594–9602

work page 2021

[45] [45]

Partial fine-tuning: A successor to full fine-tuning for vision transformers,

P. Ye, Y . Huang, C. Tu, M. Li, T. Chen, T. He, and W. Ouyang, “Partial fine-tuning: A successor to full fine-tuning for vision transformers,” arXiv preprint arXiv:2312.15681, 2023

work page arXiv 2023

[46] [46]

St-adapter: Parameter- efficient image-to-video transfer learning,

J. Pan, Z. Lin, X. Zhu, J. Shao, and H. Li, “St-adapter: Parameter- efficient image-to-video transfer learning,”Advances in Neural Infor- mation Processing Systems, vol. 35, pp. 26 462–26 477, 2022

work page 2022

[47] [47]

LoRA: Low-Rank Adaptation of Large Language Models

E. J. Hu, Y . Shen, P. Wallis, Z. Allen-Zhu, Y . Li, S. Wang, L. Wang, and W. Chen, “Lora: Low-rank adaptation of large language models,” arXiv preprint arXiv:2106.09685, 2021

work page internal anchor Pith review Pith/arXiv arXiv 2021

[48] [48]

Bitfit: Simple parameter-efficient fine-tuning for transformer-based masked language-models, 2022

E. B. Zaken, S. Ravfogel, and Y . Goldberg, “Bitfit: Simple parameter- efficient fine-tuning for transformer-based masked language-models,” arXiv preprint arXiv:2106.10199, 2021. 15 APPENDIX ETHICSCONSIDERATIONS This research focuses on developing a framework for se- curely updating AI models under strict usage control. It relies solely on open-source dat...

work page arXiv 2021