Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique

Ahmed Salem; Mark Russinovich; Yanan Cai

arxiv: 2407.10887 · v4 · pith:UNKVRDCWnew · submitted 2024-07-15 · 💻 cs.CR · cs.AI

Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique

Mark Russinovich , Yanan Cai , Ahmed Salem This is my paper

classification 💻 cs.CR cs.AI

keywords fingerprintfingerprintingmodelownershipchainframeworkhashmisuse

0 comments

read the original abstract

Growing concerns over the theft and misuse of Large Language Models (LLMs) underscore the need for effective fingerprinting to link a model to its original version and detect misuse. We define five essential properties for a successful fingerprint: Transparency, Efficiency, Persistence, Robustness, and Unforgeability. We present a novel fingerprinting framework that provides verifiable proof of ownership while preserving fingerprint integrity. Our approach makes two main contributions. First, a chain and hash technique that cryptographically binds fingerprint prompts to their responses, preventing collisions and enabling irrefutable ownership claims. Second, we address a realistic threat model in which instruction-tuned models' output distribution can be significantly altered through meta-prompts. By incorporating random padding and varied meta-prompt configurations during training, our method maintains robustness even under significant output style changes. Experiments show that our framework securely proves ownership, resists both benign transformations (e.g., fine-tuning) and adversarial fingerprint removal, and extends to fingerprinting LoRA adapters\footnote{We release our code at: https://github.com/microsoft/Chain-Hash.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 6 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

FLIPS: Instance-Fingerprinting for LLMs via Pseudo-random Sequences
cs.LG 2026-06 unverdicted novelty 8.0

FLIPS identifies LLM instances with 96% closed-set and 90% open-set accuracy by exploiting biases in generated binary random sequences across 237 instances.
KBF: Knowledge Boundary as Fingerprint for Language Model and Black-Box API Auditing
cs.CR 2026-05 unverdicted novelty 7.0

KBF uses stable numerical recall near the knowledge boundary to fingerprint and audit black-box LLM APIs, successfully detecting all tested substitutions and some real-world inconsistencies across production endpoints.
Copyright Protection for Large Language Models: A Survey of Methods, Challenges, and Trends
cs.CR 2025-08 accept novelty 7.0

A survey of LLM copyright protection that unifies text watermarking, model watermarking, and model fingerprinting while presenting new coverage of fingerprint transfer and removal.
Prompt2Fingerprint: Plug-and-Play LLM Fingerprinting via Text-to-Weight Generation
cs.CR 2026-05 unverdicted novelty 6.0

P2F generates low-rank parameter increments for LLM fingerprinting directly from textual descriptions in a single forward pass.
SIF: Semantically In-Distribution Fingerprints for Large Vision-Language Models
cs.CV 2026-04 unverdicted novelty 6.0

SIF creates semantically in-distribution fingerprints for LVLMs by distilling text watermarks into visual inputs and optimizing for robustness against detection and modification.
Position: LLM Watermarking Should Align Stakeholders' Incentives for Practical Adoption
cs.CR 2025-10 unverdicted novelty 4.0

LLM watermarking adoption is limited by misaligned stakeholder incentives; incentive-aligned approaches such as in-context watermarking can enable practical use in targeted domains like education and peer review.