Measuring Re-identification Risk

Adel Javanmard; Alessandro Epasto; Andres Munoz Medina; CJ Carey; Gabriel Henrique Nunes; Josh Karlin; Peilin Zhong; Sergei Vassilvitskii; Shankar Kumar; Travis Dick

arxiv: 2304.07210 · v2 · pith:UTLWNAFXnew · submitted 2023-04-12 · 💻 cs.CR · cs.LG

Measuring Re-identification Risk

CJ Carey , Travis Dick , Alessandro Epasto , Adel Javanmard , Josh Karlin , Shankar Kumar , Andres Munoz Medina , Vahab Mirrokni

show 3 more authors

Gabriel Henrique Nunes Sergei Vassilvitskii Peilin Zhong

This is my paper

classification 💻 cs.CR cs.LG

keywords re-identificationframeworkriskuserapplicationsboundsmeasurereal-world

0 comments

read the original abstract

Compact user representations (such as embeddings) form the backbone of personalization services. In this work, we present a new theoretical framework to measure re-identification risk in such user representations. Our framework, based on hypothesis testing, formally bounds the probability that an attacker may be able to obtain the identity of a user from their representation. As an application, we show how our framework is general enough to model important real-world applications such as the Chrome's Topics API for interest-based advertising. We complement our theoretical bounds by showing provably good attack algorithms for re-identification that we use to estimate the re-identification risk in the Topics API. We believe this work provides a rigorous and interpretable notion of re-identification risk and a framework to measure it that can be used to inform real-world applications.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Lessons from the Adoption and Deprecation of the Privacy Sandbox Web APIs
cs.CR 2026-06 unverdicted novelty 7.0

Longitudinal measurement study finds limited and uneven adoption of Privacy Sandbox APIs across websites and Chrome users, yielding lessons and recommendations after the project's 2025 cancellation.