arXiv preprint arXiv:2304.06929 , year=

· 2024 · arXiv 2304.06929

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

representative citing papers

Detecting Pretraining Data from Large Language Models

cs.CL · 2023-10-25 · conditional · novelty 7.0

Min-K% Prob detects pretraining data in LLMs by flagging outlier low-probability words in text, achieving 7.4% better performance than prior methods on the new WIKIMIA benchmark.

MC-PDD: Masked Corpus-Level Pretraining Data Detection for Black-Box Large Language Models

cs.CL · 2026-06-06 · unverdicted · novelty 6.0

A masked-token hit-rate comparison method detects pretraining data membership in black-box LLMs with performance comparable to white-box approaches.

TADP-RME: A Trust-Adaptive Differential Privacy Framework for Enhancing Reliability of Data-Driven Systems

cs.CR · 2026-04-09 · unverdicted · novelty 4.0

TADP-RME adapts the privacy budget via inverse trust scores in [0,1] and uses reverse manifold embedding to reduce inference attack success rates by up to 3.1% while preserving formal differential privacy guarantees.

citing papers explorer

Showing 3 of 3 citing papers.

Detecting Pretraining Data from Large Language Models cs.CL · 2023-10-25 · conditional · none · ref 111
Min-K% Prob detects pretraining data in LLMs by flagging outlier low-probability words in text, achieving 7.4% better performance than prior methods on the new WIKIMIA benchmark.
MC-PDD: Masked Corpus-Level Pretraining Data Detection for Black-Box Large Language Models cs.CL · 2026-06-06 · unverdicted · none · ref 16
A masked-token hit-rate comparison method detects pretraining data membership in black-box LLMs with performance comparable to white-box approaches.
TADP-RME: A Trust-Adaptive Differential Privacy Framework for Enhancing Reliability of Data-Driven Systems cs.CR · 2026-04-09 · unverdicted · none · ref 21
TADP-RME adapts the privacy budget via inverse trust scores in [0,1] and uses reverse manifold embedding to reduce inference attack success rates by up to 3.1% while preserving formal differential privacy guarantees.

arXiv preprint arXiv:2304.06929 , year=

fields

years

verdicts

representative citing papers

citing papers explorer