arXiv preprint arXiv:2509.21057 (2025)

PMark: Towards Robust · 2025 · arXiv 2509.21057

5 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background: 1 · baseline: 1 · method: 1

citation-polarity summary

background: 1 · baseline: 1 · use method: 1

representative citing papers

KBF: Knowledge Boundary as Fingerprint for Language Model and Black-Box API Auditing

cs.CR · 2026-05-28 · unverdicted · novelty 7.0

KBF uses stable numerical recall near the knowledge boundary to fingerprint and audit black-box LLM APIs, successfully detecting all tested substitutions and some real-world inconsistencies across production endpoints.

Green-Red Watermarking for Recommender Systems

cs.IR · 2026-04-26 · unverdicted · novelty 7.0

GREW uses a secret-key-driven green-red item partition and three ranking-integrated modules to embed verifiable watermarks in recommender systems that resist extraction attacks without data injection.

Decoding by Perturbation: Mitigating MLLM Hallucinations via Dynamic Textual Perturbation

cs.CL · 2026-04-14 · unverdicted · novelty 7.0

DeP mitigates MLLM hallucinations by dynamically perturbing text prompts to identify and reinforce stable visual evidence regions while counteracting language prior biases using attention variance and logit statistics.

RLSpoofer: A Lightweight Evaluator for LLM Watermark Spoofing Resilience

cs.CR · 2026-04-13 · unverdicted · novelty 7.0

RLSpoofer trains a 4B model on 100 watermarked paraphrase pairs to spoof PF watermarks at 62% success rate, far exceeding baselines trained on up to 10,000 samples.

Global Sketch-Based Watermarking for Diffusion Language Models

cs.CR · 2026-06-03 · unverdicted · novelty 6.0

Introduces a sketch-based watermarking method for masked diffusion language models providing an order-agnostic detection statistic decoupled from local context.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Decoding by Perturbation: Mitigating MLLM Hallucinations via Dynamic Textual Perturbation

cs.CL · 2026-04-14 · unverdicted · none · ref 20

DeP mitigates MLLM hallucinations by dynamically perturbing text prompts to identify and reinforce stable visual evidence regions while counteracting language prior biases using attention variance and logit statistics.
