Recent advances in adversarial training for adversarial robustness

Tao Bai, Jinqi Luo, Jun Zhao, Bihan Wen, Qian Wang · 2021 · arXiv 2102.01356

10 Pith papers cite this work. Polarity classification is still indexing.

10 Pith papers citing it

read on arXiv browse 10 citing papers

citation-role summary

background 4

citation-polarity summary

background 4

representative citing papers

Uncovering and Understanding FPR Manipulation Attack in Industrial IoT Networks

cs.CR · 2026-01-20 · unverdicted · novelty 8.0

FPR manipulation attack perturbs benign MQTT packets to flip labels to attacks in NIDS with 80-100% success, increasing SOC delays without gradient-based methods.

Feature-level analysis and adversarial transfer in rotationally equivariant quantum machine learning

quant-ph · 2026-04-16 · unverdicted · novelty 7.0

Rotationally equivariant quantum models can rely on vulnerable invariant statistics such as ring-averaged intensities, leaving them susceptible to classical transfer attacks, but suppressing the associated symmetry sectors substantially improves robustness.

Improving Feasibility via Fast Autoencoder-Based Projections

cs.LG · 2026-04-03 · unverdicted · novelty 7.0

An adversarially trained autoencoder learns a convex latent space to enable rapid approximate projections that enforce nonconvex constraints in optimization and reinforcement learning.

Sensitivity as a Double-Edged Sword: A Trade-off Between Discriminability and Adversarial Robustness

cs.CV · 2026-06-01 · unverdicted · novelty 6.0

Identifies sensitivity as the source of both discriminability and vulnerability in FC classifiers versus robustness in l2 classifiers, and introduces HPM prototype fusion plus MSA evaluation to improve adversarial robustness.

CoNewsReader: Supporting Comprehensive Understanding and Raising Critical Thoughts on Social Media News Through Comments

cs.HC · 2026-04-30 · conditional · novelty 6.0 · 2 refs

CoNewsReader integrates user comments with an LLM to improve critical news reading on social media, with a 24-participant study showing gains in comprehension and critical thinking over baseline interfaces.

Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs

cs.LG · 2026-04-10 · unverdicted · novelty 6.0

DACO curates a 15,000-concept dictionary from 400K image-caption pairs and uses it to initialize an SAE that enables granular, concept-specific steering of MLLM activations, raising safety scores on MM-SafetyBench and JailBreakV while preserving general capabilities.

Quantum Patches: Enhancing Robustness of Quantum Machine Learning Models

quant-ph · 2026-04-09 · unverdicted · novelty 6.0

Random quantum circuits used as adversarial training data reduce successful attack rates on QML models for CIFAR-10 from 89.8% to 68.45% and for CINIC-10 from 94.23% to 78.68%.

Towards Reliable Forgetting: A Survey on Machine Unlearning Verification

cs.LG · 2025-06-18 · unverdicted · novelty 6.0

A survey that organizes machine unlearning verification methods into behavioral and parametric categories and outlines open problems.

Towards Robust Personalized Federated Learning: Vulnerability Assessment and Defense Co-Design

cs.LG · 2026-06-22 · unverdicted · novelty 5.0

Personalized federated learning shows heightened vulnerability to transfer-based adversarial attacks from malicious clients, addressed by a defense framework of stochastic input noise, input-scaled trace regularization, and parameter sensitivity maximization.

Auto-ART: Structured Literature Synthesis and Automated Adversarial Robustness Testing

cs.CR · 2026-04-22 · unverdicted · novelty 5.0

Auto-ART delivers the first structured synthesis of adversarial robustness consensus plus an executable multi-norm testing framework that flags gradient masking in 92% of cases on RobustBench and reveals a 23.5 pp robustness gap.

citing papers explorer

Showing 9 of 9 citing papers after filters.

Uncovering and Understanding FPR Manipulation Attack in Industrial IoT Networks cs.CR · 2026-01-20 · unverdicted · none · ref 74
FPR manipulation attack perturbs benign MQTT packets to flip labels to attacks in NIDS with 80-100% success, increasing SOC delays without gradient-based methods.
Feature-level analysis and adversarial transfer in rotationally equivariant quantum machine learning quant-ph · 2026-04-16 · unverdicted · none · ref 32
Rotationally equivariant quantum models can rely on vulnerable invariant statistics such as ring-averaged intensities, leaving them susceptible to classical transfer attacks, but suppressing the associated symmetry sectors substantially improves robustness.
Improving Feasibility via Fast Autoencoder-Based Projections cs.LG · 2026-04-03 · unverdicted · none · ref 1
An adversarially trained autoencoder learns a convex latent space to enable rapid approximate projections that enforce nonconvex constraints in optimization and reinforcement learning.
Sensitivity as a Double-Edged Sword: A Trade-off Between Discriminability and Adversarial Robustness cs.CV · 2026-06-01 · unverdicted · none · ref 6
Identifies sensitivity as the source of both discriminability and vulnerability in FC classifiers versus robustness in l2 classifiers, and introduces HPM prototype fusion plus MSA evaluation to improve adversarial robustness.
Dictionary-Aligned Concept Control for Safeguarding Multimodal LLMs cs.LG · 2026-04-10 · unverdicted · none · ref 6
DACO curates a 15,000-concept dictionary from 400K image-caption pairs and uses it to initialize an SAE that enables granular, concept-specific steering of MLLM activations, raising safety scores on MM-SafetyBench and JailBreakV while preserving general capabilities.
Quantum Patches: Enhancing Robustness of Quantum Machine Learning Models quant-ph · 2026-04-09 · unverdicted · none · ref 31
Random quantum circuits used as adversarial training data reduce successful attack rates on QML models for CIFAR-10 from 89.8% to 68.45% and for CINIC-10 from 94.23% to 78.68%.
Towards Reliable Forgetting: A Survey on Machine Unlearning Verification cs.LG · 2025-06-18 · unverdicted · none · ref 5
A survey that organizes machine unlearning verification methods into behavioral and parametric categories and outlines open problems.
Towards Robust Personalized Federated Learning: Vulnerability Assessment and Defense Co-Design cs.LG · 2026-06-22 · unverdicted · none · ref 1
Personalized federated learning shows heightened vulnerability to transfer-based adversarial attacks from malicious clients, addressed by a defense framework of stochastic input noise, input-scaled trace regularization, and parameter sensitivity maximization.
Auto-ART: Structured Literature Synthesis and Automated Adversarial Robustness Testing cs.CR · 2026-04-22 · unverdicted · none · ref 44
Auto-ART delivers the first structured synthesis of adversarial robustness consensus plus an executable multi-norm testing framework that flags gradient masking in 92% of cases on RobustBench and reveals a 23.5 pp robustness gap.

Recent advances in adversarial training for adversarial robustness

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer