Adversarial Attacks on Variational Autoencoders

Eduardo Valle; George Gondim-Ribeiro; Pedro Tabacof

arxiv: 1806.04646 · v1 · pith:37Q6W6NZnew · submitted 2018-06-12 · 💻 cs.CV · cs.LG· cs.NE

Adversarial Attacks on Variational Autoencoders

George Gondim-Ribeiro , Pedro Tabacof , Eduardo Valle This is my paper

classification 💻 cs.CV cs.LGcs.NE

keywords attacksautoencodersadversarialattentiondrawresistancethreevariational

0 comments

read the original abstract

Adversarial attacks are malicious inputs that derail machine-learning models. We propose a scheme to attack autoencoders, as well as a quantitative evaluation framework that correlates well with the qualitative assessment of the attacks. We assess --- with statistically validated experiments --- the resistance to attacks of three variational autoencoders (simple, convolutional, and DRAW) in three datasets (MNIST, SVHN, CelebA), showing that both DRAW's recurrence and attention mechanism lead to better resistance. As autoencoders are proposed for compressing data --- a scenario in which their safety is paramount --- we expect more attention will be given to adversarial attacks on them.

This paper has not been read by Pith yet.

Adversarial Attacks on Variational Autoencoders

discussion (0)