Evasion Attacks against Machine Learning at Test Time

Battista Biggio; Blaine Nelson; Davide Maiorca; Fabio Roli; Giorgio Giacinto; Igino Corona; Nedim Srndic; Pavel Laskov

arxiv: 1708.06131 · v1 · pith:EJH5KR3Jnew · submitted 2017-08-21 · 💻 cs.CR · cs.LG

Evasion Attacks against Machine Learning at Test Time

Battista Biggio , Igino Corona , Davide Maiorca , Blaine Nelson , Nedim Srndic , Pavel Laskov , Giorgio Giacinto , Fabio Roli This is my paper

classification 💻 cs.CR cs.LG

keywords attackattacksclassifierevasionsecurityapproachlearningmachine

0 comments

read the original abstract

In security-sensitive applications, the success of machine learning depends on a thorough vetting of their resistance to adversarial data. In one pertinent, well-motivated attack scenario, an adversary may attempt to evade a deployed system at test time by carefully manipulating attack samples. In this work, we present a simple but effective gradient-based approach that can be exploited to systematically assess the security of several, widely-used classification algorithms against evasion attacks. Following a recently proposed framework for security evaluation, we simulate attack scenarios that exhibit different risk levels for the classifier by increasing the attacker's knowledge of the system and her ability to manipulate attack samples. This gives the classifier designer a better picture of the classifier performance under evasion attacks, and allows him to perform a more informed model selection (or parameter setting). We evaluate our approach on the relevant security task of malware detection in PDF files, and show that such systems can be easily evaded. We also sketch some countermeasures suggested by our analysis.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Stateful Detection of Black-Box Adversarial Attacks
cs.CR 2019-07 unverdicted novelty 7.0

The paper argues for stateful defenses over stateless ones to detect adversarial example generation via query history and introduces query blinding as a counter-attack.
AI Safety Landscape for Large Language Models: Taxonomy, State-of-the-art, and Future Directions
cs.AI 2024-08 unverdicted novelty 4.0

The paper introduces a taxonomy of AI safety for LLMs organized into Trustworthy AI, Responsible AI, and Safe AI perspectives, accompanied by a review of state-of-the-art methods, challenges, and future directions.