KDD2020

AdvMind: Inferring Adversary Intent of Black-Box Attacks

Ren Pang, Xinyang Zhang, Shouling Ji, Xiapu Luo, Ting Wang

24 citations

Abstract

Deep neural networks (DNNs) are inherently susceptible to adversarial attacks even under black-box settings, in which the adversary only has query access to the target models. In practice, while it may be possible to effectively detect such attacks (e.g., observing massive similar but non-identical queries), it is often challenging to exactly infer the adversary intent (e.g., the target class of the adversarial example the adversary attempts to craft) especially during early stages of the attacks, which is crucial for performing effective deterrence and remediation of the threats in many scenarios.