KDD2020
AdvMind: Inferring Adversary Intent of Black-Box Attacks
Ren Pang, Xinyang Zhang, Shouling Ji, Xiapu Luo, Ting Wang
24 citations
Abstract
Deep neural networks (DNNs) are inherently susceptible to adversarial attacks even under black-box settings, in which the adversary only has query access to the target models. In practice, while it may be possible to effectively detect such attacks (e.g., observing massive similar but non-identical queries), it is often challenging to exactly infer the adversary intent (e.g., the target class of the adversarial example the adversary attempts to craft) especially during early stages of the attacks, which is crucial for performing effective deterrence and remediation of the threats in many scenarios.