ICLR2021
Policy-Driven Attack: Learning to Query for Hard-label Black-box Adversarial Examples
Ziang Yan, Yiwen Guo, Jian Liang, Changshui Zhang
19 citations
Abstract
Query-efficient reward assignment mechanism: 1. Determine distortion baseline level and 2. Jump along the direction by distance 3. Project resulted image into and then check if they are still adversarial 4. If both, reward is 2; if only , reward is 1; if neither, reward is 0