Abstract
Previous works have proven the superior performance of ensemble-based black-box attacks on transferability. However, existing methods require significant difference in architecture among the source models to ensure gradient diversity. In this paper, we propose a Diverse Gradient Method (DGM), verifying that knowledge distillation is able to generate diverse gradients from unchangeable model architecture for boosting transferability. The core idea behind our DGM is to obtain transferable adversarial perturbations by fusing diverse gradients provided by a single source model and its distilled versions through an ensemble strategy. Experimental results show that DGM successfully crafts adversarial examples with higher transferability, only requiring extremely low training cost. Furthermore, our proposed method could be used as a flexible module to improve transferability of most of existing black-box attacks.
Original language | English |
---|---|
Title of host publication | IJCNN 2023 - International Joint Conference on Neural Networks |
Subtitle of host publication | Proceedings |
Publisher | IEEE Institute of Electrical and Electronic Engineers |
Number of pages | 9 |
ISBN (Electronic) | 978-1-6654-8867-9 |
ISBN (Print) | 978-1-6654-8868-6 |
DOIs | |
Publication status | Published - 2 Aug 2023 |
MoE publication type | A4 Article in a conference publication |
Event | International Joint Conference on Neural Networks, IJCNN 2023 - Gold Coast, Australia Duration: 18 Jun 2023 → 23 Jun 2023 |
Publication series
Series | Proceedings of the International Joint Conference on Neural Networks |
---|---|
Volume | 2023-June |
Conference
Conference | International Joint Conference on Neural Networks, IJCNN 2023 |
---|---|
Country/Territory | Australia |
City | Gold Coast |
Period | 18/06/23 → 23/06/23 |
Funding
VI. ACKNOWLEDGEMENT This research was funded by the National Natural Science Foundation of China under Grant 61972092, the Collaborative Innovation Major Project of Zhengzhou (20XTZX06013) and the Strategic Research and Consulting Project of Chinese Academy of Engineering (No. 2022HENYB03).
Keywords
- Adversarial examples
- Black-box attack
- Gradient diversity
- Transferability