This repo contains the sample code for reproducing results of ICLR'22 Paper Understanding and Improving Graph Injection Attack by Promoting Unnoticeability.
Updates:
- July 23, 2022: We shared GIA-HAO in AI Time PhD Workshop for ICLR'22 (in Chinese)! The video link is here. 😆
Graph Injection Attack (GIA) recently emerges as a practical attack scenario. To understand it more, we conduct a friendly comparison between GIA and Graph Modification Attack (GMA) -- Another attack scenario widly studied in early works. We found that:
- Although GIA can be provably more harmful than GMA under certain assumptions, the severe damage to the homophily makes it easily defendable by homophily defenders (even implemented in a very simple way), which would bring unreliable evaluation about GNNs' robustness.
- Hence we introduced homophily unnoticeability for GIA to constrain the damage and proposed Harmonious Adversarial Objective (HAO) to mitigate this issue.
If you are interested, find out more in our paper, slides, or our poster!
Figure 1. Illustration of GIA-HAO.
Essentially, HAO can be implemented very conveniently, in 3 steps!
# step 1: propagate one step (including the injected nodes) without self-connection
# then we obtain the aggregated neighbor features
with torch.no_grad():
features_propagate = gcn_norm(adj_attack, add_self_loops=False) @ features_concat
features_propagate = features_propagate[n_total:]
# step 2: calculate the node-centric homophily (here we implement it with cosine similarity)
homophily = F.cosine_similarity(features_attack, features_propagate)
# step 3: add homophily to your original L_atk with a proper weight, then you make it!
pred_loss += disguise_coe*homophily.mean()
Note that it is only a minimal implementation example, you can also implement HAO with different homophily measures tailored for your own case : )
In experiments, we comprehensively examine the effectiveness of HAO with different combinations of various GIA methods, against 38 defense models on 6 benchmarks, under both non-target attack and targeted attack settings. To handle the extensive experiments, we implement both single test and batch test modes.
We provide a example in jupyter notebook on homophily distribution analysis. You may use it analyze the homophily distribution changes based on the generated perturbed graphs from our code.
The key libraries used in our experiments are:
grb==0.1.0
torch==1.9.0
torch-geometric==1.7.2
scipy==1.6.2
More information about the dependencies can be found in this issue. If you are using the latest pytorch-geometric libraries and encounter some issues, you may check out this issue (Thanks Runlin Lei from RUC for the update).
By default, all of the datasets can be loaded automatically, and be careful about the path settings in load_graph.py
.
All of the generated perturbed graph will be saved to /atkg
and make sure the folder is valid.
The perturbed graph will be named as dataset_attack.pt
for non-targeted attack, and dataset_attack_target.pt
for targeted attack.
We use --disguise_coe
to control the strength of HAO.
In a single attack test, by specifying an attack method, you can generate the perturbed graph and evaluate the robustness of GNNs based on it. Take grb-cora
as an example:
a. Generating perturbed graphs:
# Generating Perturbed Graph with PGD
python gnn_misg.py --dataset 'grb-cora' --inductive --eval_robo --eval_attack 'gia' --grb_mode 'full' --num_layers 3 --runs 1 --disguise_coe 0
# Generating Perturbed Graph with PGD+HAO
python gnn_misg.py --dataset 'grb-cora' --inductive --eval_robo --eval_attack 'gia' --grb_mode 'full' --num_layers 3 --runs 1 --disguise_coe 1
b. Evaluating blackbox test robustness:
# Evaluating blackbox test robustness with GCN
python gnn_misg.py --dataset 'grb-cora' --inductive --eval_robo --eval_attack 'gia' --grb_mode 'full' --num_layers 3 --runs 1 --eval_robo_blk
# Evaluating blackbox test robustness with EGuard
python gnn_misg.py --dataset 'grb-cora' --inductive --eval_robo --eval_attack 'gia' --grb_mode 'full' --model 'egnnguard' --num_layers 3 --eval_robo_blk --runs 1
c. Evaluating with targeted attack:
Simply add --eval_target
to the running commands will do the job.
By changing the parameters according to our paper, you can simply reproduce results for other benchmarks, or test with your own GNNs and benchmarks!
Moreover, you can also evaluate the robustness of different defense models against different attack methods in batch. Here we use --batch_eval
command in gnn_misg.py
to enable batch evaluations.
During each evaluation of GNN model, you can use batch_attacks
and report_batch
to specify the attacks that you want to evaluate and report. Take grb-cora
as an example:
mkdir atkg
bash run_cora.sh
The other running scripts are given in /scripts
folder.
Since our focus is to verify the issues surrounding the homophily unnoticeability, we didn't do exhaustive hyperparameter tuning with the injection strategies. We believe there are more interesting stuffs to navigate about these strategies, where we list a few in Appendix A of our paper. Welcome to check it out and play with our code base!
If you have any questions, please feel free to send me (Andrew) an email via: yqchen [at] cse (dot) cuhk (dot) edu (dot) hk. If you find our paper and repo useful, please cite our paper:
@inproceedings{chen2022hao,
title={Understanding and Improving Graph Injection Attack by Promoting Unnoticeability},
author={Yongqiang Chen and Han Yang and Yonggang Zhang and Kaili Ma and Tongliang Liu and Bo Han and James Cheng},
booktitle={International Conference on Learning Representations},
year={2022},
url={https://openreview.net/forum?id=wkMG8cdvh7-}
}
Ack: The readme is inspired by GSAT 😄.