Compressing Deep Graph Neural Networks via Adversarial Knowledge Distillation

Cited by: 12
Authors
He, Huarui [1]
Wang, Jie [1,2]
Zhang, Zhanqiu [1]
Wu, Feng [1]
Affiliations
[1] University of Science and Technology of China, Hefei, China
[2] Institute of Artificial Intelligence, Hefei Comprehensive National Science Center, Hefei, China
Keywords
Graph Neural Networks; Knowledge Distillation; Adversarial Training; Network Compression
DOI
10.1145/3534678.3539315
Chinese Library Classification (CLC)
TP [Automation Technology, Computer Technology]
Discipline Classification Code
0812
Abstract
Deep graph neural networks (GNNs) have been shown to be expressive for modeling graph-structured data. Nevertheless, the over-stacked architecture of deep graph models makes them difficult to deploy and test rapidly on mobile or embedded systems. To compress over-stacked GNNs, knowledge distillation via a teacher-student architecture has proven to be an effective technique, where the key step is to measure the discrepancy between teacher and student networks with predefined distance functions. However, using the same distance for graphs of various structures may be ill-suited, and the optimal distance formulation is hard to determine. To tackle these problems, we propose a novel Adversarial Knowledge Distillation framework for graph models named GraphAKD, which adversarially trains a discriminator and a generator to adaptively detect and decrease the discrepancy. Specifically, noticing that well-captured inter-node and inter-class correlations underpin the success of deep GNNs, we propose to criticize the inherited knowledge from node-level and class-level views with a trainable discriminator. The discriminator distinguishes between teacher knowledge and what the student inherits, while the student GNN works as a generator and aims to fool the discriminator. Experiments on node-level and graph-level classification benchmarks demonstrate that GraphAKD improves student performance by a large margin. The results imply that GraphAKD can precisely transfer knowledge from a complicated teacher GNN to a compact student GNN.
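To make the adversarial distillation idea in the abstract concrete, below is a minimal PyTorch-style sketch of a teacher-student training loop with a trainable discriminator. It is not the authors' GraphAKD implementation: the discriminator architecture, the assumed GNN interface `model(x, edge_index)`, and all names (`LogitDiscriminator`, `train_step`, the optimizers) are illustrative assumptions, and only the node-level (per-node logit) view is shown; the paper additionally criticizes class-level knowledge.

```python
# Minimal sketch of adversarial knowledge distillation for GNNs.
# Hypothetical names and interfaces; not the authors' GraphAKD code.
# Assumes `teacher` and `student` are node-classification GNNs that
# map (x, edge_index) to per-node logits of shape [N, num_classes].
import torch
import torch.nn as nn
import torch.nn.functional as F

class LogitDiscriminator(nn.Module):
    """Scores per-node logits as 'teacher' (real) vs. 'student' (fake)."""
    def __init__(self, num_classes, hidden=64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(num_classes, hidden), nn.LeakyReLU(0.2),
            nn.Linear(hidden, 1),
        )

    def forward(self, logits):
        return self.net(logits)  # one real-vs-fake score per node

def train_step(teacher, student, disc, opt_s, opt_d, x, edge_index, y, mask):
    # Frozen teacher provides the knowledge to be inherited.
    with torch.no_grad():
        t_logits = teacher(x, edge_index)
    s_logits = student(x, edge_index)

    # 1) Discriminator step: push teacher logits toward 1, student logits toward 0.
    d_real = disc(t_logits)
    d_fake = disc(s_logits.detach())
    loss_d = (F.binary_cross_entropy_with_logits(d_real, torch.ones_like(d_real))
              + F.binary_cross_entropy_with_logits(d_fake, torch.zeros_like(d_fake)))
    opt_d.zero_grad(); loss_d.backward(); opt_d.step()

    # 2) Student (generator) step: fool the discriminator and fit the labels.
    d_fake = disc(s_logits)
    loss_adv = F.binary_cross_entropy_with_logits(d_fake, torch.ones_like(d_fake))
    loss_ce = F.cross_entropy(s_logits[mask], y[mask])  # supervised loss on labeled nodes
    loss_s = loss_ce + loss_adv
    opt_s.zero_grad(); loss_s.backward(); opt_s.step()
    return loss_d.item(), loss_s.item()
```

The design choice this sketch illustrates is the one the abstract argues for: instead of a fixed, predefined distance between teacher and student outputs, a learned discriminator decides what teacher-like knowledge looks like, and the student is optimized until the discriminator can no longer tell its outputs apart from the teacher's.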
Pages: 534-544
Page count: 11
Related Papers
50 records in total
  • [1] Online adversarial knowledge distillation for graph neural networks
    Wang, Can
    Wang, Zhe
    Chen, Defang
    Zhou, Sheng
    Feng, Yan
    Chen, Chun
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [2] Accelerating Molecular Graph Neural Networks via Knowledge Distillation
    Kelvinius, Filip Ekstrom
    Georgiev, Dimitar
    Toshev, Artur Petrov
    Gasteiger, Johannes
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36, NEURIPS 2023, 2023,
  • [3] Boosting Graph Neural Networks via Adaptive Knowledge Distillation
    Guo, Zhichun
    Zhang, Chunhui
    Fan, Yujie
    Tian, Yijun
    Zhang, Chuxu
    Chawla, Nitesh V.
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023, : 7793 - 7801
  • [4] EGNN: Constructing explainable graph neural networks via knowledge distillation
    Li, Yuan
    Liu, Li
    Wang, Guoyin
    Du, Yong
    Chen, Penggang
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 241
  • [5] On Representation Knowledge Distillation for Graph Neural Networks
    Joshi, Chaitanya K.
    Liu, Fayao
    Xun, Xu
    Lin, Jie
    Foo, Chuan Sheng
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (04) : 4656 - 4667
  • [6] Compressing deep graph convolution network with multi-staged knowledge distillation
    Kim, Junghun
    Jung, Jinhong
    Kang, U.
    [J]. PLOS ONE, 2021, 16 (08):
  • [7] Graph-Free Knowledge Distillation for Graph Neural Networks
    Deng, Xiang
    Zhang, Zhongfei
    [J]. PROCEEDINGS OF THE THIRTIETH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2021, 2021, : 2321 - 2327
  • [8] Online cross-layer knowledge distillation on graph neural networks with deep supervision
    Guo, Jiongyu
    Chen, Defang
    Wang, Can
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (30) : 22359 - 22374
  • [9] RELIANT: Fair Knowledge Distillation for Graph Neural Networks
    Dong, Yushun
    Zhang, Binchi
    Yuan, Yiling
    Zou, Na
    Wang, Qi
    Li, Jundong
    [J]. PROCEEDINGS OF THE 2023 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2023, : 154 - +