Disentangled Representation Learning in Heterogeneous Information Network for Large-scale Android Malware Detection in the COVID-19 Era and Beyond

被引：0

作者：

Hou, Shifu ^{[1
]}

Fan, Yujie ^{[1
]}

Ju, Mingxuan ^{[1
]}

Ye, Yanfang ^{[1
]}

Wan, Wenqiang ^{[2
]}

Wang, Kui ^{[2
]}

Mei, Yinming ^{[2
]}

Xiong, Qi ^{[2
]}

Shao, Fudong ^{[2
]}

机构：

[1] Case Western Reserve Univ, Dept Comp & Data Sci, Cleveland, OH 44106 USA

[2] Tencent, Tencent Secur Lab, Shenzhen, Guangdong, Peoples R China

来源：

THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2021年 / 35卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the fight against the COVID-19 pandemic, many social activities have moved online; society's overwhelming reliance on the complex cyberspace makes its security more important than ever. In this paper, we propose and develop an intelligent system named Dr.HIN to protect users against the evolving Android malware attacks in the COVID-19 era and beyond. In Dr.HIN, besides app content, we propose to consider higher-level semantics and social relations among apps, developers and mobile devices to comprehensively depict Android apps; and then we introduce a structured heterogeneous information network (HIN) to model the complex relations and exploit meta-path guided strategy to learn node (i.e., app) representations from HIN. As the representations of malware could be highly entangled with benign apps in the complex ecosystem of development, it poses a new challenge of learning the latent explanatory factors hidden in the HIN embeddings to detect the evolving malware. To address this challenge, we propose to integrate domain priors generated from different views (i.e., app content, app authorship, app installation) to devise an adversarial disentangler to separate the distinct, informative factors of variations hidden in the HIN embeddings for large-scale Android malware detection. This is the first attempt of disentangled representation learning in HIN data. Promising experimental results based on real sample collections from security industry demonstrate the performance of Dr.HIN in evolving Android malware detection, by comparison with baselines and popular mobile security products.

引用

页码：7754 / 7761

页数：8

共 50 条

[1] A reinforcement learning malware detection model based on heterogeneous information network path representation
Yang, Kang
Cai, Lizhi
Wu, Jianhua
Liu, Zhenyu
Zhang, Meng
APPLIED INTELLIGENCE, 2025, 55 (06)
[2] Dr.Emotion: Disentangled Representation Learning for Emotion Analysis on Social Media to Improve Community Resilience in the COVID-19 Era and Beyond
Ju, Mingxuan
Song, Wei
Sun, Shiyu
Ye, Yanfang
Fan, Yujie
Hou, Shifu
Loparo, Kenneth
Zhao, Liang
PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 518 - 528
[3] Android Malware Detection Based on Heterogeneous Information Network with Cross-Layer Features
Xixuan, Ren
Lirui, Zhao
Kai, Wang
Zhixing, Xue
Anran, Hou
Qiao, Shao
2022 19th International Computer Conference on Wavelet Active Media Technology and Information Processing, ICCWAMTIP 2022, 2022,
[4] HinDroid: An Intelligent Android Malware Detection System Based on Structured Heterogeneous Information Network
Hou, Shifu
Ye, Yanfang
Song, Yangqiu
Abdulhayoglu, Melih
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 1507 - 1515
[5] ANDROID MALWARE DETECTION BASED ON HETEROGENEOUS INFORMATION NETWORK WITH CROSS-LAYER FEATURES
Ren Xixuan
Zhao Lirui
Wang Kai
Xue Zhixing
Hou Anran
Shao Qiao
2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
[6] AMDetector: Detecting Large-Scale and Novel Android Malware Traffic with Meta-learning
Li, Wenhao
Bao, Huaifeng
Zhang, Xiao-Yu
Li, Lin
COMPUTATIONAL SCIENCE, ICCS 2022, PT IV, 2022, : 387 - 401
[7] Comparing Classifiers: A Look at Machine-Learning and the Detection of Mobile Malware in COVID-19 Android Mobile Applications
Johnson, Seth
Donner, Ray
Perez, Alfredo J.
PROCEEDINGS OF THE 2023 INTERNATIONAL SYMPOSIUM ON THEORY, ALGORITHMIC FOUNDATIONS, AND PROTOCOL DESIGN FOR MOBILE NETWORKS AND MOBILE COMPUTING, MOBIHOC 2023, 2023, : 498 - 503
[8] FeatNet: Large-scale Fraud Device Detection by Network Representation Learning with Rich Features
Xu, Chao
Feng, Zhentan
Chen, Yizheng
Wang, Minghua
Wei, Tao
AISEC'18: PROCEEDINGS OF THE 11TH ACM WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, 2018, : 57 - 63
[9] PartNRL: Partial Nodes Representation Learning in Large-Scale Network
Li, Juan-Hui
Huang, Ling
Wang, Chang-Dong
Huang, Dong
Lai, Jian-Huang
IEEE ACCESS, 2019, 7 : 56457 - 56468
[10] GROUP TESTING FOR LARGE-SCALE COVID-19 SCREENING
Zahrouni, Wassim
Kamoun, Hichem
JOURNAL OF DECISION SYSTEMS, 2022, 32 (01) : 162 - 176

← 1 2 3 4 5 →