Disentangled Representation Learning in Heterogeneous Information Network for Large-scale Android Malware Detection in the COVID-19 Era and Beyond

被引:0
|
作者
Hou, Shifu [1 ]
Fan, Yujie [1 ]
Ju, Mingxuan [1 ]
Ye, Yanfang [1 ]
Wan, Wenqiang [2 ]
Wang, Kui [2 ]
Mei, Yinming [2 ]
Xiong, Qi [2 ]
Shao, Fudong [2 ]
机构
[1] Case Western Reserve Univ, Dept Comp & Data Sci, Cleveland, OH 44106 USA
[2] Tencent, Tencent Secur Lab, Shenzhen, Guangdong, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the fight against the COVID-19 pandemic, many social activities have moved online; society's overwhelming reliance on the complex cyberspace makes its security more important than ever. In this paper, we propose and develop an intelligent system named Dr.HIN to protect users against the evolving Android malware attacks in the COVID-19 era and beyond. In Dr.HIN, besides app content, we propose to consider higher-level semantics and social relations among apps, developers and mobile devices to comprehensively depict Android apps; and then we introduce a structured heterogeneous information network (HIN) to model the complex relations and exploit meta-path guided strategy to learn node (i.e., app) representations from HIN. As the representations of malware could be highly entangled with benign apps in the complex ecosystem of development, it poses a new challenge of learning the latent explanatory factors hidden in the HIN embeddings to detect the evolving malware. To address this challenge, we propose to integrate domain priors generated from different views (i.e., app content, app authorship, app installation) to devise an adversarial disentangler to separate the distinct, informative factors of variations hidden in the HIN embeddings for large-scale Android malware detection. This is the first attempt of disentangled representation learning in HIN data. Promising experimental results based on real sample collections from security industry demonstrate the performance of Dr.HIN in evolving Android malware detection, by comparison with baselines and popular mobile security products.
引用
收藏
页码:7754 / 7761
页数:8
相关论文
共 50 条
  • [1] A reinforcement learning malware detection model based on heterogeneous information network path representation
    Yang, Kang
    Cai, Lizhi
    Wu, Jianhua
    Liu, Zhenyu
    Zhang, Meng
    APPLIED INTELLIGENCE, 2025, 55 (06)
  • [2] Dr.Emotion: Disentangled Representation Learning for Emotion Analysis on Social Media to Improve Community Resilience in the COVID-19 Era and Beyond
    Ju, Mingxuan
    Song, Wei
    Sun, Shiyu
    Ye, Yanfang
    Fan, Yujie
    Hou, Shifu
    Loparo, Kenneth
    Zhao, Liang
    PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 518 - 528
  • [3] Android Malware Detection Based on Heterogeneous Information Network with Cross-Layer Features
    Xixuan, Ren
    Lirui, Zhao
    Kai, Wang
    Zhixing, Xue
    Anran, Hou
    Qiao, Shao
    2022 19th International Computer Conference on Wavelet Active Media Technology and Information Processing, ICCWAMTIP 2022, 2022,
  • [4] HinDroid: An Intelligent Android Malware Detection System Based on Structured Heterogeneous Information Network
    Hou, Shifu
    Ye, Yanfang
    Song, Yangqiu
    Abdulhayoglu, Melih
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 1507 - 1515
  • [5] ANDROID MALWARE DETECTION BASED ON HETEROGENEOUS INFORMATION NETWORK WITH CROSS-LAYER FEATURES
    Ren Xixuan
    Zhao Lirui
    Wang Kai
    Xue Zhixing
    Hou Anran
    Shao Qiao
    2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
  • [6] AMDetector: Detecting Large-Scale and Novel Android Malware Traffic with Meta-learning
    Li, Wenhao
    Bao, Huaifeng
    Zhang, Xiao-Yu
    Li, Lin
    COMPUTATIONAL SCIENCE, ICCS 2022, PT IV, 2022, : 387 - 401
  • [7] Comparing Classifiers: A Look at Machine-Learning and the Detection of Mobile Malware in COVID-19 Android Mobile Applications
    Johnson, Seth
    Donner, Ray
    Perez, Alfredo J.
    PROCEEDINGS OF THE 2023 INTERNATIONAL SYMPOSIUM ON THEORY, ALGORITHMIC FOUNDATIONS, AND PROTOCOL DESIGN FOR MOBILE NETWORKS AND MOBILE COMPUTING, MOBIHOC 2023, 2023, : 498 - 503
  • [8] FeatNet: Large-scale Fraud Device Detection by Network Representation Learning with Rich Features
    Xu, Chao
    Feng, Zhentan
    Chen, Yizheng
    Wang, Minghua
    Wei, Tao
    AISEC'18: PROCEEDINGS OF THE 11TH ACM WORKSHOP ON ARTIFICIAL INTELLIGENCE AND SECURITY, 2018, : 57 - 63
  • [9] PartNRL: Partial Nodes Representation Learning in Large-Scale Network
    Li, Juan-Hui
    Huang, Ling
    Wang, Chang-Dong
    Huang, Dong
    Lai, Jian-Huang
    IEEE ACCESS, 2019, 7 : 56457 - 56468
  • [10] GROUP TESTING FOR LARGE-SCALE COVID-19 SCREENING
    Zahrouni, Wassim
    Kamoun, Hichem
    JOURNAL OF DECISION SYSTEMS, 2022, 32 (01) : 162 - 176