Disentangled Representation Learning in Heterogeneous Information Network for Large-scale Android Malware Detection in the COVID-19 Era and Beyond

Cited by: 0
Authors
Hou, Shifu [1]
Fan, Yujie [1]
Ju, Mingxuan [1]
Ye, Yanfang [1]
Wan, Wenqiang [2]
Wang, Kui [2]
Mei, Yinming [2]
Xiong, Qi [2]
Shao, Fudong [2]
Affiliations
[1] Case Western Reserve Univ, Dept Comp & Data Sci, Cleveland, OH 44106 USA
[2] Tencent, Tencent Secur Lab, Shenzhen, Guangdong, Peoples R China
Source
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2021 / Vol. 35
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In the fight against the COVID-19 pandemic, many social activities have moved online; society's overwhelming reliance on complex cyberspace makes its security more important than ever. In this paper, we propose and develop an intelligent system named Dr.HIN to protect users against evolving Android malware attacks in the COVID-19 era and beyond. In Dr.HIN, besides app content, we propose to consider higher-level semantics and social relations among apps, developers and mobile devices to comprehensively depict Android apps; we then introduce a structured heterogeneous information network (HIN) to model the complex relations and exploit a meta-path guided strategy to learn node (i.e., app) representations from the HIN. Because the representations of malware could be highly entangled with those of benign apps in the complex development ecosystem, learning the latent explanatory factors hidden in the HIN embeddings to detect evolving malware poses a new challenge. To address this challenge, we propose to integrate domain priors generated from different views (i.e., app content, app authorship, app installation) to devise an adversarial disentangler that separates the distinct, informative factors of variation hidden in the HIN embeddings for large-scale Android malware detection. This is the first attempt at disentangled representation learning in HIN data. Promising experimental results based on real sample collections from the security industry demonstrate the performance of Dr.HIN in evolving Android malware detection, by comparison with baselines and popular mobile security products.
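To make the "meta-path guided strategy" mentioned in the abstract more concrete, the following is a minimal sketch of a meta-path guided random walk over a toy HIN of apps, developers and devices. It is not the authors' implementation: the node names, the edge list, the function names and the choice of a plain random walk are all assumptions made for illustration, since the abstract does not specify these details.

```python
import random

# Toy HIN (all node names are hypothetical): each node has a type,
# and edges link apps to their developers and to devices that install them.
NODE_TYPE = {
    "app1": "app", "app2": "app", "app3": "app",
    "dev1": "developer", "dev2": "developer",
    "phone1": "device", "phone2": "device",
}
EDGES = [
    ("app1", "dev1"), ("app2", "dev1"), ("app3", "dev2"),
    ("app1", "phone1"), ("app2", "phone2"), ("app3", "phone2"),
]


def build_adjacency(edges):
    """Build an undirected adjacency list from a list of (u, v) edges."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, []).append(v)
        adj.setdefault(v, []).append(u)
    return adj


def meta_path_walk(adj, node_type, start, meta_path, walk_length, rng):
    """Random walk that only steps to neighbors whose type matches the
    next type in the meta-path; the meta-path is repeated cyclically,
    so it must start and end with the same node type."""
    if meta_path[0] != meta_path[-1]:
        raise ValueError("meta-path must start and end with the same type")
    if node_type[start] != meta_path[0]:
        raise ValueError("start node type must match the meta-path")
    cycle = len(meta_path) - 1
    walk = [start]
    step = 0
    while len(walk) < walk_length:
        next_type = meta_path[(step % cycle) + 1]
        candidates = [n for n in adj.get(walk[-1], [])
                      if node_type[n] == next_type]
        if not candidates:
            break  # dead end: no neighbor of the required type
        walk.append(rng.choice(candidates))
        step += 1
    return walk


if __name__ == "__main__":
    adj = build_adjacency(EDGES)
    rng = random.Random(0)
    # app -> developer -> app: apps related through shared authorship
    print(meta_path_walk(adj, NODE_TYPE, "app1",
                         ["app", "developer", "app"], 5, rng))
    # app -> device -> app: apps related through co-installation
    print(meta_path_walk(adj, NODE_TYPE, "app1",
                         ["app", "device", "app"], 5, rng))
```

Walks of this kind are commonly passed to a skip-gram style model to obtain node embeddings; whether Dr.HIN does exactly this is not stated in the abstract.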
Pages: 7754-7761
Page count: 8