Cross-Regional Malware Detection via Model Distilling and Federated Learning

被引:1
|
作者
Botacin, Marcus [1 ]
Gomes, Heitor [2 ]
机构
[1] Texas A&M Univ, College Stn, TX 77840 USA
[2] Victoria Univ Wellington, Wellington, New Zealand
关键词
malware; federated learning; model distilling;
D O I
10.1145/3678890.3678893
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine Learning (ML) is a key part of modern malware detection pipelines, but its application is not straightforward. It involves multiple practical challenges that are frequently unaddressed by the literature works. A key challenge is the heterogeneity of scenarios. Antivirus (AV) companies for instance operate under different performance constraints in the backend and in the endpoint, and with a diversity of datasets according to the country they operate in. In this paper, we evaluate the impact of these heterogeneous aspects by developing a classification pipeline for 3 datasets of 10K malware samples each collected by an AV company in the USA, Brazil, and Japan in the same period. We characterize the different requirements for these datasets and we show that a different number of features is required to reach the optimal detection rate in each scenario. We show that a global model combining the three datasets increases the detection of the three individual datasets. We propose using Federated Learning (FL) to build the global model and a distilling process to generate the local versions. We order the samples temporally to show that although retraining on concept drift detection helps recover the detection rate, only a FL approach can increase the detection rate.
引用
收藏
页码:97 / 113
页数:17
相关论文
共 50 条
  • [31] Cross-regional oil palm tree counting and detection via a multi-level attention domain adaptation network
    Zheng, Juepeng
    Fu, Haohuan
    Li, Weijia
    Wu, Wenzhao
    Zhao, Yi
    Dong, Runmin
    Yu, Le
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2020, 167 : 154 - 177
  • [32] Federated Model Search via Reinforcement Learning
    Yao, Dixi
    Wang, Lingdong
    Xu, Jiayu
    Xiang, Liyao
    Shao, Shuo
    Chen, Yingqi
    Tong, Yanjun
    2021 IEEE 41ST INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2021), 2021, : 830 - 840
  • [33] FedMe: Federated Learning via Model Exchange
    Matsuda, Koji
    Sasaki, Yuya
    Xiao, Chuan
    Onizuka, Makoto
    PROCEEDINGS OF THE 2022 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2022, : 459 - 467
  • [34] Light-Weight Federated Transfer Learning Approach to Malware Detection on Computational Edges
    Mittal, Sakshi
    Rajvanshi, Prateek
    Ul Amin, Riaz
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2025, 16 (02) : 12 - 19
  • [35] Unified Transmission Pricing Model for Cross-regional Electricity Trading Considering Reliability
    Huang H.
    Yang D.
    He M.
    Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2017, 41 (17): : 51 - 59
  • [36] Model of Force-Assembling for Cross-Regional Fire-Fighting and Rescue
    Cheng, Xiaohong
    Kang, Qingchun
    Xia, Dengyou
    Jia, Dingduo
    Liu, Jing
    NEW PERSPECTIVES ON RISK ANALYSIS AND CRISIS RESPONSE, 2009, : 527 - 532
  • [37] Efficient Request Scheduling in Cross-Regional Edge Collaboration via Digital Twin Networks
    Liang, Yuzhu
    Li, Guo
    Guo, Jianxiong
    Liu, Qin
    Zheng, Xi
    Wang, Tian
    2024 IEEE/ACM 32ND INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE, IWQOS, 2024,
  • [38] BeiDou satellites cross-regional communication path assignment model and resource management
    Liu, Sheng
    Wu, Di
    Zhang, Lanyong
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2021, 17 (07):
  • [39] Strategic human resources, innovation and entrepreneurship fit - A cross-regional comparative model
    Wang, ZM
    Zang, Z
    INTERNATIONAL JOURNAL OF MANPOWER, 2005, 26 (06) : 544 - 559
  • [40] BeiDou satellites cross-regional communication path assignment model and resource management
    Liu, Sheng
    Wu, Di
    Zhang, Lanyong
    International Journal of Distributed Sensor Networks, 2021, 17 (07)