Cross-Regional Malware Detection via Model Distilling and Federated Learning

被引:1
|
作者
Botacin, Marcus [1 ]
Gomes, Heitor [2 ]
机构
[1] Texas A&M Univ, College Stn, TX 77840 USA
[2] Victoria Univ Wellington, Wellington, New Zealand
关键词
malware; federated learning; model distilling;
D O I
10.1145/3678890.3678893
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine Learning (ML) is a key part of modern malware detection pipelines, but its application is not straightforward. It involves multiple practical challenges that are frequently unaddressed by the literature works. A key challenge is the heterogeneity of scenarios. Antivirus (AV) companies for instance operate under different performance constraints in the backend and in the endpoint, and with a diversity of datasets according to the country they operate in. In this paper, we evaluate the impact of these heterogeneous aspects by developing a classification pipeline for 3 datasets of 10K malware samples each collected by an AV company in the USA, Brazil, and Japan in the same period. We characterize the different requirements for these datasets and we show that a different number of features is required to reach the optimal detection rate in each scenario. We show that a global model combining the three datasets increases the detection of the three individual datasets. We propose using Federated Learning (FL) to build the global model and a distilling process to generate the local versions. We order the samples temporally to show that although retraining on concept drift detection helps recover the detection rate, only a FL approach can increase the detection rate.
引用
收藏
页码:97 / 113
页数:17
相关论文
共 50 条
  • [1] Cross-Regional Fraud Detection via Continual Learning With Knowledge Transfer
    Li, Yujie
    Yang, Xin
    Gao, Qiang
    Wang, Hao
    Zhang, Junbo
    Li, Tianrui
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (12) : 7865 - 7877
  • [2] An opinion based cross-regional meteorological event detection model
    Zhu, Yifan
    Chambua, James
    Lu, Hao
    Shi, Kaize
    Niu, Zhendong
    WEATHER, 2019, 74 (02) : 51 - 55
  • [3] A Cross-Regional Load Forecasting Method Based on a Pseudo-Distributed Federated Learning Strategy
    Deng, Jinsong
    Cai, Shaotang
    Wu, Weinong
    Jiang, Rong
    Deng, Hongyu
    Ma, Jinhua
    Luo, Yonghang
    IEEE ACCESS, 2025, 13 : 22446 - 22458
  • [4] Federated learning for malware detection in IoT devices
    Rey, Valerian
    Sanchez Sanchez, Pedro Miguel
    Huertas Celdran, Alberto
    Bovet, Gerome
    COMPUTER NETWORKS, 2022, 204
  • [5] SIM-FED: Secure IoT malware detection model with federated learning
    Nobakht, Mehrnoosh
    Javidan, Reza
    Pourebrahimi, Alireza
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 116
  • [6] Cross-Regional Oil Palm Tree Detection
    Wu, Wenzhao
    Zheng, Juepeng
    Fu, Haohuan
    Li, Weijia
    Yu, Le
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 248 - 257
  • [7] The intrinsic convenience of federated learning malware IoT detection
    Camerota, Chiara
    Pecorella, Tommaso
    Bagdanov, Andrew D.
    2024 20TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT, CNSM 2024, 2024,
  • [8] On the Impact of Heterogeneity on Federated Learning at the Edge with DGA Malware Detection
    Ma Viet Duc
    Nguyen Thanh Luan
    Nguyen Tien Tai
    Nguyen Pham Trung Hieu
    Nguyen Ngoc Minh
    Pham Trung Hieu
    Vu Hai
    Nguyen Huu Thanh
    ASIAN INTERNET ENGINEERING CONFERENCE, AINTEC 2024, 2024, : 10 - 17
  • [9] Comprehensive Android Malware Detection Based on Federated Learning Architecture
    Fang, Wenbo
    He, Junjiang
    Li, Wenshan
    Lan, Xiaolong
    Chen, Yang
    Li, Tao
    Huang, Jiwu
    Zhang, Linlin
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 18 : 3977 - 3990
  • [10] (POSTER) Federated Learning Assisted Model for Android Malware Detection using Gannet Optimization Algorithm
    Arya, Shikha
    Peddoju, Sateesh Kumar
    2024 20TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING IN SMART SYSTEMS AND THE INTERNET OF THINGS, DCOSS-IOT 2024, 2024, : 777 - 779