A Comprehensive Study on Efficient and Accurate Machine Learning-Based Malicious PE Detection

被引:2
|
作者
Barut, Onur [1 ]
Zhang, Tong [1 ]
Luo, Yan [2 ]
Li, Peilong [3 ]
机构
[1] Intel Corp, Network & Edge Grp, Santa Clara, CA 95054 USA
[2] Univ Massachusetts Lowell, Dept Elect & Comp Eng, Lowell, MA USA
[3] Elizabethtown Coll, Dept Comp Sci, Elizabethtown, PA USA
关键词
Malware Analysis; Ransomware Detection; Machine Learning; Feature Engineering;
D O I
10.1109/CCNC51644.2023.10060214
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
For safe and trustworthy digital services, fast and accurate malware detection is critical. Because of the financial rewards, ransomware assaults are one of the most commonly employed malware variants by cyber criminals. Because of the dynamic environment in which new malware variants arise on a regular basis, it is critical to maintain databases up-to-date in order to protect the digital world from ransomware threats. In this study, we curated the Ransomary dataset containing 2871 ransomware and 4208 benign PE files to allow researchers to use their own algorithms to accomplish fast and precise detection. We examined the Ransomary dataset and compared feature extraction and raw data techniques of static malware analysis. In the EMBER, DeepDetectNet, and Ransomary datasets, we found that effective feature selection with the LightGBM model can yield more than 0.99 AUC. Finally, we demonstrate that using raw data from the first 1KB of PE files may result in an accurate and extremely rapid response time. We intend to continuously expand Ransomary dataset and encourage more researchers to use static, dynamic, or hybrid analysis to identify ransomware more quickly and accurately.
引用
收藏
页数:4
相关论文
共 50 条
  • [31] Learning-Based Detection for Malicious Android Application Using Code Vectorization
    Liu, Lin
    Ren, Wang
    Xie, Feng
    Yi, Shengwei
    Yi, Junkai
    Jia, Peng
    SECURITY AND COMMUNICATION NETWORKS, 2021, 2021
  • [32] SandboxNet: A Learning-Based Malicious Application Detection Framework in SDN Networks
    Chi, Po-Wen
    Zheng, Yu
    Chang, Wei-Yang
    Wang, Ming-Hung
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2022, 38 (06) : 1189 - 1211
  • [33] A Comprehensive Analysis of Machine Learning- and Deep Learning-Based Solutions for DDoS Attack Detection in SDN
    Naziya Aslam
    Shashank Srivastava
    M. M. Gore
    Arabian Journal for Science and Engineering, 2024, 49 : 3533 - 3573
  • [34] Deep Learning-Based Malicious Account Detection in the Momo Social Network
    Wang, Jiaqi
    He, Xinlei
    Gong, Qingyuan
    Chen, Yang
    Wang, Tianyi
    Wang, Xin
    2018 27TH INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND NETWORKS (ICCCN), 2018,
  • [35] Machine learning-based wavelength detection system
    Kwon, Ik-Hyun
    Choi, Yong-Joon
    Ide, Tomoya
    Noda, Toshihiko
    Takahashi, Kazuhiro
    Sawada, Kazuaki
    JAPANESE JOURNAL OF APPLIED PHYSICS, 2025, 64 (01)
  • [36] Machine learning-based phishing attack detection
    Hossain S.
    Sarma D.
    Chakma R.J.
    International Journal of Advanced Computer Science and Applications, 2020, 11 (09): : 378 - 388
  • [37] Machine learning-based test smell detection
    Pontillo, Valeria
    d'Aragona, Dario Amoroso
    Pecorelli, Fabiano
    Di Nucci, Dario
    Ferrucci, Filomena
    Palomba, Fabio
    EMPIRICAL SOFTWARE ENGINEERING, 2024, 29 (02)
  • [38] Machine Learning-Based Phishing Attack Detection
    Hossain, Sohrab
    Sarma, Dhiman
    Chakma, Rana Joyti
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (09) : 378 - 388
  • [39] Machine Learning-Based Colorectal Cancer Detection
    Blanes-Vidal, Victoria
    Baatrup, Gunnar
    Nadimi, Esmaeil S.
    PROCEEDINGS OF THE 2018 CONFERENCE ON RESEARCH IN ADAPTIVE AND CONVERGENT SYSTEMS (RACS 2018), 2018, : 43 - 46
  • [40] Machine learning-based test smell detection
    Valeria Pontillo
    Dario Amoroso d’Aragona
    Fabiano Pecorelli
    Dario Di Nucci
    Filomena Ferrucci
    Fabio Palomba
    Empirical Software Engineering, 2024, 29