The Effect of the Ransomware Dataset Age on the Detection Accuracy of Machine Learning Models

被引:1
|
作者
Yaseen, Qussai M. [1 ,2 ]
机构
[1] Ajman Univ, Coll Engn & Informat Technol, Artificial Intelligence Res Ctr AIRC, Ajman 20550, U Arab Emirates
[2] Jordan Univ Sci & Technol, Fac Comp & Informat Technol, Dept Comp Informat Syst, Irbid 22110, Jordan
关键词
Android malware; information security; supervised machine learning; ransomware; MALWARE DETECTION; CLASSIFICATION;
D O I
10.3390/info14030193
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Several supervised machine learning models have been proposed and used to detect Android ransomware. These models were trained using different datasets from different sources. However, the age of the ransomware datasets was not considered when training and testing these models. Therefore, the detection accuracy for those models is inaccurate since they learned using features from specific ransomware, old or new ransomware, and they did not learn using diverse ransomware features from different ages. This paper sheds light on the importance of considering the age of ransomware datasets and its effects on the detection accuracy of supervised machine learning models. This proves that supervised machine learning models trained using new ransomware dataset are inefficient in detecting old types of ransomware and vice versa. Moreover, this paper collected a large and diverse dataset of ransomware applications that comprises new and old ransomware developed during the period 2008-2020. Furthermore, the paper proposes a supervised machine learning model that is trained and tested using the diverse dataset. The experiments show that the proposed model is efficient in detecting Android ransomware regardless of its age by achieving an accuracy of approximately 97.48%. Moreover, the results shows that the proposed model outperforms the state-of-the-art approaches considered in this work.
引用
收藏
页数:23
相关论文
共 50 条
  • [1] Dynamic Feature Dataset for Ransomware Detection Using Machine Learning Algorithms
    Herrera-Silva, Juan A.
    Hernandez-alvarez, Myriam
    [J]. SENSORS, 2023, 23 (03)
  • [2] RanSAP: An open dataset of ransomware storage access patterns for training machine learning models
    Hirano, Manabu
    Hodota, Ryo
    Kobayashi, Ryotaro
    [J]. FORENSIC SCIENCE INTERNATIONAL-DIGITAL INVESTIGATION, 2022, 40
  • [3] Preparing a Dataset of Ransomware BTC Addresses for Machine Learning Purpose
    Rodwald, Przemyslaw
    [J]. SYSTEM DEPENDABILITY-THEORY AND APPLICATIONS, DEPCOS-RELCOMEX 2024, 2024, 1026 : 227 - 236
  • [4] Machine Learning Algorithms and Frameworks in Ransomware Detection
    Smith, Daryle
    Khorsandroo, Sajad
    Roy, Kaushik
    [J]. IEEE ACCESS, 2022, 10 : 117597 - 117610
  • [5] Authentic Learning of Machine Learning to Ransomware Detection and Prevention
    Faruk, Md Jobair Hossain
    Masum, Mohammad
    Shahriar, Hossain
    Qian, Kai
    Lo, Dan
    [J]. 2022 IEEE 46TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE (COMPSAC 2022), 2022, : 442 - 443
  • [6] Ransomware Classification and Detection With Machine Learning Algorithms
    Masum, Mohammad
    Faruk, Md Jobair Hossain
    Shahriar, Hossain
    Qian, Kai
    Lo, Dan
    Adnan, Muhaiminul Islam
    [J]. 2022 IEEE 12TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2022, : 316 - 322
  • [7] Ransomware detection using machine learning algorithms
    Bae, Seong Il
    Lee, Gyu Bin
    Im, Eul Gyu
    [J]. CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2020, 32 (18):
  • [8] Ransomware Detection Using Machine Learning: A Survey
    Alraizza, Amjad
    Algarni, Abdulmohsen
    [J]. BIG DATA AND COGNITIVE COMPUTING, 2023, 7 (03)
  • [9] Early Ransomware Detection with Deep Learning Models
    Davidian, Matan
    Kiperberg, Michael
    Vanetik, Natalia
    [J]. FUTURE INTERNET, 2024, 16 (08)
  • [10] Change Point Detection with Machine Learning for Rapid Ransomware Detection
    Melaragno, Anthony
    Casey, William
    [J]. 2022 IEEE INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, INTL CONF ON CLOUD AND BIG DATA COMPUTING, INTL CONF ON CYBER SCIENCE AND TECHNOLOGY CONGRESS (DASC/PICOM/CBDCOM/CYBERSCITECH), 2022, : 154 - 162