Detection of malicious software by analyzing the behavioral artifacts using machine learning algorithms

被引:35
|
作者
Singh, Jagsir [1 ]
Singh, Jaswinder [1 ]
机构
[1] Punjabi Univ Patiala, Dept Comp Sci & Engn, Patiala, Punjab, India
关键词
Dynamic analysis; Malware; Machine learning algorithms; Random Forest; Static analysis; MALWARE; CLASSIFICATION; SECURITY; FRAMEWORK; FAMILIES; PRIVACY; WRAPPER; SYSTEMS;
D O I
10.1016/j.infsof.2020.106273
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Malicious software deliberately affects the computer systems. Malware are analyzed using static or dynamic analysis techniques. Using these techniques, unique patterns are extracted to detect malware correctly. In this paper, a behavior-based malware detection technique is proposed. Various runtime features are extracted by setting up a dynamic analysis environment using the Cuckoo sandbox. Three primary features are processed for developing malware classifier. Firstly, printable strings are processed word by word using text mining techniques which produced a very high dimension matrix of the string features. Then we apply the singular value decomposition technique for reducing dimensions of string features. Secondly, Shannon entropy is computed over the printable strings and API calls to consider the randomness of API and PSI features. In addition to these features, behavioral features regarding file operations, registry key modification and network activities are used in malware detection. Finally, all features are integrated in the training feature set to develop the malware classifiers using the machine learning algorithms. The proposed technique is validated with 16489 malware and 8422 benign files. Our experimental results show the accuracy of 99.54% in malware detection using ensemble machine learning algorithms. Moreover, it aims to develop a behavior-based malware detection technique of high accuracy by processing the runtime features in a new way.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Detection of Malicious Software by Analyzing Distinct Artifacts Using Machine Learning and Deep Learning Algorithms
    Ashik, Mathew
    Jyothish, A.
    Anandaram, S.
    Vinod, P.
    Mercaldo, Francesco
    Martinelli, Fabio
    Santone, Antonella
    [J]. ELECTRONICS, 2021, 10 (14)
  • [2] MalDC: Malicious Software Detection and Classification using Machine Learning
    Moon, Jaewoong
    Kim, Subin
    Jangyong, Park
    Lee, Jieun
    Kim, Kyungshin
    Song, Jaeseung
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2022, 16 (05): : 1466 - 1488
  • [3] Detection analysis of malicious cyber attacks using machine learning algorithms
    Karthika, R. A.
    Maheswari, M.
    [J]. MATERIALS TODAY-PROCEEDINGS, 2022, 68 : 26 - 34
  • [4] On the Feasibility of Supervised Machine Learning for the Detection of Malicious Software Packages
    Ohm, Marc
    Boes, Felix
    Bungartz, Christian
    Meier, Michael
    [J]. PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY AND SECURITY, ARES 2022, 2022,
  • [5] Detection of malicious URLs using machine learning
    Reyes-Dorta, Nuria
    Caballero-Gil, Pino
    Rosa-Remedios, Carlos
    [J]. WIRELESS NETWORKS, 2024,
  • [6] Malicious URL Detection Using Machine Learning
    Hani, Dr Raed Bani
    Amoura, Motasem
    Ammourah, Mohammad
    Abu Khalil, Yazeed
    [J]. 2024 15TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS, ICICS 2024, 2024,
  • [7] Malicious Network Traffic Detection for DNS over HTTPS using Machine Learning Algorithms
    Casanova, Lionel F. Gonzalez
    Lin, Po-Chiang
    [J]. APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2023, 12 (02)
  • [8] An Analysis Employing Various Machine Learning Algorithms for Detection of Malicious URLs
    Rizvi, Fizza
    Mohi ud din, Saika
    Sharma, Nonita
    Sharma, Deepak Kumar
    [J]. Communications in Computer and Information Science, 2023, 1782 CCIS : 235 - 241
  • [9] Malicious Software Detection System in a Virtual Machine Using Database
    Cheon, Hyun-woo
    Lee, Kyu-Won
    Lee, Sang-Ho
    Lee, Geuk
    [J]. CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, 2011, 206 : 212 - +
  • [10] Evaluation of Machine Learning Algorithms for Detection of Malicious Traffic in SCADA Network
    L. Rajesh
    Penke Satyanarayana
    [J]. Journal of Electrical Engineering & Technology, 2022, 17 : 913 - 928