Android malware classification using optimum feature selection and ensemble machine learning

被引:0
|
作者
Islam, Rejwana [2 ]
Sayed, Moinul Islam [1 ]
Saha, Sajal [1 ]
Hossain, Mohammad Jamal [2 ]
Masud, Md Abdul [2 ]
机构
[1] Computer Science, Western University, London,ON, Canada
[2] Computer Science and Information Technology, Patuakhali Science and Technology University, Dumki, Patuakhali, Bangladesh
关键词
Android - Android malware - Category Classification - Dynamic features - Dynamics analysis - Ensemble - Machine-learning - Malwares - Multi-classification - Supervised machine learning;
D O I
10.1016/j.iotcps.2023.03.001
中图分类号
学科分类号
摘要
The majority of smartphones on the market run on the Android operating system. Security has been a core concern with this platform since it allows users to install apps from unknown sources. With thousands of apps being produced and launched daily, malware detection using Machine Learning (ML) has attracted significant attention compared to traditional detection techniques. Despite academic and commercial efforts, developing an efficient and reliable method for classifying malware remains challenging. As a result, several datasets for malware analysis have been generated and made available during the past ten years. These datasets may contain static features, such as API calls, intents, and permissions, or dynamic features, like logcat errors, shared memory, and system calls. Dynamic analysis is more resilient when it comes to code obfuscation. Though binary classification and multi-classification have been carried out in recent studies, the latter provides valuable insight into the nature of malware. Because each malware variant operates differently, identifying its category might help prevent it. Using the well-known ensemble ML approach called weighted voting, this study performed dynamic feature analysis for multi-classification. Random Forest, K-nearest Neighbors, Multi-Level Perceptrons, Decision Trees, Support Vector Machines, and Logistic Regression are all studied in this ensemble model. We used a recent dataset named CCCS-CIC-AndMal-2020, which contains an extensive collection of Android applications and malware samples. A well-researched data preparation phase followed by weighted voting based on R2 scores of the ML classifiers presents an accuracy of 95.0% even after excluding 60.2% features, outperforming all recent studies. © 2023 The Authors
引用
收藏
页码:100 / 111
相关论文
共 50 条
  • [1] High Performance Classification of Android Malware Using Ensemble Machine Learning
    Ouk, Pagnchakneat C.
    Pak, Wooguil
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (01): : 381 - 398
  • [2] Android Malware Detection Using Ensemble Feature Learning
    Rout, Siddhartha Suman
    Vashishtha, Lalit Kumar
    Chatterjee, Kakali
    Rout, Jitendra Kumar
    [J]. INFORMATION SYSTEMS AND MANAGEMENT SCIENCE, ISMS 2021, 2023, 521 : 531 - 539
  • [3] Ensemble Machine Learning Approach for Android Malware Classification Using Hybrid Features
    Pektas, Abdurrahman
    Acarman, Tankut
    [J]. PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON COMPUTER RECOGNITION SYSTEMS CORES 2017, 2018, 578 : 191 - 200
  • [4] FEATURE SELECTION AND MACHINE LEARNING CLASSIFICATION FOR MALWARE DETECTION
    Khammas, Ban Mohammed
    Monemi, Alireza
    Bassi, Joseph Stephen
    Ismail, Ismahani
    Nor, Sulaiman Mohd
    Marsono, Muhammad Nadzir
    [J]. JURNAL TEKNOLOGI, 2015, 77 (01):
  • [5] Android Malware Detection Using Machine Learning with Feature Selection Based on the Genetic Algorithm
    Lee, Jaehyeong
    Jang, Hyuk
    Ha, Sungmin
    Yoon, Yourim
    [J]. MATHEMATICS, 2021, 9 (21)
  • [6] DETECTION OF ANDROID MALWARE USING DEEP LEARNING ENSEMBLE WITH CHEETAH-OPTIMIZED FEATURE SELECTION
    Almotairi, Sultan
    Khan, Mohd Abdul Rahim
    Alharbi, Olayan
    Alzaid, Zaid
    Hausawi, Yasser M.
    Almutairi, Jaber
    [J]. ADVANCES AND APPLICATIONS IN DISCRETE MATHEMATICS, 2024, 41 (05): : 357 - 392
  • [7] BFEDroid: A Feature Selection Technique to Detect Malware in Android Apps Using Machine Learning
    Chimeleze, Collins
    Jamil, Norziana
    Ismail, Roslan
    Lam, Kwok-Yan
    Teh, Je Sen
    Samual, Joshua
    Okeke, Chidiebere Akachukwu
    [J]. SECURITY AND COMMUNICATION NETWORKS, 2022, 2022
  • [8] Ensemble Feature Selection for Android SMS Malware Detection
    Ibrahim, Syed F.
    Hossain, Md Sakir
    Islam, Md Moontasirul
    Mostofa, Md Golam
    [J]. ADVANCES IN CYBERSECURITY, CYBERCRIMES, AND SMART EMERGING TECHNOLOGIES, 2023, 4 : 15 - 26
  • [9] Android malware detection applying feature selection techniques and machine learning
    Mohammad Reza Keyvanpour
    Mehrnoush Barani Shirzad
    Farideh Heydarian
    [J]. Multimedia Tools and Applications, 2023, 82 : 9517 - 9531
  • [10] Feature Selection and Ensemble of Classifiers for Android Malware Detection
    Coronado-De-Alba, Lilian D.
    Rodriguez-Mota, Abraham
    Escamilla-Ambrosio, Ponciano J.
    [J]. 2016 8TH IEEE LATIN-AMERICAN CONFERENCE ON COMMUNICATIONS (LATINCOM), 2016,