Combining traditional machine learning and anomaly detection for several imbalanced Android malware dataset's classification

被引:0
|
作者
Gan, Yiwei [1 ]
Han, Qian [2 ]
Gao, Yumeng [3 ]
机构
[1] Dalhousie Univ, Halifax, NS B3H 4R2, Canada
[2] Dartmouth Coll, Hanover, NH USA
[3] Agcy Sci Technol & Res STAR, Inst Infocomm Res, Singapore, Singapore
关键词
Android malware; machine learning; data mining; static analysis; mobile security; DETECTION SYSTEM;
D O I
10.1145/3529399.3529412
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the number of mobile devices has exploded in recent years, so has the amount of advanced Android malware. Among the popular Android malware on the market today, click fraud malware, adware, banking Trojans, spyware, etc. are usually disguised and hidden in the heap of good Android applications. These advanced malwares lurk in the third-party application market trusted by users, and potentially endanger the security of the user's smart device causing privacy or economic loss. Therefore, this paper leverages and combines traditional machine learning and anomaly detection methods to detect specific classes of Android malware in three highly imbalanced Android datasets (entertainment + social app vs. click fraud malware + adware; financial services app vs. banking trojan; communication app vs. spyware). The experiment results show that our proposed combined methods have great performance on the three sub-datasets, achieving the average f1-score over 0.98 on three imbalanced datasets, which performs better than the traditional machine learning algorithm used alone. In addition, we use the combined methods to analyze the correlation between the top features of the dataset, and provide interpretable insights for other researchers focusing on Android malware classification in the coming future.
引用
收藏
页码:74 / 80
页数:7
相关论文
共 50 条
  • [1] A new machine learning-based method for android malware detection on imbalanced dataset
    Diyana Tehrany Dehkordy
    Abbas Rasoolzadegan
    [J]. Multimedia Tools and Applications, 2021, 80 : 24533 - 24554
  • [2] A new machine learning-based method for android malware detection on imbalanced dataset
    Dehkordy, Diyana Tehrany
    Rasoolzadegan, Abbas
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (16) : 24533 - 24554
  • [3] Malware Detection in Android Systems with Traditional Machine Learning Models: A Survey
    Bayazit, Esra Calik
    Sahingoz, Ozgur Koray
    Dogan, Buket
    [J]. 2ND INTERNATIONAL CONGRESS ON HUMAN-COMPUTER INTERACTION, OPTIMIZATION AND ROBOTIC APPLICATIONS (HORA 2020), 2020, : 374 - 381
  • [4] Machine learning aided Android malware classification
    Milosevic, Nikola
    Dehghantanha, Ali
    Choo, Kitn-Kwang Raymond
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2017, 61 : 266 - 274
  • [5] Android Anomaly Detection System Using Machine Learning Classification
    Kurniawan, Harry
    Rosmansyah, Yusep
    Dabarsyah, Budiman
    [J]. 5TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS 2015, 2015, : 288 - 293
  • [6] Machine Learning for anomaly detection. Performance study considering anomaly distribution in an imbalanced dataset
    El Hajjami, Salma
    Malki, Jamal
    Berrada, Mohammed
    Fourka, Bouziane
    [J]. PROCEEDINGS OF 2020 5TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND ARTIFICIAL INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS (CLOUDTECH'20), 2020, : 109 - 116
  • [7] Handling Imbalanced Dataset Classification in Machine Learning
    Yadav, Seema
    Bhole, Girish P.
    [J]. 2020 IEEE PUNE SECTION INTERNATIONAL CONFERENCE (PUNECON), 2020, : 38 - 43
  • [8] An Android Malware Detection Leveraging Machine Learning
    Shatnawi, Ahmed S.
    Jaradat, Aya
    Yaseen, Tuqa Bani
    Taqieddin, Eyad
    Al-Ayyoub, Mahmoud
    Mustafa, Dheya
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [9] Android Malware Detection Based on Machine Learning
    Wang, Qing-Fei
    Fang, Xiang
    [J]. 2018 4TH ANNUAL INTERNATIONAL CONFERENCE ON NETWORK AND INFORMATION SYSTEMS FOR COMPUTERS (ICNISC 2018), 2018, : 434 - 436
  • [10] Android Malware Detection Using Machine Learning
    Droos, Ayat
    Al-Mahadeen, Awss
    Al-Harasis, Tasnim
    Al-Attar, Rama
    Ababneh, Mohammad
    [J]. 2022 13TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2022, : 36 - 41