Towards a fair comparison and realistic evaluation framework of android malware detectors based on static analysis and machine learning

被引：15

作者：

Molina-Coronado, Borja ^{[1
]}

Mori, Usue ^{[2
]}

Mendiburu, Alexander ^{[1
]}

Miguel-Alonso, Jose ^{[1
]}

机构：

[1] Univ Basque Country UPV EHU, Dept Comp Architecture & Technol, Ps Manuel Lardizabal 1, Donostia San Sebastian 20018, Gipuzkoa, Spain

[2] Univ Basque Country UPV EHU, Dept Comp Sci & Artificial Intelligence, Ps Manuel Lardizabal 1, Donostia San Sebastian 20018, Gipuzkoa, Spain

来源：

COMPUTERS & SECURITY | 2023年 / 124卷

关键词：

Android malware detection; Machine learning; Mobile security; Experimental analysis; Static analysis; OBFUSCATION; DISCOVERY; KNOWLEDGE; MODEL;

D O I：

10.1016/j.cose.2022.102996

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

As in other cybersecurity areas, machine learning (ML) techniques have emerged as a promising solution to detect Android malware. In this sense, many proposals employing a variety of algorithms and feature sets have been presented to date, often reporting impresive detection performances. However, the lack of reproducibility and the absence of a standard evaluation framework make these proposals difficult to compare. In this paper, we perform an analysis of 10 influential research works on Android malware detection using a common evaluation framework. We have identified five factors that, if not taken into account when creating datasets and designing detectors, significantly affect the trained ML models and their performances. In particular, we analyze the effect of (1) the presence of duplicated samples, (2) label (goodware/greyware/malware) attribution, (3) class imbalance, (4) the presence of apps that use evasion techniques and, (5) the evolution of apps. Based on this extensive experimentation, we conclude that the studied ML-based detectors have been evaluated optimistically, which justifies the good published results. Our findings also highlight that it is imperative to generate realistic experimental scenarios, taking into account the aforementioned factors, to foster the rise of better ML-based Android malware detection solutions. (c) 2022 Elsevier Ltd. All rights reserved.

引用

页数：16

共 50 条

[21] A Method for Automatic Android Malware Detection Based on Static Analysis and Deep Learning
Ibrahim, Mulhem
Issa, Bayan
Jasser, Muhammed Basheer
IEEE ACCESS, 2022, 10 : 117334 - 117352
[22] Analysis of Machine Learning Solutions to Detect Malware in Android
Jamil, Qudsia
Shah, Munam Ali
2016 SIXTH INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGY (INTECH), 2016, : 226 - 232
[23] An Android Malware Detection System Based on Machine Learning
Wen, Long
Yu, Haiyang
GREEN ENERGY AND SUSTAINABLE DEVELOPMENT I, 2017, 1864
[24] Detecting Android Malware Based on Extreme Learning Machine
Sun, Yuxia
Xie, Yunlong
Qiu, Zhi
Pan, Yuchang
Weng, Jian
Guo, Song
2017 IEEE 15TH INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, 15TH INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, 3RD INTL CONF ON BIG DATA INTELLIGENCE AND COMPUTING AND CYBER SCIENCE AND TECHNOLOGY CONGRESS(DASC/PICOM/DATACOM/CYBERSCI, 2017, : 47 - 53
[25] Static Analysis of Android Malware Detection using Deep Learning
Sandeep, H. R.
PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICCS), 2019, : 841 - 845
[26] Android Malware Classification Addressing Repackaged Entities by the Evaluation of Static Features and Multiple Machine Learning Algorithms
Hasan, Md Rashedul
PROCEEDINGS OF SEVENTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2022, VOL. 2, 2023, 448 : 29 - 38
[27] MLDroid—framework for Android malware detection using machine learning techniques
Arvind Mahindru
A. L. Sangal
Neural Computing and Applications, 2021, 33 : 5183 - 5240
[28] Static and Dynamic Analysis of Android Malware and Goodware Written with Unity Framework
Shim, Jaewoo
Lim, Kyeonghwan
Cho, Seong-je
Han, Sangchul
Park, Minkyu
SECURITY AND COMMUNICATION NETWORKS, 2018,
[29] Functionality-Verification Attack Framework Based on Reinforcement Learning Against Static Malware Detectors
Tian, Buwei
Jiang, Junyong
He, Zichen
Yuan, Xin
Dong, Lu
Sun, Changyin
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 19 : 8500 - 8514
[30] Comprehensive Android Malware Detection: Leveraging Machine Learning and Sandboxing Techniques through Static and Dynamic Analysis
Bhooshan, Prashant
Darshan, Shiva S. L.
Sonkar, Nidhi
2024 IEEE 21ST INTERNATIONAL CONFERENCE ON MOBILE AD-HOC AND SMART SYSTEMS, MASS 2024, 2024, : 580 - 585

← 1 2 3 4 5 →