Predicting the Impact of Android Malicious Samples via Machine Learning

被引:12
|
作者
Qiu, Junyang [1 ]
Luo, Wei [1 ]
Pan, Lei [1 ]
Tai, Yonghang [2 ]
Zhang, Jun [3 ]
Xiang, Yang [3 ]
机构
[1] Deakin Univ, Sch Informat Technol, Geelong, Vic 3216, Australia
[2] Yunnan Normal Univ, Sch Phys & Elect Informat, Kunming 650500, Yunnan, Peoples R China
[3] Swinburne Univ Technol, Sch Software & Elect Engn, Melbourne, Vic 3122, Australia
关键词
Android malware; deep neural network; high impact malicious samples; low impact malicious samples; static analysis; SVM; NEURAL-NETWORKS;
D O I
10.1109/ACCESS.2019.2914311
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Recently, Android malicious samples threaten billions of mobile end users' security or privacy. The community researchers have designed many methods to automatically and accurately identify Android malware samples. However, the rapid increase of Android malicious samples outpowers the capabilities of traditional Android malware detectors and classifiers with respect to the cyber security risk management needs. It is important to identify the small proportion of Android malicious samples that may produce high cyber-security or privacy impact. In this paper, we propose a light-weight solution to automatically identify the Android malicious samples with high security and privacy impact. We manually check a number of Android malware families and corresponding security incidents and define two impact metrics for Android malicious samples. Our investigation results in a new Android malware dataset with impact ground truth (low impact or high impact). This new dataset is employed to empirically investigate the intrinsic characteristics of low-impact as well as high-impact malicious samples. To characterize and capture Android malicious samples' pattern, reverse engineering is performed to extract semantic features to represent malicious samples. The leveraged features are parsed from both the AndroidManifest.xml files as well as the disassembled binary classes.dex codes. Then, the extracted features are embedded into numerical vectors. Furthermore, we train highly accurate support vector machine and deep neural network classifiers to categorize the candidate Android malicious samples into low impact or high impact. The empirical results validate the effectiveness of our designed light-weight solution. This method can be further utilized for identifying those high-impact Android malicious samples in the wild.
引用
收藏
页码:66304 / 66316
页数:13
相关论文
共 50 条
  • [41] Machine Learning for Android Scareware Detection
    Bagui, Sikha
    Brock, Hunter
    JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2022, 15 (01)
  • [42] Machine Learning to Identify Android Malware
    Tam, Geran
    Hunter, Aaron
    2018 9TH IEEE ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2018,
  • [43] Predicting depression and suicidal tendencies by analyzing online activities using machine learning in android devices
    Qadeer, Sara
    Memon, Khuhed
    Palli, Ghulam Hyder
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2024, 43 (01) : 213 - 224
  • [44] The impact of machine learning in predicting risk of violence: A systematic review
    Parmigiani, Giovanna
    Barchielli, Benedetta
    Casale, Simona
    Mancini, Toni
    Ferracuti, Stefano
    FRONTIERS IN PSYCHIATRY, 2022, 13
  • [45] Machine learning methods for predicting the outcome of hypervelocity impact events
    Ryan, Shannon
    Thaler, Stephen
    Kandanaarachchi, Sevvandi
    EXPERT SYSTEMS WITH APPLICATIONS, 2016, 45 : 23 - 39
  • [46] Predicting the impact of feedback on matter clustering with machine learning in CAMELS
    Delgado, Ana Maria
    Angles-Alcazar, Daniel
    Thiele, Leander
    Pandey, Shivam
    Lehman, Kai
    Somerville, Rachel S.
    Ntampaka, Michelle
    Genel, Shy
    Villaescusa-Navarro, Francisco
    Hernquist, Lars
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2023, 526 (04) : 5306 - 5325
  • [47] Man-in-the-Middle Attacks Against Machine Learning Classifiers Via Malicious Generative Models
    Wang, Derui
    Li, Chaoran
    Wen, Sheng
    Nepal, Surya
    Xiang, Yang
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2021, 18 (05) : 2074 - 2087
  • [48] Estimating permanent price impact via machine learning
    Philip, R.
    JOURNAL OF ECONOMETRICS, 2020, 215 (02) : 414 - 449
  • [49] Quantifying the Impact of Adversarial Evasion Attacks on Machine Learning Based Android Malware Classifiers
    Abaid, Zainab
    Kaafar, Mohamed Ali
    Jha, Sanjay
    2017 IEEE 16TH INTERNATIONAL SYMPOSIUM ON NETWORK COMPUTING AND APPLICATIONS (NCA), 2017, : 375 - 384
  • [50] Impact of datasets on machine learning based methods in Android malware detection: an empirical study
    Ge, Xiuting
    Huang, Yifan
    Hui, Zhanwei
    Wang, Xiaojuan
    Cao, Xu
    2021 IEEE 21ST INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY (QRS 2021), 2021, : 81 - 92