Characterizing Android apps' behavior for effective detection of malapps at large scale

被引:57
|
作者
Wang, Xing [1 ]
Wang, Wei [1 ]
He, Yongzhong [1 ]
Liu, Jiqiang [1 ]
Han, Zhen [1 ]
Zhang, Xiangliang [2 ]
机构
[1] Beijing Jiaotong Univ, Beijing Key Lab Secur & Privacy Intelligent Trans, 3 Shangyuancun, Beijing 100044, Peoples R China
[2] King Abdullah Univ Sci & Technol, Div Comp Elect & Math Sci & Engn, Thuwal, Saudi Arabia
基金
中国国家自然科学基金;
关键词
Android; Malicious apps detection; Feature comparison;
D O I
10.1016/j.future.2017.04.041
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Android malicious applications (inalapps) have surged and been sophisticated, posing a great threat to users. How to characterize, understand and detect Android malapps at a large scale is thus a big challenge. In this work, we are motivated to discover the discriminatory and persistent features extracted from Android APK files for automated malapp detection at a large scale. To achieve this goal, firstly we extract a very large number of features from each app and categorize the features into two groups, namely, app-specific features as well as platform-defined features. These feature sets will then be fed into four classifiers (i.e., Logistic Regression, linear SVM, Decision Tree and Random Forest) for the detection of malapps. Secondly, we evaluate the persistence of app-specific and platform-defined features on classification performance with two data sets collected in different time periods. Thirdly, we comprehensively analyze the relevant features selected by Logistic Regression classifier to identify the contributions of each feature set. We conduct extensive experiments on large real-world app sets consisting of 213,256 benign apps collected from six app markets, 4,363 benign apps from Google Play market, and 18,363 malapps. The experimental results and our analysis give insights regarding what discriminatory features are most effective to characterize malapps for building an effective and efficient malapp detection system. With the selected discriminatory features, the Logistic Regression classifier yields the best true positive rate as 96% with a false positive rate as 0.06%. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:30 / 45
页数:16
相关论文
共 50 条
  • [21] MUBot: Learning to Test Large-Scale Commercial Android Apps like a Human
    Peng, Chao
    Zhang, Zhao
    Lv, Zhengwei
    Yang, Ping
    2022 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE MAINTENANCE AND EVOLUTION (ICSME 2022), 2022, : 543 - 552
  • [22] A large-scale exploratory study of android sports apps in the google play store
    Chembakottu, Bhagya
    Li, Heng
    Khomh, Foutse
    INFORMATION AND SOFTWARE TECHNOLOGY, 2023, 164
  • [23] AndroZooOpen: Collecting Large-scale Open Source Android Apps for the Research Community
    Liu, Pei
    Li, Li
    Zhao, Yanjie
    Sun, Xiaoyu
    Grundy, John
    2020 IEEE/ACM 17TH INTERNATIONAL CONFERENCE ON MINING SOFTWARE REPOSITORIES, MSR, 2020, : 548 - 552
  • [24] A Large-Scale Longitudinal Analysis of Missing Label Accessibility Failures in Android Apps
    Fok, Raymond
    Zhong, Mingyuan
    Ross, Anne Spencer
    Fogarty, James
    Wobbrock, Jacob O.
    PROCEEDINGS OF THE 2022 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI' 22), 2022,
  • [25] PRADA: Prioritizing Android Devices for Apps by Mining Large-Scale Usage Data
    Lu, Xuan
    Liu, Xuanzhe
    Li, Huoran
    Xie, Tao
    Mei, Qiaozhu
    Hao, Dan
    Huang, Gang
    Feng, Feng
    2016 IEEE/ACM 38TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2016, : 3 - 13
  • [26] DroidKin: Lightweight Detection of Android Apps Similarity
    Gonzalez, Hugo
    Stakhanova, Natalia
    Ghorbani, Ali A.
    INTERNATIONAL CONFERENCE ON SECURITY AND PRIVACY IN COMMUNICATION NETWORKS, SECURECOMM 2014, PT I, 2015, 152 : 436 - 453
  • [27] Feature Point Detection for Repacked Android Apps
    Khan, M. A. Rahim
    Jain, Manoj Kumar
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2020, 26 (06): : 1359 - 1373
  • [28] Detection of SQLite Database Vulnerabilities in Android Apps
    Jain, Vineeta
    Gaur, M. S.
    Laxmi, Vijay
    Mosbah, Mohamed
    INFORMATION SYSTEMS SECURITY, 2016, 10063 : 521 - 531
  • [29] ShuffleDog: Characterizing and Adapting User-Perceived Latency of Android Apps
    Huang, Gang
    Xu, Mengwei
    Lin, Felix Xiaozhu
    Liu, Yunxin
    Ma, Yun
    Pushp, Saumay
    Liu, Xuanzhe
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2017, 16 (10) : 2913 - 2926
  • [30] Characterizing the evolution of statically-detectable performance issues of Android apps
    Teerath Das
    Massimiliano Di Penta
    Ivano Malavolta
    Empirical Software Engineering, 2020, 25 : 2748 - 2808