AnDarwin: Scalable Detection of Android Application Clones Based on Semantics

被引:41
|
作者
Crussell, Jonathan [1 ]
Gibler, Clint [1 ]
Chen, Hao [1 ]
机构
[1] Univ Calif Davis, Comp Sci, Davis, CA 95616 USA
基金
美国国家科学基金会;
关键词
Program analysis; clustering; plagiarism detection; mobile applications;
D O I
10.1109/TMC.2014.2381212
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Smartphones rely on their vibrant application markets; however, plagiarism threatens the long-term health of these markets. We present a scalable approach to detecting similar Android apps based on their semantic information. We implement our approach in a tool called AnDarwin and evaluate it on 265,359 apps collected from 17 markets including Google Play and numerous third-party markets. In contrast to earlier approaches, AnDarwin has four advantages: it avoids comparing apps pairwise, thus greatly improving its scalability; it analyzes only the app code and does not rely on other information-such as the app's market, signature, or description-thus greatly increasing its reliability; it can detect both full and partial app similarity; and it can automatically detect library code and remove it from the similarity analysis. We present two use cases for AnDarwin: finding similar apps by different developers ("clones") and similar apps from the same developer ("rebranded"). In 10 hours, AnDarwin detected at least 4,295 apps that are the victims of cloning and 36,106 rebranded apps. Additionally, AnDarwin detects similar code that is injected into many apps, which may indicate the spread of malware. Our evaluation demonstrates AnDarwin's ability to accurately detect similar apps on a large scale.
引用
收藏
页码:2007 / 2019
页数:13
相关论文
共 50 条
  • [1] A Rapid and Scalable Method for Android Application Repackaging Detection
    Jiao, Sibei
    Cheng, Yao
    Ying, Lingyun
    Su, Purui
    Feng, Dengguo
    INFORMATION SECURITY PRACTICE AND EXPERIENCE, ISPEC 2015, 2015, 9065 : 349 - 364
  • [2] A New Semantics-Based Android Malware Detection
    Zhang, Xiaohan
    Jin, Zhengping
    2016 2ND IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATIONS (ICCC), 2016, : 1412 - 1416
  • [3] Scalable Detection of Semantic Clones
    Gabel, Mark
    Jiang, Lingxiao
    Su, Zhendong
    ICSE'08 PROCEEDINGS OF THE THIRTIETH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, 2008, : 321 - 330
  • [4] Scalable Race Detection for Android Applications
    Bielik, Pavol
    Raychev, Veselin
    Vechev, Martin
    ACM SIGPLAN NOTICES, 2015, 50 (10) : 332 - 348
  • [5] Scalable and accurate detection of code clones
    S. Sargsyan
    Sh. Kurmangaleev
    A. Belevantsev
    A. Avetisyan
    Programming and Computer Software, 2016, 42 : 27 - 33
  • [6] Scalable and Accurate Detection of Code Clones
    Sargsyan, S.
    Kurmangaleev, Sh.
    Belevantsev, A.
    Avetisyan, A.
    PROGRAMMING AND COMPUTER SOFTWARE, 2016, 42 (01) : 27 - 33
  • [7] A Scalable Cloud-Based Android App Repackaging Detection Framework
    Li, Jinghua
    Liu, Xiaoyan
    Zhang, Huixiang
    Mu, Dejun
    GREEN, PERVASIVE, AND CLOUD COMPUTING, 2016, 9663 : 113 - 125
  • [8] DECKARD: Scalable and accurate tree-based detection of code clones
    Jiang, Lingxiao
    Misherghi, Ghassan
    Su, Zhendong
    Glondu, Stephane
    ICSE 2007: 29TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, PROCEEDINGS, 2007, : 96 - +
  • [9] Apposcopy: Semantics-Based Detection of Android Malware through Static Analysis
    Feng, Yu
    Anand, Saswat
    Dillig, Isil
    Aiken, Alex
    22ND ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON THE FOUNDATIONS OF SOFTWARE ENGINEERING (FSE 2014), 2014, : 576 - 587
  • [10] SAINTDroid: Scalable, Automated Incompatibility Detection for Android
    Silva, Bruno
    Stevens, Clay
    Mansoor, Niloofar
    Srisa-An, Witawas
    Yu, Tingting
    Bagheri, Hamid
    2022 52ND ANNUAL IEEE/IFIP INTERNATIONAL CONFERENCE ON DEPENDABLE SYSTEMS AND NETWORKS (DSN 2022), 2022, : 567 - 579