Android Authorship Attribution Using Source Code-Based Features

被引:0
|
作者
Aydogan, Emre [1 ]
Sen, Sevil [1 ]
机构
[1] Hacettepe Univ, Dept Comp Engn, Wireless Networks & Intelligent Secure Syst WISE L, TR-06800 Ankara, Turkiye
关键词
Android; authorship attribution; mobile malware; metadata; obfuscation; source code-based; BINARY CODE; ROBUST;
D O I
10.1109/ACCESS.2024.3351945
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the widespread use of mobile devices, Android has become the most popular operating system, and new applications being uploaded to the Android market every day. However, due to the ease of modifying and repackaging Android binaries, Android applications can easily be modified and imitated by other developers and released in third-party Android markets. Therefore, determining the original developers of Android applications is a challenging problem known as authorship attribution. This study explores the distinctive features of Android applications to identify their authors. Software developers generally leave a footprint that reflects their writing styles in their applications. Therefore, this footprint, which can be extracted from either the source code or the binary code, can help identify the authors of software applications. Since obtaining the source code of applications in the wild can be impractical, especially when dealing with malware, researchers prefer to focus on the binaries of applications. Therefore, this study proposes an approach that identifies Android developers by deriving a wide range of features from different parts of Android applications, such as smali files, libraries, manifest files, and metadata information. Moreover, other features such as configuration, dex code, resource-based, and string-related features are inherited from other studies in Android authorship attribution and fused with the proposed feature set. The proposed approach was evaluated on benign and malware datasets and compared with those of other studies. The results show that the proposed features increase the accuracy by showing 82.5% and 95.6% in the market and malware datasets, respectively. The results demonstrate the positive impact of the proposed features on Android authorship attribution.
引用
收藏
页码:6569 / 6589
页数:21
相关论文
共 50 条
  • [31] A study on the real-time code-based GNSS positioning with Android smartphones
    Bahadur, Berkay
    [J]. MEASUREMENT, 2022, 194
  • [32] Poster: Git Blame Who?: Stylistic Authorship Attribution of Small, Incomplete Source Code Fragments
    Dauber, Edwin
    Caliskan, Aylin
    Harang, Richard
    Greenstadt, Rachel
    [J]. PROCEEDINGS 2018 IEEE/ACM 40TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING - COMPANION (ICSE-COMPANION, 2018, : 356 - 357
  • [33] Authorship Attribution on Bengali Literature using Stylometric Features and Neural Network
    Islam, Md. Ashikul
    Kabir, Md. Minhazul
    Islam, Md. Saiful
    Tasnim, Ayesha
    [J]. 2018 4TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATION & COMMUNICATION TECHNOLOGY (ICEEICT), 2018, : 360 - 363
  • [34] A Computational Approach for Authorship Attribution of Literary Texts using Sintatic Features
    Varela, Paulo
    Justino, Edson
    Britto, Alceu, Jr.
    Bortolozzi, Flavio
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 4835 - 4842
  • [35] Automatic Authorship Attribution for Texts in Croatian Language Using Combinations of Features
    Reicher, Tomislav
    Kristo, Ivan
    Belsa, Igor
    Silic, Artur
    [J]. KNOWLEDGE-BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT II, 2010, 6277 : 21 - 30
  • [36] Authorship Attribution of Short Historical Arabic Texts Based on Lexical Features
    Ouamour, S.
    Sayoud, H.
    [J]. 2013 INTERNATIONAL CONFERENCE ON CYBER-ENABLED DISTRIBUTED COMPUTING AND KNOWLEDGE DISCOVERY (CYBERC), 2013, : 144 - 147
  • [37] Deep Metric Learning for Code Authorship Attribution and Verification
    White, Riley
    Sprague, Nathan
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 1089 - 1093
  • [38] Selecting text features relevant for authorship attribution
    Zoya, Rezanova, I
    Alexandr, Romanov S.
    Roman, Meshcheryakov, V
    [J]. VESTNIK TOMSKOGO GOSUDARSTVENNOGO UNIVERSITETA FILOLOGIYA-TOMSK STATE UNIVERSITY JOURNAL OF PHILOLOGY, 2013, 26 (06): : 38 - 52
  • [39] Exploring syntactic and semantic features for authorship attribution
    Wu, Haiyan
    Zhang, Zhiqiang
    Wu, Qingfeng
    [J]. APPLIED SOFT COMPUTING, 2021, 111 (111)
  • [40] Temporally Robust Software Features for Authorship Attribution
    Burrows, Steven
    Uitdenbogerd, Alexandra L.
    Turpin, Andrew
    [J]. 2009 IEEE 33RD INTERNATIONAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 593 - 600