Android Authorship Attribution Using Source Code-Based Features

被引:0
|
作者
Aydogan, Emre [1 ]
Sen, Sevil [1 ]
机构
[1] Hacettepe Univ, Dept Comp Engn, Wireless Networks & Intelligent Secure Syst WISE L, TR-06800 Ankara, Turkiye
关键词
Android; authorship attribution; mobile malware; metadata; obfuscation; source code-based; BINARY CODE; ROBUST;
D O I
10.1109/ACCESS.2024.3351945
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the widespread use of mobile devices, Android has become the most popular operating system, and new applications being uploaded to the Android market every day. However, due to the ease of modifying and repackaging Android binaries, Android applications can easily be modified and imitated by other developers and released in third-party Android markets. Therefore, determining the original developers of Android applications is a challenging problem known as authorship attribution. This study explores the distinctive features of Android applications to identify their authors. Software developers generally leave a footprint that reflects their writing styles in their applications. Therefore, this footprint, which can be extracted from either the source code or the binary code, can help identify the authors of software applications. Since obtaining the source code of applications in the wild can be impractical, especially when dealing with malware, researchers prefer to focus on the binaries of applications. Therefore, this study proposes an approach that identifies Android developers by deriving a wide range of features from different parts of Android applications, such as smali files, libraries, manifest files, and metadata information. Moreover, other features such as configuration, dex code, resource-based, and string-related features are inherited from other studies in Android authorship attribution and fused with the proposed feature set. The proposed approach was evaluated on benign and malware datasets and compared with those of other studies. The results show that the proposed features increase the accuracy by showing 82.5% and 95.6% in the market and malware datasets, respectively. The results demonstrate the positive impact of the proposed features on Android authorship attribution.
引用
收藏
页码:6569 / 6589
页数:21
相关论文
共 50 条
  • [41] Authorship Attribution Using Entropy
    Grabchak, M.
    Zhang, Z.
    Zhang, D. T.
    [J]. JOURNAL OF QUANTITATIVE LINGUISTICS, 2013, 20 (04) : 301 - 313
  • [42] Stylometric Features for Authorship Attribution of Polish Texts
    Szwed, Piotr
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING, ICAISC 2017, PT II, 2017, 10246 : 171 - 182
  • [43] Source Code Authorship Identification Using Deep Neural Networks
    Kurtukova, Anna
    Romanov, Aleksandr
    Shelupanov, Alexander
    [J]. SYMMETRY-BASEL, 2020, 12 (12): : 1 - 17
  • [44] Meta-Heuristic Guided Feature Optimization for Enhanced Authorship Attribution in Java']Java Source Code
    Al-Ahmad, Bilal
    Al-Madi, Nailah
    Alzaqebah, Abdullah
    Alkhawaldeh, Rami S.
    Aldebei, Khaled
    Kabir, Md. Faisal
    Altaharwa, Ismail
    Abu-Faraj, Mua'ad
    Aljarah, Ibrahim
    [J]. IEEE ACCESS, 2023, 11 : 141657 - 141673
  • [45] Code-Based Cryptosystems Using Generalized Concatenated Codes
    Puchinger, Sven
    Mueelich, Sven
    Ishak, Karim
    Bossert, Martin
    [J]. APPLICATIONS OF COMPUTER ALGEBRA, 2017, 198 : 397 - 423
  • [46] ASAP: A Source Code Authorship Program
    Matthew F. Tennyson
    [J]. International Journal on Software Tools for Technology Transfer, 2019, 21 : 471 - 484
  • [47] A code-based blind signature
    Blazy, Olivier
    Gaborit, Philippe
    Schrek, Julien
    Sendrier, Nicolas
    [J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2017, : 2718 - 2722
  • [48] ASAP: A Source Code Authorship Program
    Tennyson, Matthew F.
    [J]. INTERNATIONAL JOURNAL ON SOFTWARE TOOLS FOR TECHNOLOGY TRANSFER, 2019, 21 (04) : 471 - 484
  • [49] Code-Based Cryptosystems Evolution
    Samokhina, Marina
    Trushina, Oksana
    [J]. 2017 FOURTH INTERNATIONAL CONFERENCE ON ENGINEERING AND TELECOMMUNICATION (EN&T), 2017, : 15 - 17
  • [50] Authorship Attribution Using Capsule-Based Fusion Approach
    Suman, Chanchal
    Kumar, Rohit
    Saha, Sriparna
    Bhattacharyya, Pushpak
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2021), 2021, 12801 : 289 - 300