AndroZooOpen: Collecting Large-scale Open Source Android Apps for the Research Community

被引:24
|
作者
Liu, Pei [1 ]
Li, Li [1 ]
Zhao, Yanjie [1 ]
Sun, Xiaoyu [1 ]
Grundy, John [1 ]
机构
[1] Monash Univ, Fac Informat Technol, Melbourne, Australia
基金
澳大利亚研究理事会;
关键词
Android; Open-source; AndroZoo; AndroZooOpen;
D O I
10.1145/3379597.3387503
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
It is critical for research to have an open, well-curated, representative set of apps for analysis. We present a collection of open-source Android apps collected from several sources, including Github. Our dataset, AndroZooOpen, currently contains over 45,000 app artefacts, a representative picture of Github-hosted Android apps. For apps released on Google Play, metadata including categories, ratings and user reviews, are also stored. We share this new dataset as part of our ongoing research to better support and enable new research topics involving Android app artefact analysis, and as a supplement dataset for AndroZoo, a well-known app collection of close-sourced Android apps.
引用
收藏
页码:548 / 552
页数:5
相关论文
共 50 条
  • [21] A Large-scale Dataset of (Open Source) License Text Variants
    Zacchiroli, Stefano
    2022 MINING SOFTWARE REPOSITORIES CONFERENCE (MSR 2022), 2022, : 757 - 761
  • [22] Large-scale open innovation: open source vs. patent pools
    Rayna, Thierry
    Striukova, Ludmila
    INTERNATIONAL JOURNAL OF TECHNOLOGY MANAGEMENT, 2010, 52 (3-4) : 477 - 496
  • [23] Permission Issues in Open-source Android Apps: An Exploratory Study
    Scoccia, Gian Luca
    Peruma, Anthony
    Pujols, Virginia
    Malavolta, Ivano
    Krutz, Daniel E.
    2019 19TH IEEE INTERNATIONAL WORKING CONFERENCE ON SOURCE CODE ANALYSIS AND MANIPULATION (SCAM), 2019, : 238 - 249
  • [24] A Large-Scale Empirical Study on the Effects of Code Obfuscations on Android Apps and Anti-Malware Products
    Hammad, Mahmoud
    Garcia, Joshua
    Malek, Sam
    PROCEEDINGS 2018 IEEE/ACM 40TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE), 2018, : 421 - 431
  • [25] Dark Hazard: Learning-based, Large-scale Discovery of Hidden Sensitive Operations in Android Apps
    Pan, Xiaorui
    Wang, Xueqiang
    Duan, Yue
    Wang, XiaoFeng
    Yin, Heng
    24TH ANNUAL NETWORK AND DISTRIBUTED SYSTEM SECURITY SYMPOSIUM (NDSS 2017), 2017,
  • [26] Examining Image-Based Button Labeling for Accessibility in Android Apps through Large-Scale Analysis
    Ross, Anne Spencer
    Zhang, Xiaoyi
    Fogarty, James
    Wobbrock, Jacob O.
    ASSETS'18: PROCEEDINGS OF THE 20TH INTERNATIONAL ACM SIGACCESS CONFERENCE ON COMPUTERS AND ACCESSIBILITY, 2018, : 119 - 130
  • [27] A large-scale study on the adoption of anti-debugging and anti-tampering protections in android apps
    Berlato, Stefano
    Ceccato, Mariano
    JOURNAL OF INFORMATION SECURITY AND APPLICATIONS, 2020, 52
  • [28] A Large-scale Investigation into Geodifferences in Mobile Apps
    Kumar, Renuka
    Virkud, Apurva
    Raman, Ram Sundara
    Prakash, Atul
    Ensafi, Roya
    PROCEEDINGS OF THE 31ST USENIX SECURITY SYMPOSIUM, 2022, : 1203 - 1220
  • [29] A Large-Scale Study of Application Incompatibilities in Android
    Cai, Haipeng
    Zhang, Ziyi
    Li, Li
    Fu, Xiaoqin
    PROCEEDINGS OF THE 28TH ACM SIGSOFT INTERNATIONAL SYMPOSIUM ON SOFTWARE TESTING AND ANALYSIS (ISSTA '19), 2019, : 216 - 227
  • [30] An open source analysis framework for large-scale building energy modeling
    Ball, Brian L.
    Long, Nicholas
    Fleming, Katherine
    Balbach, Chris
    Lopez, Phylroy
    JOURNAL OF BUILDING PERFORMANCE SIMULATION, 2020, 13 (05) : 487 - 500