Mining user privacy concern topics from app reviews

被引:0
|
作者
Zhang, Jianzhang [1 ]
Zhou, Jialong [1 ]
Hua, Jinping [2 ]
Niu, Nan [3 ]
Liu, Chuang [1 ]
机构
[1] Hangzhou Normal Univ, Dept Management Sci & Engn, Hangzhou, Zhejiang, Peoples R China
[2] Jiangxi Prov Inst Cyber Secur, Nanchang, Jiangxi, Peoples R China
[3] Univ Cincinnati, Dept Elect Engn & Comp Sci, Cincinnati, OH 45221 USA
基金
中国国家自然科学基金;
关键词
Privacy concerns; Topic modeling; App reviews mining; Privacy requirements; Requirements engineering; MOBILE APPS; REQUIREMENTS; PERCEPTION; TAXONOMY;
D O I
10.1016/j.jss.2025.112355
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Context: As mobile applications (apps) widely spread throughout our society and daily life, various personal information is constantly demanded by apps in exchange for more intelligent and customized functionality. An increasing number of users are voicing their privacy concerns through app reviews on app stores. Objective: The main challenge of effectively mining privacy concerns from user reviews lies in that reviews expressing privacy concerns are overridden by a large number of reviews expressing more generic themes and noisy content. In this work, we propose a novel automated approach to overcome that challenge. Method: Our approach first employs information retrieval and document embeddings to extract candidate privacy reviews in an unsupervised manner, which are further labeled to prepare the annotation dataset. Then, supervised classifiers are trained to automatically identify privacy reviews. Finally, an interpretable topic mining algorithm is designed to detect privacy concern topics contained in the privacy reviews. Results: Experimental results show that the best performing document embedding achieves an average precision of 96.80% in the top 100 retrieved candidate privacy reviews, outperforming the taxonomy-based baseline, which achieves 73.87%. All trained privacy review classifiers achieve an F1 score above 91%, surpassing the keyword-matching baseline by as much as 7.5% and the large language model baseline by up to 2.74%. For detecting privacy concern topics from privacy reviews, our proposed algorithm achieves both better topic coherence and topic diversity than three strong topic modeling baselines, including LDA. Conclusion: Empirical evaluation results demonstrate the effectiveness of our approach in identifying privacy reviews and detecting user privacy concerns in app reviews.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] App Reviews: Breaking the User and Developer Language Barrier
    Hoon, Leonard
    Angel Rodriguez-Garcia, Miguel
    Vasa, Rajesh
    Valencia-Garcia, Rafael
    Schneider, Jean-Guy
    TRENDS AND APPLICATIONS IN SOFTWARE ENGINEERING, 2016, 405 : 223 - 233
  • [32] App-Aware Response Synthesis for User Reviews
    Farooq, Umar
    Siddique, A. B.
    Jamour, Fuad
    Zhao, Zhijia
    Hristidis, Vagelis
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 699 - 708
  • [33] User Reviews of Depression App Features: Sentiment Analysis
    Meyer, Julien
    Okuboyejo, Senanu
    JMIR FORMATIVE RESEARCH, 2021, 5 (12)
  • [34] User Intention Mining in Bussiness Reviews: A Review
    Habib, Anam
    Saddozai, Furqan Khan
    Sattar, Anum
    Khan, Aurangzeb
    Hameed, Ibrahim A.
    Kundi, Fazal Masud
    2018 5TH INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC, AND SOCIO-CULTURAL COMPUTING (BESC), 2018, : 243 - 249
  • [35] The Role of User Reviews in App Updates: A Preliminary Investigation on App Release Notes
    Wang, Chong
    Liu, Tianyang
    Liang, Peng
    Daneva, Maya
    van Sinderen, Marten
    2021 28TH ASIA-PACIFIC SOFTWARE ENGINEERING CONFERENCE (APSEC 2021), 2021, : 520 - 525
  • [36] Analysis of COVID-19 Gov PK app user reviews to determine online privacy concerns of Pakistani citizens
    Yaqub, Ussama
    Saleem, Tauqeer
    Zaman, Salma
    GLOBAL KNOWLEDGE MEMORY AND COMMUNICATION, 2024, 73 (6/7) : 913 - 928
  • [37] Analyzing User Perspectives on Mobile App Privacy at Scale
    Nema, Preksha
    Anthonysamy, Pauline
    Taft, Nina
    Peddinti, Sai Teja
    2022 ACM/IEEE 44TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2022), 2022, : 112 - 124
  • [38] Personalized Mobile App Recommendation: Reconciling App Functionality and User Privacy Preference
    Liu, Bin
    Kong, Deguang
    Cen, Lei
    Gong, Neil Zhenqiang
    Jin, Hongxia
    Xiong, Hui
    WSDM'15: PROCEEDINGS OF THE EIGHTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2015, : 315 - 324
  • [39] Identifying Functional and Non-functional Software Requirements From User App Reviews
    Dave, Dev
    Anu, Vaibhav
    2022 IEEE INTERNATIONAL IOT, ELECTRONICS AND MECHATRONICS CONFERENCE (IEMTRONICS), 2022, : 845 - 850
  • [40] Experience Report: Understanding Cross-Platform App Issues From User Reviews
    Man, Yichuan
    Gao, Cuiyun
    Lyu, Michael R.
    Jiang, Jiuchun
    2016 IEEE 27TH INTERNATIONAL SYMPOSIUM ON SOFTWARE RELIABILITY ENGINEERING (ISSRE), 2016, : 138 - 149