Mining user privacy concern topics from app reviews

被引:0
|
作者
Zhang, Jianzhang [1 ]
Zhou, Jialong [1 ]
Hua, Jinping [2 ]
Niu, Nan [3 ]
Liu, Chuang [1 ]
机构
[1] Hangzhou Normal Univ, Dept Management Sci & Engn, Hangzhou, Zhejiang, Peoples R China
[2] Jiangxi Prov Inst Cyber Secur, Nanchang, Jiangxi, Peoples R China
[3] Univ Cincinnati, Dept Elect Engn & Comp Sci, Cincinnati, OH 45221 USA
基金
中国国家自然科学基金;
关键词
Privacy concerns; Topic modeling; App reviews mining; Privacy requirements; Requirements engineering; MOBILE APPS; REQUIREMENTS; PERCEPTION; TAXONOMY;
D O I
10.1016/j.jss.2025.112355
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Context: As mobile applications (apps) widely spread throughout our society and daily life, various personal information is constantly demanded by apps in exchange for more intelligent and customized functionality. An increasing number of users are voicing their privacy concerns through app reviews on app stores. Objective: The main challenge of effectively mining privacy concerns from user reviews lies in that reviews expressing privacy concerns are overridden by a large number of reviews expressing more generic themes and noisy content. In this work, we propose a novel automated approach to overcome that challenge. Method: Our approach first employs information retrieval and document embeddings to extract candidate privacy reviews in an unsupervised manner, which are further labeled to prepare the annotation dataset. Then, supervised classifiers are trained to automatically identify privacy reviews. Finally, an interpretable topic mining algorithm is designed to detect privacy concern topics contained in the privacy reviews. Results: Experimental results show that the best performing document embedding achieves an average precision of 96.80% in the top 100 retrieved candidate privacy reviews, outperforming the taxonomy-based baseline, which achieves 73.87%. All trained privacy review classifiers achieve an F1 score above 91%, surpassing the keyword-matching baseline by as much as 7.5% and the large language model baseline by up to 2.74%. For detecting privacy concern topics from privacy reviews, our proposed algorithm achieves both better topic coherence and topic diversity than three strong topic modeling baselines, including LDA. Conclusion: Empirical evaluation results demonstrate the effectiveness of our approach in identifying privacy reviews and detecting user privacy concerns in app reviews.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Exploring the Influence of Software Evolution on Mobile App Accessibility: Insights from User Reviews
    Oliveira, Alberto Dumont Alves
    Dos Santos, Paulo Sergio Henrique
    Aljedaani, Wajdi
    Eler, Marcelo Medeiros
    Journal of the Brazilian Computer Society, 2024, 30 (01) : 584 - 607
  • [42] Mining User Requirements from Application Store Reviews Using Frame Semantics
    Jha, Nishant
    Mahmoud, Anas
    REQUIREMENTS ENGINEERING: FOUNDATION FOR SOFTWARE QUALITY, REFSQ 2017, 2017, 10153 : 273 - 287
  • [43] Opinion Mining from Online User Reviews Using Fuzzy Linguistic Hedges
    Dalal, Mita K.
    Zaveri, Mukesh A.
    APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2014, 2014
  • [44] AR-Miner: Mining Informative Reviews for Developers from Mobile App Marketplace
    Chen, Ning
    Lin, Jialiu
    Hoi, Steven C. H.
    Xiao, Xiaokui
    Zhang, Boshen
    36TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING (ICSE 2014), 2014, : 767 - 778
  • [45] Mining the Influencing Factors and Their Asymmetrical Effects of mHealth Sleep App User Satisfaction From Real-world User-Generated Reviews: Content Analysis and Topic Modeling
    Nuo, Mingfu
    Zheng, Shaojiang
    Wen, Qinglian
    Fang, Hongjuan
    Wang, Tong
    Liang, Jun
    Han, Hongbin
    Lei, Jianbo
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
  • [46] App Update Patterns: How Developers Act on User Reviews in Mobile App Stores
    Wang, Shance
    Wang, Zhongjie
    Xu, Xiaofei
    Sheng, Quan Z.
    SERVICE-ORIENTED COMPUTING, ICSOC 2017, 2017, 10601 : 125 - 141
  • [47] Leveraging User Reviews to Improve Accuracy for Mobile App Retrieval
    Park, Dae Hoon
    Liu, Mengwen
    Zhai, ChengXiang
    Wang, Haohong
    SIGIR 2015: PROCEEDINGS OF THE 38TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2015, : 533 - 542
  • [48] Method of Relevance Judgment for App Software's User Reviews
    Xiang, Qixin
    Jiang, Ying
    Ran, Meng
    Ding, Jiaman
    DATA SCIENCE, PT II, 2017, 728 : 28 - 41
  • [49] An Automatic Analysis of User Reviews Method for APP Evolution and Maintenance
    Xiao J.-M.
    Chen S.-Z.
    Feng Z.-Y.
    Liu P.-L.
    Xue X.
    Jisuanji Xuebao/Chinese Journal of Computers, 2020, 43 (11): : 2184 - 2202
  • [50] Reconciling Mobile App Privacy and Usability on Smartphones: Could User Privacy Profiles Help?
    Liu, Bin
    Lin, Jialiu
    Sadeh, Norman
    WWW'14: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, : 201 - 211