Mining user privacy concern topics from app reviews

被引:0
|
作者
Zhang, Jianzhang [1 ]
Zhou, Jialong [1 ]
Hua, Jinping [2 ]
Niu, Nan [3 ]
Liu, Chuang [1 ]
机构
[1] Hangzhou Normal Univ, Dept Management Sci & Engn, Hangzhou, Zhejiang, Peoples R China
[2] Jiangxi Prov Inst Cyber Secur, Nanchang, Jiangxi, Peoples R China
[3] Univ Cincinnati, Dept Elect Engn & Comp Sci, Cincinnati, OH 45221 USA
基金
中国国家自然科学基金;
关键词
Privacy concerns; Topic modeling; App reviews mining; Privacy requirements; Requirements engineering; MOBILE APPS; REQUIREMENTS; PERCEPTION; TAXONOMY;
D O I
10.1016/j.jss.2025.112355
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Context: As mobile applications (apps) widely spread throughout our society and daily life, various personal information is constantly demanded by apps in exchange for more intelligent and customized functionality. An increasing number of users are voicing their privacy concerns through app reviews on app stores. Objective: The main challenge of effectively mining privacy concerns from user reviews lies in that reviews expressing privacy concerns are overridden by a large number of reviews expressing more generic themes and noisy content. In this work, we propose a novel automated approach to overcome that challenge. Method: Our approach first employs information retrieval and document embeddings to extract candidate privacy reviews in an unsupervised manner, which are further labeled to prepare the annotation dataset. Then, supervised classifiers are trained to automatically identify privacy reviews. Finally, an interpretable topic mining algorithm is designed to detect privacy concern topics contained in the privacy reviews. Results: Experimental results show that the best performing document embedding achieves an average precision of 96.80% in the top 100 retrieved candidate privacy reviews, outperforming the taxonomy-based baseline, which achieves 73.87%. All trained privacy review classifiers achieve an F1 score above 91%, surpassing the keyword-matching baseline by as much as 7.5% and the large language model baseline by up to 2.74%. For detecting privacy concern topics from privacy reviews, our proposed algorithm achieves both better topic coherence and topic diversity than three strong topic modeling baselines, including LDA. Conclusion: Empirical evaluation results demonstrate the effectiveness of our approach in identifying privacy reviews and detecting user privacy concerns in app reviews.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Mining and Analyzing User Feedback from App Reviews: An Econometric Approach
    Guo, Tong
    Guo, Bin
    Ouyang, Yi
    Yu, Zhiwen
    2018 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2018, : 841 - 848
  • [2] Investigating User Perceptions of Mobile App Privacy: An Analysis of User-Submitted App Reviews
    Besmer, Andrew R.
    Watson, Jason
    Banks, M. Shane
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY AND PRIVACY, 2020, 14 (04) : 74 - 91
  • [3] Topics of Concern: Identifying User Issues in Reviews of IoT Apps and Devices
    Truelove, Andrew
    Chowdhury, Farah Naz
    Gnawali, Omprakash
    Alipour, Mohammad Amin
    2019 IEEE/ACM 1ST INTERNATIONAL WORKSHOP ON SOFTWARE ENGINEERING RESEARCH & PRACTICES FOR THE INTERNET OF THINGS (SERP4IOT 2019), 2019, : 33 - 40
  • [4] App store mining for feature extraction: analyzing user reviews
    Memon, Zulfiqar Ali
    Munawar, Nida
    Kamal, Maha
    ACTA SCIENTIARUM-TECHNOLOGY, 2024, 46 (01)
  • [5] App store mining for iterative domain analysis: Combine app descriptions with user reviews
    Liu, Yuzhou
    Liu, Lei
    Liu, Huaxiao
    Yin, Xinglong
    SOFTWARE-PRACTICE & EXPERIENCE, 2019, 49 (06): : 1013 - 1040
  • [6] Opinion Mining from User Reviews
    Tripathy, Amiya Kumar
    Sundararajan, Revathy
    Deshpande, Chinmay
    Mishra, Pankaj
    Natarajan, Neha
    2015 INTERNATIONAL CONFERENCE ON TECHNOLOGY FOR SUSTAINABLE DEVELOPMENT (ICTSD-2015), 2015,
  • [7] A systematic literature review: Opinion mining studies from mobile app store user reviews
    Genc-Nayebi, Necmiye
    Abran, Alain
    JOURNAL OF SYSTEMS AND SOFTWARE, 2017, 125 : 207 - 219
  • [8] Mining the participatory role of massive user reviews in the update design of APP software
    Qian Y.
    Cao E.
    Deng W.
    Yuan H.
    Xitong Gongcheng Lilun yu Shijian/System Engineering Theory and Practice, 2021, 41 (03): : 554 - 564
  • [9] Mining User Rationale from Software Reviews
    Kurtanovic, Zijad
    Maalej, Walid
    2017 IEEE 25TH INTERNATIONAL REQUIREMENTS ENGINEERING CONFERENCE (RE), 2017, : 61 - 70
  • [10] Mining User Opinions in Mobile App Reviews: A Keyword-based Approach
    Phong Minh Vu
    Tam The Nguyen
    Hung Viet Pham
    Tung Thanh Nguyen
    2015 30TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE), 2015, : 749 - 759