An Automatic Analysis of User Reviews Method for APP Evolution and Maintenance

被引：0

作者：

Xiao J.-M. ^{[1
,2
]}

Chen S.-Z. ^{[1
,2
]}

Feng Z.-Y. ^{[1
,2
]}

Liu P.-L. ^{[1
,2
]}

Xue X. ^{[1
,2
]}

机构：

[1] Tianjin Key Laboratory of Cognitive Computing and Application, Tianjin

[2] College of Intelligence and Computing, Tianjin University, Tianjin

来源：

Jisuanji Xuebao/Chinese Journal of Computers | 2020年 / 43卷 / 11期

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

Intent classification; Maintenance and evolution; Opinion recommendations; Sentiment analysis; User reviews;

D O I：

10.11897/SP.J.1016.2020.02184

中图分类号：

学科分类号：

摘要：

Application distribution platforms such as Google Play Store or Apple App Store allow users to submit feedbacks to download applications in the form of ratings or reviews. These feedbacks can directly or indirectly reflect users' intention, and it can greatly help mobile developers (or app provider) to continuously maintain and improve their applications, such as fix the existing bugs, add or refining the app features, etc. and so as to better satisfying user expectations continuously. App reviews provide an opportunity to proactively collect user complaints and promptly improve apps' user experience, in terms of bug fixing and feature refinement. However, for many popular applications, since the large amount of user review data, unstructured review data, and inconsistent review quality, identifying the valuable review information becomes a challenging task. Therefore, classification of user reviews into specific topics and automated analysis to reduce the workload of manual analysis has become a new idea for app review mining analysis. In this paper, we propose a method named ARICA(Automatic Review Intention Classification Analysis) to automatically analyze crowd user reviews to efficiently provide developers with software maintenance and evolution suggestions. Firstly, ARICA classifies the reviews into different categories according to the user's feedbacks, and then uses the LDA topic model to classify the reviews under each user's intent category. This allows a preliminary screening of user reviews to obtain review information under each intent category. Secondly, ARICA clusters user views with similar semantic expressions under each review topic to further filter the redundant information in reviews, so that can easier and intuitive to understand the user's original feedback and capture the user's true intention more accurately. Afterwards, ARICA uses the sentiment analysis tool called SentiStrength to obtain user sentiment, and then analyzes the sentiment distribution of user reviews to identify the user's significant intentions. Finally, the multidimensional information such as user intentions and sentiment preferences are considered comprehensively for calculating the review score and then ARICA prioritizes reviews for realizing the opinions recommendation for the developers. We use real app review data from Google Play to verify the performance of review intent classification and sentence clustering of ARICA. The experimental results show that ARICA has precision of 80% in the process of user review intention classification, compared with the state-of-the-art existing automatic user intentions mining method TextCNN which based on Convolutional Neural Networks (CNN), the F-Measure of ARICA is improved by 19.1%. Meanwhile, ARICA achieves 86% of the precision during the clustering of review sentences, which provides effective support for subsequent developers to recommend app update tasks. Further, we use the official app changelog as a ground truth, and empirically analyzed whether our recommended user reviews can be truly adopted by developers, the results show that ARICA can efficiently help developers better understand the user's real requirements, which is of great significance for developers to perform subsequent app maintenance and evolution tasks. In addition, we also publish the original data set, the manually labeled data set and the source code of ARICA on github which provide materials for other relevant researchers. © 2020, Science Press. All right reserved.

引用

页码：2184 / 2202

页数：18

共 39 条

[1] Inukollu V N, Keshamoni D D, Kang T, Et al., Factors influencing quality of mobile apps: Role of mobile app development life cycle, International Journal of Software Engineering&Applications, 5, 5, pp. 15-34, (2014)
[2] Guzman E, Maalej W, Et al., How do users like this feature? A fine grained sentiment analysis of app reviews, Proceedings of the 2014 IEEE International Requirements Engineering Conference, pp. 153-162, (2014)
[3] Pagano D, Maalej W, Et al., User feedback in the AppStore: An empirical study, Proceedings of the 2014 IEEE International Requirements Engineering Conference, pp. 125-134, (2013)
[4] Wang Zhong-Yuan, Cheng Jian-Peng, Wang Hai-Xun, Et al., Short text understanding: A survey, Journal of Computer Research and Development, 53, 2, pp. 262-269, (2016)
[5] Yang H, Liang P., Identification and classification of requirements from app user reviews, Proceedings of the 2015 International Conference on Software Engineering, pp. 7-12, (2015)
[6] Chen N, Lin J, Hoi S C H, Et al., AR-miner: Mining informative reviews for developers from mobile app marketplace, Proceedings of the 2015 International Conference on Software Engineering, pp. 767-778, (2014)
[7] Maalej W, Nabil H., Bug report, feature request, or simply praise? On automatically classifying app reviews, Proceedings of the 2015 IEEE International Requirements Engineering Conference, pp. 116-125, (2015)
[8] Panichella S, Di Sorbo A, Guzman E, Et al., How can i improve my app? Classifying user reviews for software maintenance and evolution, Proceedings of the 2015 IEEE International Conference on Software Maintenance and Evolution, pp. 281-290, (2015)
[9] Guzman E, Elhalaby M, Bruegge B, Et al., Ensemble methods for app review classification: An approach for software evolution, Proceedings of the 2016 IEEE/ACM International Conference on Automated Software Engineering, pp. 771-776, (2016)
[10] Guzman E, Aly O, Bruegge B, Et al., Retrieving diverse opinions from app reviews, Proceedings of the 2015 International Symposium on Empirical Software Engineering and Measurement, pp. 1-10, (2015)

← 1 2 3 4 →