Automatic Classification of Online Doctor Reviews: Evaluation of Text Classifier Algorithms

被引：15

作者：

Rivas, Ryan ^{[1
]}

Montazeri, Niloofar ^{[1
]}

Le, Nhat X. T. ^{[1
]}

Hristidis, Vagelis ^{[1
]}

机构：

[1] Univ Calif Riverside, Dept Comp Sci & Engn, 363 Winston Chung Hall,900 Univ Ave, Riverside, CA 92521 USA

来源：

JOURNAL OF MEDICAL INTERNET RESEARCH | 2018年 / 20卷 / 11期

基金：

美国国家科学基金会;

关键词：

patient satisfaction; patient reported outcome measures; quality indicators; health care; supervised machine learning; CHINA;

D O I：

10.2196/11141

中图分类号：

R19 [保健组织与事业（卫生事业管理）];

学科分类号：

摘要：

Background: An increasing number of doctor reviews are being generated by patients on the internet. These reviews address a diverse set of topics (features), including wait time, office staff, doctor's skills, and bedside manners. Most previous work on automatic analysis of Web-based customer reviews assumes that (1) product features are described unambiguously by a small number of keywords, for example, battery for phones and (2) the opinion for each feature has a positive or negative sentiment. However, in the domain of doctor reviews, this setting is too restrictive: a feature such as visit duration for doctor reviews may be expressed in many ways and does not necessarily have a positive or negative sentiment. Objective: This study aimed to adapt existing and propose novel text classification methods on the domain of doctor reviews. These methods are evaluated on their accuracy to classify a diverse set of doctor review features. Methods: We first manually examined a large number of reviews to extract a set of features that are frequently mentioned in the reviews. Then we proposed a new algorithm that goes beyond bag-of-words or deep learning classification techniques by leveraging natural language processing (NLP) tools. Specifically, our algorithm automatically extracts dependency tree patterns and uses them to classify review sentences. Results: We evaluated several state-of-the-art text classification algorithms as well as our dependency tree-based classifier algorithm on a real-world doctor review dataset. We showed that methods using deep learning or NLP techniques tend to outperform traditional bag-of-words methods. In our experiments, the 2 best methods used NLP techniques; on average, our proposed classifier performed 2.19% better than an existing NLP-based method, but many of its predictions of specific opinions were incorrect. Conclusions: We conclude that it is feasible to classify doctor reviews. Automatically classifying these reviews would allow patients to easily search for doctors based on their personal preference criteria.

引用

页数：14

共 50 条

[1] Automatic Text Classification using Modified Centroid Classifier
Elmarhumy, Mahmoud
Fattah, Mohamed Abdel
Ren, Fuji
[J]. IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 282 - +
[2] Automatic text classification to support systematic reviews in medicine
Garcia Adeva, J. J.
Pikatza Atxa, J. M.
Ubeda Carrillo, M.
Ansuategi Zengotitabengoa, E.
[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (04) : 1498 - 1508
[3] Components of game experience: An automatic text analysis of online reviews
Wang, Xiaohui
Goh, Dion Hoe-Lian
[J]. ENTERTAINMENT COMPUTING, 2020, 33
[4] A new modified centroid classifier approach for automatic text classification
Elmarhoumy, Mahmoud
Fattah, Mohamed Abdel
Suzuki, Motoyuki
Ren, Fuji
[J]. IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2013, 8 (04) : 364 - 370
[5] Text feature selection for sentiment classification of Chinese online reviews
Wang, Hongwei
Yin, Pei
Yao, Jiani
Liu, James N. K.
[J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2013, 25 (04) : 425 - 439
[6] Data and text mining from online reviews: An automatic literature analysis
Moro, Sergio
Rita, Paulo
[J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 12 (03)
[7] Sentiment Analysis in Online Reviews Classification using Text Mining Techniques
Agueda, M.
Rita, P.
Guerreiro, P.
[J]. 2019 14TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2019,
[8] A Text Classification Based Method for Context Extraction from Online Reviews
Zahra Lahlou, Fatima
Mountassir, Asmaa
Benbrahim, Houda
Kassou, Ismail
[J]. 2013 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS: THEORIES AND APPLICATIONS (SITA), 2013,
[9] Automatic text classification using machine learning and optimization algorithms
R. Janani
S. Vijayarani
[J]. Soft Computing, 2021, 25 : 1129 - 1145
[10] Automatic text classification using machine learning and optimization algorithms
Janani, R.
Vijayarani, S.
[J]. SOFT COMPUTING, 2021, 25 (02) : 1129 - 1145

← 1 2 3 4 5 →