Automatic Classification of Online Doctor Reviews: Evaluation of Text Classifier Algorithms

被引:15
|
作者
Rivas, Ryan [1 ]
Montazeri, Niloofar [1 ]
Le, Nhat X. T. [1 ]
Hristidis, Vagelis [1 ]
机构
[1] Univ Calif Riverside, Dept Comp Sci & Engn, 363 Winston Chung Hall,900 Univ Ave, Riverside, CA 92521 USA
基金
美国国家科学基金会;
关键词
patient satisfaction; patient reported outcome measures; quality indicators; health care; supervised machine learning; CHINA;
D O I
10.2196/11141
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Background: An increasing number of doctor reviews are being generated by patients on the internet. These reviews address a diverse set of topics (features), including wait time, office staff, doctor's skills, and bedside manners. Most previous work on automatic analysis of Web-based customer reviews assumes that (1) product features are described unambiguously by a small number of keywords, for example, battery for phones and (2) the opinion for each feature has a positive or negative sentiment. However, in the domain of doctor reviews, this setting is too restrictive: a feature such as visit duration for doctor reviews may be expressed in many ways and does not necessarily have a positive or negative sentiment. Objective: This study aimed to adapt existing and propose novel text classification methods on the domain of doctor reviews. These methods are evaluated on their accuracy to classify a diverse set of doctor review features. Methods: We first manually examined a large number of reviews to extract a set of features that are frequently mentioned in the reviews. Then we proposed a new algorithm that goes beyond bag-of-words or deep learning classification techniques by leveraging natural language processing (NLP) tools. Specifically, our algorithm automatically extracts dependency tree patterns and uses them to classify review sentences. Results: We evaluated several state-of-the-art text classification algorithms as well as our dependency tree-based classifier algorithm on a real-world doctor review dataset. We showed that methods using deep learning or NLP techniques tend to outperform traditional bag-of-words methods. In our experiments, the 2 best methods used NLP techniques; on average, our proposed classifier performed 2.19% better than an existing NLP-based method, but many of its predictions of specific opinions were incorrect. Conclusions: We conclude that it is feasible to classify doctor reviews. Automatically classifying these reviews would allow patients to easily search for doctors based on their personal preference criteria.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Automatic Text Classification using Modified Centroid Classifier
    Elmarhumy, Mahmoud
    Fattah, Mohamed Abdel
    Ren, Fuji
    [J]. IEEE NLP-KE 2009: PROCEEDINGS OF INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING, 2009, : 282 - +
  • [2] Automatic text classification to support systematic reviews in medicine
    Garcia Adeva, J. J.
    Pikatza Atxa, J. M.
    Ubeda Carrillo, M.
    Ansuategi Zengotitabengoa, E.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (04) : 1498 - 1508
  • [3] Components of game experience: An automatic text analysis of online reviews
    Wang, Xiaohui
    Goh, Dion Hoe-Lian
    [J]. ENTERTAINMENT COMPUTING, 2020, 33
  • [4] A new modified centroid classifier approach for automatic text classification
    Elmarhoumy, Mahmoud
    Fattah, Mohamed Abdel
    Suzuki, Motoyuki
    Ren, Fuji
    [J]. IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2013, 8 (04) : 364 - 370
  • [5] Text feature selection for sentiment classification of Chinese online reviews
    Wang, Hongwei
    Yin, Pei
    Yao, Jiani
    Liu, James N. K.
    [J]. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE, 2013, 25 (04) : 425 - 439
  • [6] Data and text mining from online reviews: An automatic literature analysis
    Moro, Sergio
    Rita, Paulo
    [J]. WILEY INTERDISCIPLINARY REVIEWS-DATA MINING AND KNOWLEDGE DISCOVERY, 2022, 12 (03)
  • [7] Sentiment Analysis in Online Reviews Classification using Text Mining Techniques
    Agueda, M.
    Rita, P.
    Guerreiro, P.
    [J]. 2019 14TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2019,
  • [8] A Text Classification Based Method for Context Extraction from Online Reviews
    Zahra Lahlou, Fatima
    Mountassir, Asmaa
    Benbrahim, Houda
    Kassou, Ismail
    [J]. 2013 8TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS: THEORIES AND APPLICATIONS (SITA), 2013,
  • [9] Automatic text classification using machine learning and optimization algorithms
    R. Janani
    S. Vijayarani
    [J]. Soft Computing, 2021, 25 : 1129 - 1145
  • [10] Automatic text classification using machine learning and optimization algorithms
    Janani, R.
    Vijayarani, S.
    [J]. SOFT COMPUTING, 2021, 25 (02) : 1129 - 1145