KLOSURE: Closing in on open-ended patient questionnaires with text mining

Cited by: 7
Authors
Spasic, Irena [1]
Owen, David [1]
Smith, Andrew [2]
Button, Kate [3]
Affiliations
[1] Cardiff Univ, Sch Comp Sci & Informat, Cardiff, S Glam, Wales
[2] Cardiff Univ, Sch Psychol, Cardiff, S Glam, Wales
[3] Cardiff Univ, Sch Healthcare Sci, Cardiff, S Glam, Wales
Funding
Wellcome Trust (UK); Engineering and Physical Sciences Research Council (UK)
Keywords
Text mining; Natural language processing; Text classification; Named entity recognition; Sentiment analysis; Patient reported outcome measure; Open-ended questionnaire; Biomedical text; UMLS
DOI
10.1186/s13326-019-0215-3
Chinese Library Classification number
Q [Biological sciences]
Discipline classification codes
07; 0710; 09
Abstract
Background: The Knee injury and Osteoarthritis Outcome Score (KOOS) is an instrument used to quantify patients' perceptions of their knee condition and associated problems. It is administered as a 42-item closed-ended questionnaire in which patients self-assess five outcomes: pain, other symptoms, activities of daily living, sport and recreation activities, and quality of life. We developed KLOG as a 10-item open-ended version of the KOOS questionnaire in an attempt to obtain deeper insight into patients' opinions, including their unmet needs. However, the open-ended nature of the questionnaire incurs analytical overhead associated with the interpretation of responses. The goal of this study was to automate such analysis. We implemented KLOSURE as a system for mining free-text responses to the KLOG questionnaire. It consists of two subsystems, one concerned with feature extraction and the other with classification of feature vectors. Feature extraction is performed by a set of four modules whose main functionalities are linguistic pre-processing, sentiment analysis, named entity recognition and lexicon lookup, respectively. The outputs produced by each module are combined into feature vectors, whose structure varies across the KLOG questions. Finally, Weka, a machine learning workbench, was used for classification of feature vectors.

Results: The precision of the system varied between 62.8% and 95.3%, whereas the recall varied between 58.3% and 87.6% across the 10 questions. The overall performance in terms of F-measure varied between 59.0% and 91.3%, with an average of 74.4% and a standard deviation of 8.8.

Conclusions: We demonstrated the feasibility of mining open-ended patient questionnaires. By automatically mapping free-text answers onto a Likert scale, we can effectively measure the progress of rehabilitation over time. In comparison to traditional closed-ended questionnaires, our approach offers much richer information that can be utilised to support clinical decision making. In conclusion, we demonstrated how text mining can be used to combine the benefits of qualitative and quantitative analysis of patient experiences.
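
The feature extraction stage described in the abstract can be pictured with a small sketch. This is an illustration only, not the KLOSURE implementation: the word lists, the feature names, the question identifiers `Q1` and `Q2`, and the mapping of modules to questions are all hypothetical, and each module is reduced to a toy rule. It shows only how outputs of separate modules might be merged into a feature vector whose structure differs from question to question. Classification (done with Weka in the paper) is left out.

```python
import re
from typing import Callable, Dict, List

# Toy word lists: hypothetical stand-ins, not the resources used by KLOSURE.
POSITIVE = {"better", "good", "improved", "fine"}
NEGATIVE = {"pain", "painful", "worse", "stiff", "swollen"}
ACTIVITIES = {"walking", "running", "stairs", "kneeling", "squatting"}
NEGATION_CUES = {"no", "not", "never"}


def preprocess(text: str) -> List[str]:
    """Linguistic pre-processing, reduced here to lowercasing and tokenisation."""
    return re.findall(r"[a-z]+", text.lower())


def sentiment(tokens: List[str]) -> Dict[str, float]:
    """Crude polarity score: positive word count minus negative word count."""
    score = sum(t in POSITIVE for t in tokens) - sum(t in NEGATIVE for t in tokens)
    return {"sentiment": float(score)}


def entities(tokens: List[str]) -> Dict[str, float]:
    """Stand-in for named entity recognition: count mentions of activities."""
    return {"n_activities": float(sum(t in ACTIVITIES for t in tokens))}


def lexicon(tokens: List[str]) -> Dict[str, float]:
    """Lexicon lookup: flag whether any negation cue is present."""
    return {"has_negation": float(any(t in NEGATION_CUES for t in tokens))}


Module = Callable[[List[str]], Dict[str, float]]

# Hypothetical mapping: which modules contribute features for which question.
MODULES_BY_QUESTION: Dict[str, List[Module]] = {
    "Q1": [sentiment, lexicon],
    "Q2": [sentiment, entities, lexicon],
}


def feature_vector(question: str, answer: str) -> Dict[str, float]:
    """Combine the outputs of the modules configured for one question."""
    tokens = preprocess(answer)
    features: Dict[str, float] = {}
    for module in MODULES_BY_QUESTION[question]:
        features.update(module(tokens))
    return features


if __name__ == "__main__":
    print(feature_vector("Q2", "Walking is fine but stairs are still painful"))
```

In a full system, each such vector would be passed to a trained classifier that assigns the answer to a point on a Likert scale.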
Pages: 11