Predicting Clinical Events Based on Raw Text: From Bag-of-Words to Attention-Based Transformers

Cited by: 1
Authors
Roussinov, Dmitri [1]
Conkie, Andrew [2]
Patterson, Andrew [1]
Sainsbury, Christopher [3]
Affiliations
[1] Univ Strathclyde, Dept Comp & Informat Sci, Glasgow, Scotland
[2] Red Star Consulting, Glasgow, Scotland
[3] NHS Greater Glasgow & Clyde, Glasgow, Scotland
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
discharge summaries; BERT; clinical event prediction; pre-trained language models; transformers; deep learning; RISK;
DOI
10.3389/fdgth.2021.810260
Chinese Library Classification (CLC) number
R19 [Health care organization and services (health services administration)]
Abstract
Identifying which patients are at higher risk of dying or being re-admitted can save both resources and lives, and is therefore an important and challenging task for healthcare text analytics. While many successful approaches exist for predicting such clinical events from categorical and numerical variables, a large share of health records exists as raw text, such as clinical notes or discharge summaries. However, the text-analytics models applied to the free-form natural language in those notes lag behind the breakthroughs in other domains and remain primarily based on older bag-of-words technologies. As a result, they rarely reach the level of accuracy acceptable to clinicians. Despite their success in other domains, the superiority of deep neural approaches over classical bag-of-words models for this task has not yet been convincingly demonstrated. Also, while some successful experiments have been reported, the most recent breakthroughs due to pre-trained language models have not yet made their way into the medical domain. Using a publicly available healthcare dataset, we explored several classification models to predict patients' re-admission or death from their discharge summaries and established the following. 1) The performance of the neural models used in our experiments convincingly exceeds that of the bag-of-words models by several percentage points as measured by the standard metrics. 2) This allows us to achieve the accuracy typically accepted by clinicians as being of practical use (area under the ROC curve above 0.70) for the majority of our prediction targets. 3) While the pre-trained attention-based transformer performed only on par with the model that averages word embeddings when applied to full-length discharge summaries, the transformer still handles shorter text segments substantially better, at times by a margin of 0.04 in the area under the ROC curve.
Thus, our findings extend the success of pre-trained language models reported in other domains to the task of clinical event prediction, and likely to other text-classification tasks in the healthcare analytics domain. 4) We suggest several models to overcome the transformers' major drawback (their input size limitation) and confirm that this is crucial to achieving their top performance. Our modifications are domain-agnostic and can thus be applied in other applications where the text inputs exceed 200 words. 5) We have successfully demonstrated how non-text attributes (such as patient age, demographics, type of admission, etc.) can be combined with text to gain additional improvements for several prediction targets. We include extensive ablation studies showing the impact of the training size and highlighting the tradeoffs between performance and the resources needed.
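The abstract does not describe how the input size limitation is overcome, so the following is only a minimal sketch of one common workaround, not the authors' models: split a long document into segments of at most 200 words, score each segment independently, and aggregate the segment scores. The names `split_into_segments`, `score_document`, and `score_segment` are hypothetical, and the 200-word segment length is an assumption borrowed from the figure quoted above.

```python
from statistics import mean
from typing import Callable, Iterable, List

# Assumed segment length, borrowed from the "200 words" figure in the abstract.
MAX_WORDS = 200


def split_into_segments(text: str, max_words: int = MAX_WORDS) -> List[str]:
    """Split text on whitespace into consecutive segments of at most max_words words."""
    words = text.split()
    return [" ".join(words[i:i + max_words]) for i in range(0, len(words), max_words)]


def score_document(
    text: str,
    score_segment: Callable[[str], float],
    aggregate: Callable[[Iterable[float]], float] = mean,
) -> float:
    """Score each segment with score_segment and combine the scores.

    score_segment is a hypothetical stand-in for any model that maps a short
    text to a risk score (for example, a fine-tuned transformer classifier).
    """
    segments = split_into_segments(text)
    if not segments:
        raise ValueError("document contains no words")
    return aggregate([score_segment(segment) for segment in segments])
```

With a toy scorer such as `lambda s: 1.0 if "sepsis" in s else 0.0`, passing `aggregate=max` flags a document if any segment is flagged, while the default `mean` reports the fraction of flagged segments. Which aggregation works best is an empirical question that this sketch does not answer.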
Pages: 11