Enhanced ICD-10 code assignment of clinical texts: A summarization-based approach

被引:0
|
作者
Sun, Yaoqian [1 ]
Sang, Lei [2 ]
Wu, Dan [1 ]
He, Shilin [2 ]
Chen, Yani [1 ]
Duan, Huilong [1 ]
Chen, Han [2 ]
Lu, Xudong [1 ]
机构
[1] Zhejiang Univ, Coll Biomed Engn & Instrument Sci, Zheda Rd, Hangzhou 310027, Zhejiang Provin, Peoples R China
[2] Chinese Peoples Liberat Army Gen Hosp, Dept Informat, Hainan Hosp, Sanya 572013, Hainan Province, Peoples R China
基金
中国国家自然科学基金;
关键词
Automated ICD-10 code assignment; Text summarization; Text matching; Natural Language Processing;
D O I
10.1016/j.artmed.2024.102967
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Background: Assigning International Classification of Diseases (ICD) codes to clinical texts is a common and crucial practice in patient classification, hospital management, and further statistics analysis. Current autocoding methods mainly transfer this task to a multi-label classification problem. Such solutions are suffering from high-dimensional mapping space and excessive redundant information in long clinical texts. To alleviate such a situation, we introduce text summarization methods to the ICD coding regime and apply text matching to select ICD codes. Method: We focus on the tenth revision of the ICD (ICD-10) coding and design a novel summarization-based approach (SuM) with an end-to-end strategy to efficiently assign ICD-10 code to clinical texts. In this approach, a knowledge-guided pointer network is purposed to distill and summarize key information in clinical texts precisely. Then a matching model with matching-aggregation architecture follows to align the summary result with code, tuning the one-vs-all scenario to one-vs-one matching so that the large-label-space obstacle laid in classification approaches would be avoided. Result: The 12,788 ICD-10 coded discharge summaries from a Chinese hospital were collected to evaluate the proposed approach. Compared with existing methods, the purposed model achieves the greatest coding results with Micro AUC of 0.9548, MRR@10 of 0.7977, Precision@10 of 0.0944, and Recall@10 of 0.9439 for the TOP50 Dataset. Results on the FULL-Dataset remain consistent. Also, the proposed knowledge encoder and applied end-to-end strategy are proven to facilitate the whole model to gain efficacy in selecting the most suitable code. Conclusion: The proposed automatic ICD-10 code assignment approach via text summarization can effectively capture critical messages in long clinical texts and improve the performance of ICD-10 coding of clinical texts.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Automatic ICD-10 Code Association: A Challenging Task on French Clinical Texts
    Tchouka, Yakini
    Couchot, Jean-Francois
    Laiymani, David
    Selles, Philippe
    Rahrnani, Azzedine
    2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 91 - 96
  • [2] Automatic Assignment of ICD-10 Codes to Diagnostic Texts using Transformers Based Techniques
    Popescu, Mihai Horia
    Roitero, Kevin
    Travasci, Stefano
    Della Mea, Vincenzo
    2021 IEEE 9TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2021), 2021, : 188 - 192
  • [3] Is There an ICD-10 Code for Hyperbole?
    Topol, Eric J.
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2017, 317 (01): : 10 - 11
  • [4] ICD-10: there's a code for that
    不详
    LANCET, 2015, 386 (10002): : 1420 - 1420
  • [5] Moments without an ICD-10 code
    Maurer, Brian T.
    JAAPA-JOURNAL OF THE AMERICAN ACADEMY OF PHYSICIAN ASSISTANTS, 2016, 29 (08): : 58 - 58
  • [6] Welcome to the ICD-10 code for sarcopenia
    Anker, Stefan D.
    Morley, John E.
    von Haehling, Stephan
    JOURNAL OF CACHEXIA SARCOPENIA AND MUSCLE, 2016, 7 (05) : 512 - 514
  • [7] Impact of the US ICD-9 to ICD-10 Code Transition on Clinical Research
    Kamauu, Aaron W. C.
    Lohnes, Maggie
    Paredes, Kyle P.
    DuVall, Scott
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2016, 25 : 611 - 612
  • [8] Incorrect ICD-10 Code and MACE Endpoint
    Andersson, Charlotte
    JAMA INTERNAL MEDICINE, 2015, 175 (06) : E1 - E1
  • [9] Web-based field studies on diagnostic classification and code assignment of mental disorders: comparison of ICD-11 and ICD-10
    Gaebel, Wolfgang
    Riesbeck, Mathias
    Zielasek, Juergen
    Kerst, Ariane
    Meisenzahl-Lechner, Eva
    Koellner, Volker
    Rose, Matthias
    Hofmann, Tobias
    Schaefer, Ingo
    Lotzin, Annett
    Briken, Peer
    Klein, Verena
    Brunner, Franziska
    Keeley, Jared W.
    Rebello, Tahilia J.
    Andrews, Howard F.
    Reed, Geoffrey M.
    Kostanjsek, Nenad F. I.
    Hasan, Alkomiet
    Russek, Pamina
    Falkai, Peter
    FORTSCHRITTE DER NEUROLOGIE PSYCHIATRIE, 2018, 86 (03) : 163 - 171
  • [10] A keyphrase-based approach for interpretable ICD-10 code classification of Spanish medical reports
    Duque, Andres
    Fabregat, Hermenegildo
    Araujo, Lourdes
    Martinez-Romo, Juan
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2021, 121