Enhanced ICD-10 code assignment of clinical texts: A summarization-based approach

被引:0
|
作者
Sun, Yaoqian [1 ]
Sang, Lei [2 ]
Wu, Dan [1 ]
He, Shilin [2 ]
Chen, Yani [1 ]
Duan, Huilong [1 ]
Chen, Han [2 ]
Lu, Xudong [1 ]
机构
[1] Zhejiang Univ, Coll Biomed Engn & Instrument Sci, Zheda Rd, Hangzhou 310027, Zhejiang Provin, Peoples R China
[2] Chinese Peoples Liberat Army Gen Hosp, Dept Informat, Hainan Hosp, Sanya 572013, Hainan Province, Peoples R China
基金
中国国家自然科学基金;
关键词
Automated ICD-10 code assignment; Text summarization; Text matching; Natural Language Processing;
D O I
10.1016/j.artmed.2024.102967
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Background: Assigning International Classification of Diseases (ICD) codes to clinical texts is a common and crucial practice in patient classification, hospital management, and further statistics analysis. Current autocoding methods mainly transfer this task to a multi-label classification problem. Such solutions are suffering from high-dimensional mapping space and excessive redundant information in long clinical texts. To alleviate such a situation, we introduce text summarization methods to the ICD coding regime and apply text matching to select ICD codes. Method: We focus on the tenth revision of the ICD (ICD-10) coding and design a novel summarization-based approach (SuM) with an end-to-end strategy to efficiently assign ICD-10 code to clinical texts. In this approach, a knowledge-guided pointer network is purposed to distill and summarize key information in clinical texts precisely. Then a matching model with matching-aggregation architecture follows to align the summary result with code, tuning the one-vs-all scenario to one-vs-one matching so that the large-label-space obstacle laid in classification approaches would be avoided. Result: The 12,788 ICD-10 coded discharge summaries from a Chinese hospital were collected to evaluate the proposed approach. Compared with existing methods, the purposed model achieves the greatest coding results with Micro AUC of 0.9548, MRR@10 of 0.7977, Precision@10 of 0.0944, and Recall@10 of 0.9439 for the TOP50 Dataset. Results on the FULL-Dataset remain consistent. Also, the proposed knowledge encoder and applied end-to-end strategy are proven to facilitate the whole model to gain efficacy in selecting the most suitable code. Conclusion: The proposed automatic ICD-10 code assignment approach via text summarization can effectively capture critical messages in long clinical texts and improve the performance of ICD-10 coding of clinical texts.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Interpretable deep learning to map diagnostic texts to ICD-10 codes
    Atutxa, Aitziber
    Diaz de Ilarraza, Arantza
    Gojenola, Koldo
    Oronoz, Maite
    Perez-de-Vinaspre, Olatz
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2019, 129 : 49 - 59
  • [22] A Semi-Automated Approach for Multilingual Terminology Matching: Mapping the French Version of the ICD-10 to the ICD-10 CM
    Sylvestre, Emmanuelle
    Bouzille, Guillaume
    Mcduffie, Michael
    Chazard, Emmanuel
    Avillach, Paul
    Cuggia, Marc
    DIGITAL PERSONALIZED HEALTH AND MEDICINE, 2020, 270 : 18 - 22
  • [23] An Automated ICD-10 Code Assigning System using A Classification Method
    Singto, Chanida
    Wongwirat, Olarn
    13TH BIOMEDICAL ENGINEERING INTERNATIONAL CONFERENCE (BMEICON 2021), 2018,
  • [24] Social Determinants of Health ICD-10 Code Use in Inpatient Pediatrics
    McQuistion, Kaitlyn
    Stokes, Stacey
    Allard, Bridget
    Bhansali, Priti
    Davidson, Amber
    Hall, Matthew
    Magyar, Matthew
    Parikh, Kavita
    PEDIATRICS, 2023, 152 (02)
  • [25] DERIVATION OF ICD-10 CODE LISTS FOR NOVEL PATIENT SAFETY INDICATORS
    Ghali, William A.
    Southern, Danielle
    Quan, Hude
    JOURNAL OF GENERAL INTERNAL MEDICINE, 2014, 29 : S65 - S65
  • [26] ANALYZING GENERATIVE AI PROFICIENCY FOR ICD-10 CODE GENERATION AND INTERPRETATION
    Winberg, D.
    Xuan, D.
    Tang, T.
    Olin, S.
    Rong, R.
    Tang, M.
    Shi, L.
    VALUE IN HEALTH, 2024, 27 (12)
  • [27] ICD-10 Based Expanded Code Set for Use in Cleft Lip/Palate Research and Surveillance
    Allori, Alexander C.
    Cragan, Janet D.
    Cassell, Cynthia H.
    Marcus, Jeffrey R.
    BIRTH DEFECTS RESEARCH PART A-CLINICAL AND MOLECULAR TERATOLOGY, 2016, 106 (11) : 905 - 914
  • [28] Variability in clinical diagnoses during the ICD-8 and ICD-10 era
    Nordgaard, Julie
    Jessen, Kasper
    Saebye, Ditte
    Parnas, Josef
    SOCIAL PSYCHIATRY AND PSYCHIATRIC EPIDEMIOLOGY, 2016, 51 (09) : 1293 - 1299
  • [29] Why Sepsis Survivors Need an ICD-10 Code for "Sepsis Aftercare"
    Oh, Sungho
    Mikkelsen, Mark E.
    O'Connor, Melissa
    Bowles, Kathryn H.
    CHEST, 2022, 162 (05) : 979 - 981
  • [30] Diagnostic Accuracy of the ICD-10 Code for Lichen Sclerosus in Black Women
    Akinshemoyin Vaughn, Olushola L.
    Anyanwu, Precious A.
    Estill, Mariah C.
    JOURNAL OF LOWER GENITAL TRACT DISEASE, 2024, 28 (04) : 377 - 378