Machine Learning Approaches on Diagnostic Term Encoding With the ICD for Clinical Documentation

被引:13
|
作者
Atutxa, Aitziber [1 ]
Perez, Alicia [1 ]
Casillas, Arantza [1 ]
机构
[1] Univ Basque Country, IXA Res Grp, UPV EHU, Donostia San Sebastian 20080, Spain
关键词
Clinical text mining; International Classification of Diseases; machine learning; natural language processing;
D O I
10.1109/JBHI.2017.2743824
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work focuses on data mining applied to the clinical documentation domain. Diagnostic terms (DTs) are used as keywords to retrieve valuable information from electronic health records. Indeed, they are encoded manually by experts following the International Classification of Diseases (ICD). The goal of this work is to explore the aid of text mining on DT encoding. From the machine learning (ML) perspective, this is a high-dimensional classification task, as it comprises thousands of codes. This work delves into a robust representation of the instances to improve ML results. The proposed system is able to find the right ICD code among more than 1500 possible ICD codes with 92% precision for the main disease (primary class) and 88% for the main disease together with the nonessential modifiers (fully specified class). The methodology employed is simple and portable. According to the experts from public hospitals, the system is very useful in particular for documentation and pharmacosurveillance services. In fact, they reported an accuracy of 91.2% on a small randomly extracted test. Hence, together with this paper, we made the software publicly available in order to help the clinical and research community.
引用
收藏
页码:1323 / 1329
页数:7
相关论文
共 50 条
  • [1] Developing a machine learning model to detect diagnostic uncertainty in clinical documentation
    Marshall, Trisha L.
    Nickels, Lindsay C.
    Brady, Patrick W.
    Edgerton, Ezra J.
    Lee, James J.
    Hagedorn, Philip A.
    [J]. JOURNAL OF HOSPITAL MEDICINE, 2023, 18 (05) : 405 - 412
  • [2] Machine Learning Approaches for Clinical Psychology and Psychiatry
    Dwyer, Dominic B.
    Falkai, Peter
    Koutsouleris, Nikolaos
    [J]. ANNUAL REVIEW OF CLINICAL PSYCHOLOGY, VOL 14, 2018, 14 : 91 - 118
  • [3] Grading Documentation with Machine Learning
    Messer, Marcus
    Shi, Miaojing
    Brown, Neil C. C.
    Kolling, Michael
    [J]. ARTIFICIAL INTELLIGENCE IN EDUCATION, PT I, AIED 2024, 2024, 14829 : 105 - 117
  • [4] Documentation of Machine Learning Software
    Hashemi, Yalda
    Nayebi, Maleknaz
    Antoniol, Giuliano
    [J]. PROCEEDINGS OF THE 2020 IEEE 27TH INTERNATIONAL CONFERENCE ON SOFTWARE ANALYSIS, EVOLUTION, AND REENGINEERING (SANER '20), 2020, : 666 - 667
  • [5] Development and Validation of Clinical Diagnostic Model for Girls with Central Precocious Puberty: Machine-learning Approaches
    Quynh Thi Vu Huynh
    Nguyen Quoc Khanh Le
    Huang, Shih-Yi
    Ban Tran Ho
    Tru Huy Vu
    Hong Thi Minh Pham
    An Le Pham
    Hou, Jia-Woei
    Ngan Thi Kim Nguyen
    Chen, Yang Ching
    [J]. PLOS ONE, 2022, 17 (01):
  • [6] NOTESENSE: DEVELOPMENT OF A MACHINE LEARNING ALGORITHM FOR FEEDBACK ON CLINICAL REASONING DOCUMENTATION
    Schaye, Verity
    Guzman, Benedict
    Rafel, Jesse Burk
    Kudlowitz, David
    Reinstein, Ilan
    Miller, Louis
    Cocks, Patrick
    Chun, Jonathan
    Aphinyanaphongs, Yin
    Marin, Marina
    [J]. JOURNAL OF GENERAL INTERNAL MEDICINE, 2021, 36 (SUPPL 1) : S110 - S110
  • [7] Metabolomics and machine learning approaches for diagnostic and prognostic biomarkers screening in sepsis
    She, Han
    Du, Yuanlin
    Du, Yunxia
    Tan, Lei
    Yang, Shunxin
    Luo, Xi
    Li, Qinghui
    Xiang, Xinming
    Lu, Haibin
    Hu, Yi
    Liu, Liangming
    Li, Tao
    [J]. BMC ANESTHESIOLOGY, 2023, 23 (01)
  • [8] Metabolomics and machine learning approaches for diagnostic and prognostic biomarkers screening in sepsis
    Han She
    Yuanlin Du
    Yunxia Du
    Lei Tan
    Shunxin Yang
    Xi Luo
    Qinghui Li
    Xinming Xiang
    Haibin Lu
    Yi Hu
    Liangming Liu
    Tao Li
    [J]. BMC Anesthesiology, 23
  • [9] Psychometric and Machine Learning Approaches for Diagnostic Assessment and Tests of Individual Classification
    Gonzalez, Oscar
    [J]. PSYCHOLOGICAL METHODS, 2021, 26 (02) : 236 - 254
  • [10] Multimodal Machine Learning for Automated ICD Coding
    Xu, Keyang
    Lam, Mike
    Pang, Jingzhi
    Gao, Xin
    Band, Charlotte
    Mathur, Piyush
    Papay, Frank
    Khanna, Ashish K.
    Cywinski, Jacek B.
    Maheshwari, Kamal
    Xie, Pengtao
    Xing, Eric P.
    [J]. Proceedings of Machine Learning Research, 2019, 106 : 197 - 215