Machine Learning for Automatic Encoding of French Electronic Medical Records: Is More Data Better?

Cited by: 1
Authors
Gobeill, Julien [1, 2]
Ruch, Patrick [1, 2]
Meyer, Rodolphe [3]
Affiliations
[1] Swiss Inst Bioinformat, SIB Text Min Grp, Geneva, Switzerland
[2] HES So HEG, Informat Sci, Geneva, Switzerland
[3] Univ Hospitals Geneva HUG, Informat Syst Dept, Geneva, Switzerland
Keywords
Medical coding; machine learning; text mining
DOI
10.3233/SHTI200173
Chinese Library Classification (CLC) number
R19 [Health care organization and services (health services management)]
Abstract
The encoding of Electronic Medical Records is a complex and time-consuming task. We report on a machine learning model for proposing diagnosis and procedure codes, trained on a large, realistic dataset of 245,000 electronic medical records from the University Hospitals of Geneva. Our study focuses on the impact of training data quantity on the model's performance. We show that performance does not improve when encoded instances from previous years are added to the training data. Furthermore, supervised models are shown to be highly perishable: we observe a potential drop in performance of around 10% per year. Consequently, great and constant care must be exercised in designing and updating the content of the knowledge bases exploited by machine learning.
Pages: 312-316
Number of pages: 5