Machine Learning for Automatic Encoding of French Electronic Medical Records: Is More Data Better ?

被引:1
|
作者
Gobeill, Julien [1 ,2 ]
Ruch, Patrick [1 ,2 ]
Meyer, Rodolphe [3 ]
机构
[1] Swiss Inst Bioinformat, SIB Text Min Grp, Geneva, Switzerland
[2] HES So HEG, Informat Sci, Geneva, Switzerland
[3] Univ Hospitals Geneva HUG, Informat Syst Dept, Geneva, Switzerland
来源
DIGITAL PERSONALIZED HEALTH AND MEDICINE | 2020年 / 270卷
关键词
Medical coding; machine learning; text mining;
D O I
10.3233/SHTI200173
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
The encoding of Electronic Medical Records is a complex and time-consuming task. We report on a machine learning model for proposing diagnoses and procedures codes, from a large realistic dataset of 245 000 electronic medical records at the University Hospitals of Geneva. Our study particularly focuses on the impact of training data quantity on the model's performances. We show that the performances of the models do not increase while encoded instances from previous years are exploited for learning data. Furthermore, supervised models are shown to be highly perishable: we show a potential drop in performances of around -10% per year. Consequently, great and constant care must be exercised for designing and updating the content of such knowledge bases exploited by machine learning.
引用
收藏
页码:312 / 316
页数:5
相关论文
共 50 条
  • [41] AUTOMATED MACHINE LEARNING FRAMEWORK TO PROCESS ELECTRONIC MEDICAL RECORDS FOR CARDIOVASCULAR COMPLICATION RISK ASSESSMENT
    Lim, Chan
    Mekhael, Mario
    Pottle, Christopher
    El Hajjar, Abdel Hadi
    Noujaim, Charbel
    Zhang, Yichi
    Marrouche, Nassir F.
    JOURNAL OF THE AMERICAN COLLEGE OF CARDIOLOGY, 2022, 79 (09) : 2017 - 2017
  • [42] Electronic medical records: A vision for medical data and service grids
    Liu, Liping
    Zhu, Dan
    2007 INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT, VOLS 1-3, 2007, : 1133 - +
  • [43] Mining Primary Care Electronic Health Records for Automatic Disease Phenotyping: A Transparent Machine Learning Framework
    Fernandez-Gutierrez, Fabiola
    Kennedy, Jonathan I.
    Cooksey, Roxanne
    Atkinson, Mark
    Choy, Ernest
    Brophy, Sinead
    Huo, Lin
    Zhou, Shang-Ming
    DIAGNOSTICS, 2021, 11 (10)
  • [44] Machine Learning and Electronic Health Records: A Paradigm Shift
    Adkins, Daniel E.
    AMERICAN JOURNAL OF PSYCHIATRY, 2017, 174 (02): : 93 - 94
  • [45] Development and validation of a rheumatoid arthritis case definition: a machine learning approach using data from primary care electronic medical records
    Pham, Anh N. Q.
    Barber, Claire E. H.
    Drummond, Neil
    Jasper, Lisa
    Klein, Doug
    Lindeman, Cliff
    Widdifield, Jessica
    Williamson, Tyler
    Jones, C. Allyson
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [46] Learning and recommending treatments using electronic medical records
    Hoang, Khanh Hung
    Ho, Tu Bao
    KNOWLEDGE-BASED SYSTEMS, 2019, 181
  • [47] Modern Machine Learning: More with Less, Cheaper and Better
    Bohte, Sander
    Hung Son Nguyen
    ERCIM NEWS, 2016, (107): : 16 - 17
  • [48] Learning Treatment Regimens from Electronic Medical Records
    Hoang, Khanh Hung
    Ho, Tu Bao
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2018, PT I, 2018, 10937 : 411 - 422
  • [49] Automated Fusion of Multimodal Electronic Health Records for Better Medical Predictions
    Cui, Suhan
    Wang, Jiaqi
    Zhong, Yuan
    Liu, Han
    Wang, Ting
    Ma, Fenglong
    PROCEEDINGS OF THE 2024 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2024, : 361 - 369
  • [50] ELECTRONIC MEDICAL RECORDS: DATA IS NOT ALWAYS WHAT IT SEEMS
    Fleenor, T. T.
    Hall, J.
    Kulkarni, A.
    Lake, M.
    Sunallah, R.
    Klasner, A.
    JOURNAL OF INVESTIGATIVE MEDICINE, 2016, 64 (02) : 508 - 508