A Refinement System for Medical Information Extraction from Text-based Bilingual Electronic Medical Records

被引:0
|
作者
Bae, Inho [1 ]
Kim, Jin-Sang [1 ]
机构
[1] Keimyung Univ, Dept Comp Engn, 2800 Dalgubeoldaero, Daegu 704701, South Korea
关键词
EMR Refinement; Spelling Error Correction; Abbreviation/Acronym Disambiguation; Information Extraction;
D O I
暂无
中图分类号
R-058 [];
学科分类号
摘要
Objective: Applications to extract medical information from electronic medical records(EMRs) confront some serious obstacles such as spelling errors, ambiguous abbreviations, and unrecognizable words. Those obstacles hinder the process of finding medical entities, relations, and events. We present an efficient EMR refinement system for the purpose of medical information extraction from EMRs, not just for traditional text error correction. Methods: The EMR refinement system has been designed and implemented through following steps: 1) Build domain constrained dictionary database, 2) Correct spelling errors in Korean-English EMR documents, 3) Resolve ambiguous abbreviations in the bilingual documents. The resulting EMR documents are now machine readable and can be applied to various applications including information extraction. Result: Precision rate of the refinement system for spelling error correction is 8 0.4% and for disambiguating abbreviations/acronyms is 94.7%. Conclusion: We developed an EMR refinement system to correct spelling errors and resolve ambiguous abbreviations as well as unrecognizable words. Our system can enhance the reliability of medical records and contribute to develop further application systems in the field of text mining and information extraction.
引用
收藏
页码:267 / 274
页数:8
相关论文
共 50 条
  • [1] Medical Information System Based on Electronic Healthcare Records
    Paun, D.
    Sauciuc, D.
    Stan, O.
    Iosif, O.
    Dehelean, C.
    Miclea, L.
    [J]. PROCEEDINGS OF 2010 IEEE INTERNATIONAL CONFERENCE ON AUTOMATION, QUALITY AND TESTING, ROBOTICS (AQTR 2010), VOLS. 1-3, 2010,
  • [2] Construction of cardiovascular information extraction corpus based on electronic medical records
    Chang, Hongyang
    Zan, Hongying
    Zhang, Shuai
    Zhao, Bingfei
    Zhang, Kunli
    [J]. MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (07) : 13379 - 13397
  • [3] Information Extraction for Intestinal Cancer Electronic Medical Records
    Wang, Sufen
    Pang, Minmin
    Pan, Changqing
    Yuan, Junyi
    Xu, Bo
    Du, Ming
    Zhang, Hong
    [J]. IEEE ACCESS, 2020, 8 : 125923 - 125934
  • [4] A System for Medical Information Extraction and Verification from Unstructured Text
    Juric, Damir
    Stoilos, Giorgos
    Melo, Andre
    Moore, Jonathan
    Khodadadi, Mohammad
    [J]. THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13314 - 13319
  • [5] Software Architecture for Improving Accessibility to Medical Text-Based Information
    Topac, Vasile
    Stoicu-Tivadar, Vasile
    [J]. MEDICAL INFORMATICS IN A UNITED AND HEALTHY EUROPE, 2009, 150 : 146 - 146
  • [6] A network-based analysis of medical information extracted from electronic medical records
    Reategui, Ruth
    Ratte, Sylvie
    Bautista-Valarezo, Estefania
    Beltran-Valdivieso, J. F.
    [J]. 2020 XLVI LATIN AMERICAN COMPUTING CONFERENCE (CLEI 2020), 2021, : 10 - 19
  • [7] Data-Driven Information Extraction from Chinese Electronic Medical Records
    Xu, Dong
    Zhang, Meizhuo
    Zhao, Tianwan
    Ge, Chen
    Gao, Weiguo
    Wei, Jia
    Zhu, Kenny Q.
    [J]. PLOS ONE, 2015, 10 (08):
  • [8] TEPAPA: a novel in silico feature learning pipeline for mining prognostic and associative factors from text-based electronic medical records
    Frank Po-Yen Lin
    Adrian Pokorny
    Christina Teng
    Richard J. Epstein
    [J]. Scientific Reports, 7
  • [9] TEPAPA: a novel in silico feature learning pipeline for mining prognostic and associative factors from text-based electronic medical records
    Lin, Frank Po-Yen
    Pokorny, Adrian
    Teng, Christina
    Epstein, Richard J.
    [J]. SCIENTIFIC REPORTS, 2017, 7
  • [10] A Medical Decision Support System Using Text Mining to Compare Electronic Medical Records
    Lee, Pei-ju
    Lee, Yen-Hsien
    Kang, Yihuang
    Chao, Ching-Ping
    [J]. HCI IN BUSINESS, GOVERNMENT AND ORGANIZATIONS: INFORMATION SYSTEMS AND ANALYTICS, 2019, 11589 : 199 - 208