Optical Character Recognition for Medical Records Digitization with Deep Learning

Cited by: 2
Authors
Zaryab, Muhammad Ateeque [1]
Ng, Chuen Rue [1]
Affiliations
[1] Tech Univ Ilmenau, Inst Biomed Engn, Ilmenau, Germany
Keywords
OCR; Deep Learning; Computer Vision; Text Recognition
DOI
10.1109/ICIP49359.2023.10222038
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The importance of document digitization has increased due to recent technological advancements, including in the medical field. Digitization of medical records plays a vital role in the healthcare sector as it helps expedite emergency treatment. Due to the scarcity of published studies and public German textual resources, a medical records database with German handwriting was collected and digitized. In this study, document digitization was accomplished by applying deep learning, region of interest (ROI) detection, and optical character recognition (OCR) to a dataset of medical forms filled in with German and English characters. To find the best model for ROI detection, YOLOv5 and SSDResNet50 were compared, with YOLOv5 producing the better mean average precision (mAP) of 0.91. OCR was then carried out on the YOLOv5 output using two different methods, again for comparison. The Gated-CNN-BLSTM algorithm yielded a character error rate (CER) of 9%, while transformer-based OCR (TrOCR) achieved a CER of 6%. The proposed system could be implemented and further tested in local hospitals, with the OCR dictionary being expandable to include other Roman character-based languages.
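The abstract compares the two OCR methods by character error rate (CER). As a rough illustration of that metric, the sketch below assumes the common definition of CER as the Levenshtein (edit) distance between the recognized text and the ground truth, divided by the ground-truth length. The abstract does not state the exact formula or normalization the authors used, and the function names here are hypothetical.

```python
def levenshtein(reference: str, hypothesis: str) -> int:
    """Minimum number of character insertions, deletions, and
    substitutions needed to turn `hypothesis` into `reference`."""
    # prev[j] holds the distance between the processed prefix of
    # `reference` and the first j characters of `hypothesis`.
    prev = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            cost = 0 if ref_char == hyp_char else 1
            curr.append(min(
                prev[j] + 1,         # delete a reference character
                curr[j - 1] + 1,     # insert a hypothesis character
                prev[j - 1] + cost,  # match or substitute
            ))
        prev = curr
    return prev[-1]


def character_error_rate(reference: str, hypothesis: str) -> float:
    """CER under the assumed definition: edit distance / reference length."""
    if not reference:
        raise ValueError("reference text must not be empty")
    return levenshtein(reference, hypothesis) / len(reference)


if __name__ == "__main__":
    # A dropped umlaut counts as one substitution out of six characters.
    print(character_error_rate("Müller", "Muller"))
```

Under this definition, a CER of 6% corresponds to roughly six character edits per 100 ground-truth characters.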
Pages: 3260-3263
Page count: 4
Related papers
50 in total
  • [31] Deep Learning Based Ancient Asian Character Recognition
    Atsumi, Masahiko
    Kawano, Syunsuke
    Morioka, Tomoki
    Meng, Lin
    2020 INTERNATIONAL CONFERENCE ON ADVANCED MECHATRONIC SYSTEMS (ICAMECHS), 2020, : 296 - 301
  • [32] The recognition and implementation of handwritten character based on deep learning
    Ye, Zhongyong
    Dai, Fengzhi
    Jin, Xia
    Yuan, Yasheng
    An, Lingran
    Yan, Yujie
    Qin, Yiqiao
    Li, Hao
    ICAROB 2019: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS, 2019, : 276 - 279
  • [33] Deep Learning Based Gujarati Handwritten Character Recognition
    Joshi, Dhara S.
    Risodkar, Yogesh R.
    2018 INTERNATIONAL CONFERENCE ON ADVANCES IN COMMUNICATION AND COMPUTING TECHNOLOGY (ICACCT), 2018, : 563 - 566
  • [34] Handwritten Character Recognition Using Deep-Learning
    Vaidya, Rohan
    Trivedi, Darshan
    Satra, Sagar
    Pimpale, Mrunalini
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INVENTIVE COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICICCT), 2018, : 772 - 775
  • [35] An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition
    Dhanikonda, Srinivasa Rao
    Sowjanya, Ponnuru
    Ramanaiah, M. Laxmidevi
    Joshi, Rahul
    Mohan, B. H. Krishna
    Dhabliya, Dharmesh
    Raja, N. Kannaiya
    SCIENTIFIC PROGRAMMING, 2022, 2022
  • [36] Mobile App for the Digitization and Deep-Learning-Based Classification of Electrocardiogram Printed Records
    Isabel, Alba
    Jimenez-Perez, Guillermo
    Camara, Oscar
    Silva, Etelvino
    2021 COMPUTING IN CARDIOLOGY (CINC), 2021,
  • [37] An Evaluation of the Layers of a Deep Network on the Optical Character Recognition Problem
    Saygin, Rahmani
    Oztimur Karadag, Ozge
    29TH IEEE CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS (SIU 2021), 2021,
  • [38] Optical Character Recognition using Deep Recurrent Attention Model
    Shaker, Mahmoud
    ElHelw, Mohamed
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION (ICRCA 2017), 2017, : 56 - 59
  • [39] Clinical Named Entity Recognition from Chinese Electronic Medical Records Based on Deep Learning Pretraining
    Gong, Lejun
    Zhang, Zhifei
    Chen, Shiqi
    JOURNAL OF HEALTHCARE ENGINEERING, 2020, 2020
  • [40] Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition
    Wick, Christoph
    Reul, Christian
    Puppe, Frank
    DIGITAL HUMANITIES QUARTERLY, 2020, 14 (02):