Optical Character Recognition for Medical Records Digitization with Deep Learning

Cited by: 2
Authors
Zaryab, Muhammad Ateeque [1]
Ng, Chuen Rue [1]
Affiliations
[1] Tech Univ Ilmenau, Inst Biomed Engn, Ilmenau, Germany
Keywords
OCR; Deep Learning; Computer Vision; Text Recognition
DOI
10.1109/ICIP49359.2023.10222038
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The importance of document digitization has increased with recent technological advancements, including in the medical field. Digitization of medical records plays a vital role in the healthcare sector because it helps expedite emergency treatment. Due to the scarcity of published studies and public German textual resources, a medical records database with German handwriting was collected and digitized. In this study, document digitization was accomplished by applying deep learning, region of interest (ROI) detection, and optical character recognition (OCR) to a dataset of medical forms filled in with German and English characters. To find the best model for ROI detection, YOLOv5 and SSDResNet50 were compared, with YOLOv5 producing the better mean average precision (mAP) of 0.91. OCR was then carried out on the YOLOv5 output using two methods, again for comparison. The Gated-CNN-BLSTM algorithm yielded a character error rate (CER) of 9%, while transformer-based OCR (TrOCR) achieved a CER of 6%. The proposed system could be implemented and further tested in local hospitals, and the OCR dictionary could be expanded to include other languages written in Roman characters.
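The abstract reports OCR quality as a character error rate but this record does not say how the authors computed it. As a minimal sketch, assuming the common definition (Levenshtein edit distance between the recognized text and the reference transcription, divided by the reference length), CER could be calculated as follows. The function names and the sample strings are illustrative and do not come from the paper.

```python
def levenshtein(ref: str, hyp: str) -> int:
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn `hyp` into `ref`."""
    # prev[j] holds the distance between the first i-1 chars of ref
    # and the first j chars of hyp.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def character_error_rate(ref: str, hyp: str) -> float:
    """CER under the assumed definition: edit distance / reference length."""
    if not ref:
        raise ValueError("reference text must not be empty")
    return levenshtein(ref, hyp) / len(ref)


if __name__ == "__main__":
    # Hypothetical example: one wrong character in a 7-character word.
    print(character_error_rate("Schmerz", "Schmers"))
```

Under this definition, a CER of 6% corresponds to roughly six character edits per 100 reference characters. Note that the value can exceed 1.0 when the recognized text is much longer than the reference.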
Pages: 3260-3263
Page count: 4