Optical Character Recognition for Medical Records Digitization with Deep Learning

被引:2
|
作者
Zaryab, Muhammad Ateeque [1 ]
Ng, Chuen Rue [1 ]
机构
[1] Tech Univ Ilmenau, Inst Biomed Engn, Ilmenau, Germany
关键词
OCR; Deep Learning; Computer Vision; Text Recognition;
D O I
10.1109/ICIP49359.2023.10222038
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The importance of document digitization has increased due to recent technological advancements, including in the medical field. Digitization of medical records plays a vital role in the healthcare sector as it helps expedite emergency treatment. Due to the scarcity of published studies and public German textual resources, a medical records database with German handwriting was collected and digitized. In this study, document digitization was accomplished by implementing deep learning, region of interest (ROI) detection, and optical character recognition (OCR) on a dataset containing medical forms filled with German and English characters. To find the best model for ROI detection, YOLOv5, and SSDResNet50 models were utilized and compared with YOLOv5 producing a better mean average precision (mAP) of 0.91. OCR was then carried out on the output from YOLOv5 with two different methods again for comparison. The Gated-CNN-BLSTM algorithm yielded a character error rate (CER) of 9%, while transformer-based OCR (TrOCR) achieved a CER of 6%. The proposed system could be implemented and further tested in local hospitals, with the OCR dictionary being expandable to include other Roman character-based languages.
引用
收藏
页码:3260 / 3263
页数:4
相关论文
共 50 条
  • [1] Ensemble deep learning model for optical character recognition
    Ashish Shetty
    Sanjeev Sharma
    Multimedia Tools and Applications, 2024, 83 : 11411 - 11431
  • [2] Ensemble deep learning model for optical character recognition
    Shetty, Ashish
    Sharma, Sanjeev
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) : 11411 - 11431
  • [3] Combining Optical Character Recognition With Paper ECG Digitization
    Ganesh, Shambavi
    Bhatti, Pamela T.
    Alkhalaf, Mhmtjamil
    Gupta, Shishir
    Shah, Amit J.
    Tridandapani, Srini
    IEEE JOURNAL OF TRANSLATIONAL ENGINEERING IN HEALTH AND MEDICINE, 2021, 9
  • [4] Deep Learning Based Sinhala Optical Character Recognition (OCR)
    Anuradha, Isuri
    Liyanage, Chamila
    Wijayawardhana, Harsha
    Weerasinghe, Ruvan
    2020 20TH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER-2020), 2020, : 298 - 299
  • [5] Optical Character Recognition using Deep Learning: An enhanced Approach
    Amara, Marwa
    Zaghdoud, Radhia
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (05): : 545 - 552
  • [6] Deep Learning for Optical Character Recognition and Its Application to VAT Invoice Recognition
    Wang, Yu
    Gui, Guan
    Zhao, Nan
    Yin, Yue
    Huang, Hao
    Li, Yunyi
    Wang, Jie
    Yang, Jie
    Zhang, Haijun
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL III: SYSTEMS, 2020, 517 : 87 - 95
  • [7] A Digitization Pipeline for Mixed-Typed Documents Using Machine Learning and Optical Character Recognition
    Matschak, Tizian
    Rampold, Florian
    Hellmeier, Malte
    Prinz, Christoph
    Trang, Simon
    TRANSDISCIPLINARY REACH OF DESIGN SCIENCE RESEARCH, DESRIST 2022, 2022, 13229 : 195 - 207
  • [8] Mass Digitization of Early Modern Texts With Optical Character Recognition
    Christy, Matthew
    Gupta, Anshul
    Grumbach, Elizabeth
    Mandell, Laura
    Furuta, Richard
    Gutierrez-Osuna, Ricardo
    ACM JOURNAL ON COMPUTING AND CULTURAL HERITAGE, 2018, 11 (01):
  • [9] OCR-MRD: performance analysis of different optical character recognition engines for medical report digitization
    Batra P.
    Phalnikar N.
    Kurmi D.
    Tembhurne J.
    Sahare P.
    Diwan T.
    International Journal of Information Technology, 2024, 16 (1) : 447 - 455
  • [10] Robust Character Recognition For Optical And Natural Images Using Deep Learning
    Abdali, Al Maamoon Rasool
    Ghani, Rana Fareed
    2019 17TH IEEE STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT (SCORED), 2019, : 152 - 156