Optical Character Recognition for Medical Records Digitization with Deep Learning

Cited by: 2
Authors
Zaryab, Muhammad Ateeque [1]
Ng, Chuen Rue [1]
Affiliations
[1] Tech Univ Ilmenau, Inst Biomed Engn, Ilmenau, Germany
Keywords
OCR; Deep Learning; Computer Vision; Text Recognition
DOI
10.1109/ICIP49359.2023.10222038
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The importance of document digitization has increased with recent technological advancements, including in the medical field. Digitization of medical records plays a vital role in the healthcare sector because it helps expedite emergency treatment. Due to the scarcity of published studies and public German textual resources, a medical records database with German handwriting was collected and digitized. In this study, document digitization was accomplished by applying deep learning, region of interest (ROI) detection, and optical character recognition (OCR) to a dataset of medical forms filled in with German and English characters. To find the best model for ROI detection, YOLOv5 and SSDResNet50 were compared, with YOLOv5 producing the better mean average precision (mAP) of 0.91. OCR was then carried out on the YOLOv5 output using two methods for comparison. The Gated-CNN-BLSTM algorithm yielded a character error rate (CER) of 9%, while transformer-based OCR (TrOCR) achieved a CER of 6%. The proposed system could be implemented and further tested in local hospitals, and the OCR dictionary could be expanded to include other Roman character-based languages.
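The abstract reports OCR quality as a character error rate (CER). As a reading aid, the following is a minimal sketch of how CER is commonly computed, assuming the usual definition (Levenshtein edit distance between recognized and reference text, divided by the reference length). The function names `levenshtein` and `cer` are hypothetical and are not taken from the paper, whose exact evaluation procedure is not given in this record.

```python
def levenshtein(ref: str, hyp: str) -> int:
    """Minimum number of character insertions, deletions and substitutions
    needed to turn hyp into ref (dynamic programming, one row at a time)."""
    prev = list(range(len(hyp) + 1))  # distances from the empty reference prefix
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty hypothesis
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character error rate: edit distance divided by reference length.
    Assumed definition; the paper may normalize differently."""
    if not ref:
        raise ValueError("reference text must be non-empty")
    return levenshtein(ref, hyp) / len(ref)
```

For example, `cer("Schmerzen", "Schmersen")` counts one substituted character against a nine-character reference and evaluates to 1/9. On this definition, a CER of 6% corresponds to roughly six character edits per 100 reference characters.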
Pages: 3260-3263
Page count: 4
Related papers
50 in total
  • [21] Manuscripts Character Recognition Using Machine Learning and Deep Learning
    Islam, Mohammad Anwarul
    Iacob, Ionut E.
    MODELLING, 2023, 4 (02): : 168 - 188
  • [22] Character Recognition using Machine Learning and Deep Learning - A Survey
    Sharma, Reya
    Kaushik, Baijnath
    Gondhi, Naveen
    2020 INTERNATIONAL CONFERENCE ON EMERGING SMART COMPUTING AND INFORMATICS (ESCI), 2020, : 341 - 345
  • [23] Deep optical character recognition: a case of Pashto language
    Zahoor, Shizza
    Naz, Saeeda
    Khan, Naila H.
    Razzak, Muhammad, I
    JOURNAL OF ELECTRONIC IMAGING, 2020, 29 (02)
  • [24] Improved Optical Character Recognition with Deep Neural Network
    Wei, Tan Chiang
    Sheikh, U. U.
    Ab Rahman, Ab Al-Hadi
    2018 IEEE 14TH INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & ITS APPLICATIONS (CSPA 2018), 2018, : 245 - 249
  • [25] Deep Learning-based Arabic Optical Character Recognition: A New Comprehensive Dataset at Character and Word Levels.
    Gaashan, Khulood
    Younes, Maram Bani
    2024 15TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS, ICICS 2024, 2024,
  • [26] Hybrid Handwriting Character Recognition with Transfer Deep Learning
    Can, Ferit
    Yilmaz, Atinc
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [27] The Recognition and Implementation of Handwritten Character based on Deep Learning
    Dai, Fengzhi
    Ye, Zhongyong
    Jin, Xia
    JOURNAL OF ROBOTICS NETWORKING AND ARTIFICIAL LIFE, 2019, 6 (01): : 52 - 55
  • [28] A Multiclass Classification Method Based on Deep Learning for Named Entity Recognition in Electronic Medical Records
    Dong, Xishuang
    Qian, Lijun
    Guan, Yi
    Huang, Lei
    Yu, Qiubin
    Yang, Jinfeng
    2016 NEW YORK SCIENTIFIC DATA SUMMIT (NYSDS), 2016,
  • [29] Deep Learning Networks for Handwritten Bangla Character Recognition
    Begum, H.
    Islam, M.M.
    Eva, H.S.
    Emon, N.H.
    Siddique, F.A.
    IAENG International Journal of Applied Mathematics, 2023, 53 (04)
  • [30] Deep Learning for Handwritten Javanese Character Recognition
    Rismiyati
    Khadijah
    Nurhadiyatna, Adi
    2017 1ST INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTATIONAL SCIENCES (ICICOS), 2017, : 59 - 63