Ensemble deep learning model for optical character recognition

被引:1
|
作者
Shetty, Ashish [1 ]
Sharma, Sanjeev [1 ]
机构
[1] Indian Inst Informat Technol, Pune, India
关键词
Character recognition; OCR; Convolution Neural Network; CNN; Deep learning; The Chars74K dataset; Ensemble model;
D O I
10.1007/s11042-023-16018-0
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In modern deep learning, character recognition in images is a very important field of study due to its has many real life applications. The goal of this paper is to create the state-of-the-art character recognition model using a stacking ensemble of convolution neural networks (CNNs).To develop the proposed ensemble model, we evaluated several CNN models. The models were judged on how well they performed on the Chars74k dataset. The dataset contains 74,103 images divided into 62 classes with labels [A-Z], [a-z], and [0-9]. The accuracy distribution based on the dataset's subgroups (uppercase, lowercase, and digit) is shown in results. The proposed ensemble model achieves state-of-the-art performance with a maximum accuracy of 92.31% on complete dataset, 99.22% on Uppercase alphabets, 98.66% on Lowercase alphabets, 99.77% on Digits, 91.97% on Uppercase+Lowercase alphabets. On the complete and partial datasets, a comparison report between the proposed model and other existing approaches is also displayed. A comparative study of the proposed work and the previous methods is also shown in this paper, in order to demonstrate the effectiveness of the proposed work.
引用
收藏
页码:11411 / 11431
页数:21
相关论文
共 50 条
  • [1] Ensemble deep learning model for optical character recognition
    Ashish Shetty
    Sanjeev Sharma
    [J]. Multimedia Tools and Applications, 2024, 83 : 11411 - 11431
  • [2] Optical Character Recognition for Medical Records Digitization with Deep Learning
    Zaryab, Muhammad Ateeque
    Ng, Chuen Rue
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 3260 - 3263
  • [3] Deep Learning Based Sinhala Optical Character Recognition (OCR)
    Anuradha, Isuri
    Liyanage, Chamila
    Wijayawardhana, Harsha
    Weerasinghe, Ruvan
    [J]. 2020 20TH INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER-2020), 2020, : 298 - 299
  • [4] Optical Character Recognition using Deep Learning: An enhanced Approach
    Amara, Marwa
    Zaghdoud, Radhia
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (05): : 545 - 552
  • [5] Offline handwritten Tai Le character recognition using ensemble deep learning
    Guo, Hai
    Liu, Yifan
    Yang, Doudou
    Zhao, Jingying
    [J]. VISUAL COMPUTER, 2022, 38 (11): : 3897 - 3910
  • [6] Offline handwritten Tai Le character recognition using ensemble deep learning
    Hai Guo
    Yifan Liu
    Doudou Yang
    Jingying Zhao
    [J]. The Visual Computer, 2022, 38 : 3897 - 3910
  • [7] Deep Learning for Optical Character Recognition and Its Application to VAT Invoice Recognition
    Wang, Yu
    Gui, Guan
    Zhao, Nan
    Yin, Yue
    Huang, Hao
    Li, Yunyi
    Wang, Jie
    Yang, Jie
    Zhang, Haijun
    [J]. COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL III: SYSTEMS, 2020, 517 : 87 - 95
  • [8] Enhanced Ensemble Technique for Optical Character Recognition
    Habeeb, Imad Qasim
    Al-Zaydi, Zeyad Qasim
    Abdulkhudhur, Hanan Najm
    [J]. NEW TRENDS IN INFORMATION AND COMMUNICATIONS TECHNOLOGY APPLICATIONS, NTICT 2018, 2018, 938 : 213 - 225
  • [9] An Efficient Deep Learning Model with Interrelated Tagging Prototype with Segmentation for Telugu Optical Character Recognition
    Dhanikonda, Srinivasa Rao
    Sowjanya, Ponnuru
    Ramanaiah, M. Laxmidevi
    Joshi, Rahul
    Mohan, B. H. Krishna
    Dhabliya, Dharmesh
    Raja, N. Kannaiya
    [J]. SCIENTIFIC PROGRAMMING, 2022, 2022
  • [10] Optical Character Recognition using Deep Recurrent Attention Model
    Shaker, Mahmoud
    ElHelw, Mohamed
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ROBOTICS, CONTROL AND AUTOMATION (ICRCA 2017), 2017, : 56 - 59