End-to-End Optical Character Recognition for Bengali Handwritten Words

被引:0
|
作者
Safir, Farisa Benta [1 ]
Ohi, Abu Quwsar [1 ]
Mridha, M. F. [1 ]
Monowar, Muhammad Mostafa [2 ]
Hamid, Md Abdul [2 ]
机构
[1] Bangladesh Univ Business & Technol, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Informat Technol, Jeddah 21589, Saudi Arabia
关键词
OCR; Bengali handwriting; Baseline; LSTM; CTC loss; BANGLA; OCR;
D O I
10.1109/NCCC49330.2021.9428809
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Optical character recognition (OCR) is a process of converting analogue documents into digital using document images. Currently, many commercial and non-commercial OCR systems exist for both handwritten and printed copies for different languages. Despite this, very few works are available in case of recognising Bengali words. Among them, most of the works focused on OCR of printed Bengali characters. This paper introduces an end-to-end OCR system for Bengali language. The proposed architecture implements an end to end strategy that recognises handwritten Bengali words from handwritten word images. We experiment with popular convolutional neural network (CNN) architectures, including DenseNet, Xception, NAS-Net, and MobileNet to build the OCR architecture. Further, we experiment with two different recurrent neural networks (RNN) methods, LSTM and GRU. We evaluate the proposed architecture using BanglaWritting dataset, which is a peer-reviewed Bengali handwritten image dataset. The proposed method achieves 0.091 character error rate and 0.273 word error rate performed using DenseNet121 model with GRU recurrent layer.
引用
收藏
页码:1067 / +
页数:7
相关论文
共 50 条
  • [21] Improvement of End-to-End Offline Handwritten Mathematical Expression Recognition by Weakly Supervised Learning
    Thanh-Nghia Truong
    Cuong Tuan Nguyen
    Khanh Minh Phan
    Nakagawa, Masaki
    [J]. 2020 17TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR 2020), 2020, : 181 - 186
  • [22] End-to-end Handwritten Chinese Paragraph Text Recognition Using Residual Attention Networks
    Wang, Yintong
    Yang, Yingjie
    Chen, Haiyan
    Zheng, Hao
    Chang, Heyou
    [J]. INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2022, 34 (01): : 371 - 388
  • [23] End-to-end handwritten Ge’ez multiple numerals recognition using deep learning
    Malhotra R.
    Addis M.T.
    [J]. SICE Journal of Control, Measurement, and System Integration, 2024, 17 (01) : 122 - 134
  • [24] An End-to-End Marking Recognition System for PCB Optical Inspection
    Ghosh, Shajib
    Bhavsar, Janshi Sunilkumar
    Elsayed, Nelly
    Asadizanjani, Navid
    [J]. 2022 IEEE PHYSICAL ASSURANCE AND INSPECTION OF ELECTRONICS (PAINE), 2022, : 43 - 50
  • [25] End-to-end optical music recognition for pianoform sheet music
    Rios-Vila, Antonio
    Rizo, David
    Inesta, Jose M.
    Calvo-Zaragoza, Jorge
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2023, 26 (03) : 347 - 362
  • [26] Emphasizing unseen words: New vocabulary acquisition for end-to-end speech recognition
    Qu, Leyuan
    Weber, Cornelius
    Wermter, Stefan
    [J]. NEURAL NETWORKS, 2023, 161 : 494 - 504
  • [27] End-to-End Neural Optical Music Recognition of Monophonic Scores
    Calvo-Zaragoza, Jorge
    Rizo, David
    [J]. APPLIED SCIENCES-BASEL, 2018, 8 (04):
  • [28] Approaching End-to-End Optical Music Recognition for Homophonic Scores
    Alfaro-Contreras, Maria
    Calvo-Zaragoza, Jorge
    Inesta, Jose M.
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, IBPRIA 2019, PT II, 2019, 11868 : 147 - 158
  • [29] End-to-end optical music recognition for pianoform sheet music
    Antonio Ríos-Vila
    David Rizo
    José M. Iñesta
    Jorge Calvo-Zaragoza
    [J]. International Journal on Document Analysis and Recognition (IJDAR), 2023, 26 : 347 - 362
  • [30] CHARACTER-AWARE ATTENTION-BASED END-TO-END SPEECH RECOGNITION
    Meng, Zhong
    Gaur, Yashesh
    Li, Jinyu
    Gong, Yifan
    [J]. 2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 949 - 955