End-to-End Optical Character Recognition for Bengali Handwritten Words

被引:0
|
作者
Safir, Farisa Benta [1 ]
Ohi, Abu Quwsar [1 ]
Mridha, M. F. [1 ]
Monowar, Muhammad Mostafa [2 ]
Hamid, Md Abdul [2 ]
机构
[1] Bangladesh Univ Business & Technol, Dept Comp Sci & Engn, Dhaka, Bangladesh
[2] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Informat Technol, Jeddah 21589, Saudi Arabia
关键词
OCR; Bengali handwriting; Baseline; LSTM; CTC loss; BANGLA; OCR;
D O I
10.1109/NCCC49330.2021.9428809
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Optical character recognition (OCR) is a process of converting analogue documents into digital using document images. Currently, many commercial and non-commercial OCR systems exist for both handwritten and printed copies for different languages. Despite this, very few works are available in case of recognising Bengali words. Among them, most of the works focused on OCR of printed Bengali characters. This paper introduces an end-to-end OCR system for Bengali language. The proposed architecture implements an end to end strategy that recognises handwritten Bengali words from handwritten word images. We experiment with popular convolutional neural network (CNN) architectures, including DenseNet, Xception, NAS-Net, and MobileNet to build the OCR architecture. Further, we experiment with two different recurrent neural networks (RNN) methods, LSTM and GRU. We evaluate the proposed architecture using BanglaWritting dataset, which is a peer-reviewed Bengali handwritten image dataset. The proposed method achieves 0.091 character error rate and 0.273 word error rate performed using DenseNet121 model with GRU recurrent layer.
引用
收藏
页码:1067 / +
页数:7
相关论文
共 50 条
  • [1] An End-to-End Classifier Based on CNN for In-Air Handwritten-Chinese-Character Recognition
    Hu, Mianjun
    Qu, Xiwen
    Huang, Jun
    Wu, Xuangou
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (14):
  • [2] End-to-End page-Level assessment of handwritten text recognition
    Vidal, Enrique
    Toselli, Alejandro H.
    Rios-Vila, Antonio
    Calvo-Zaragoza, Jorge
    [J]. PATTERN RECOGNITION, 2023, 142
  • [3] An End-to-End Approach for Recognition of Modern and Historical Handwritten Numeral Strings
    Hochuli, Andre G.
    Britto, Alceu S., Jr.
    Barddal, Jean P.
    Oliveira, Luiz E. S.
    Sabourin, Robert
    [J]. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [4] A comprehensive comparison of end-to-end approaches for handwritten digit string recognition
    Hochuli, Andre G.
    Britto Jr, Alceu S.
    Saji, David A.
    Saavedra, Jose M.
    Sabourin, Robert
    Oliveira, Luiz S.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2021, 165 (165)
  • [5] Joint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition
    Bluche, Theodore
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016), 2016, 29
  • [6] End-to-End Scene Text Recognition with Character Centroid Prediction
    Zhao, Wei
    Ma, Jinwen
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2017), PT III, 2017, 10636 : 291 - 299
  • [7] End-to-end Tangut character database building and recognition method
    Ma, Jinlin
    Cao, Yunrui
    Ma, Ziping
    Wei, Lin
    Hao, Chaohua
    [J]. IET IMAGE PROCESSING, 2022, 16 (08) : 2087 - 2100
  • [8] AN END-TO-END DEEP NEURAL ARCHITECTURE FOR OPTICAL CHARACTER VERIFICATION AND RECOGNITION IN RETAIL FOOD PACKAGING
    Ribeiro, Fabio De Sousa
    Gong, Liyun
    Caliva, Francesco
    Swainson, Mark
    Gudmundsson, Kjartan
    Yu, Miao
    Leontidis, Georgios
    Ye, Xujiong
    Kollias, Stefanos
    [J]. 2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2376 - 2380
  • [9] Training an End-to-End System for Handwritten Mathematical Expression Recognition by Generated Patterns
    Anh Duc Le
    Nakagawa, Masaki
    [J]. 2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 1056 - 1061
  • [10] On usage of an end-to-end deep neural architecture for handwritten digit string recognition
    Zahra Omidi
    Bagher BabaAli
    [J]. Signal, Image and Video Processing, 2024, 18 : 3009 - 3020