Towards Building A Robust Large-Scale Bangla Text Recognition Solution Using A Unique Multiple-Domain Character-Based Document Recognition Approach

Cited by: 0
Authors
Rabby, A. K. M. Shahariar Azad [2 ,3 ]
Islam, Md Majedul [2 ]
Islam, Zahidul [2 ]
Hasan, Nazmul [2 ]
Rahman, Fuad [1 ]
Affiliations
[1] Apurba Technol, Sunnyvale, CA 94085 USA
[2] Apurba Technol, Dhaka, Bangladesh
[3] Univ Alabama Birmingham, Birmingham, AL 35294 USA
Source
20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021) | 2021
Keywords
OCR; Document Processing; Handwriting; Segmentation; Recognition;
DOI
10.1109/ICMLA52953.2021.00225
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Bangla is among the world's ten most widely spoken languages by number of speakers. Its script is complex, primarily because of compound characters (graphemes composed of multiple single characters) and characteristic shorthands such as vowel diacritics and consonant diacritics, which make the set of classes to be recognized large, varied, and challenging. In this paper, we present a unique large-scale Bangla document OCR solution based on character-level recognition modules. We tested our approach on two independent domains, printed and handwritten documents, and applied it to three subdomains within the printed domain: computer-composed documents, letterpress documents, and typewritten documents. Our extensive experiments show that the approach achieves state-of-the-art performance on both handwritten and printed documents.
Pages: 1393-1399
Page count: 7