Towards building a Bangla text recognition solution with a Multi-Headed CNN architecture

Cited by: 2
Authors
Islam, Md Majedul [2]
Das, Avishek [2]
Kowsar, Ibna [2]
Rabby, A. K. M. Shahariar Azad [2, 3]
Hasan, Nazmul [2]
Rahman, Fuad [1]
Affiliations
[1] Apurba Technol, Sunnyvale, CA USA
[2] Apurba Technol, Dhaka, Bangladesh
[3] Univ Alabama Birmingham, Birmingham, AL USA
Keywords
Bangla OCR; Character Recognition; Handwriting; Segmentation; Bangla Character
DOI
10.1109/BigData52589.2021.9671653
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Bangla is among the ten most widely spoken languages in the world by number of speakers. Bangla text recognition is more challenging than recognition for other languages because of graphemes composed of multiple single characters and the diacritics of vowels and consonants. The purpose of this study is to develop an innovative large-scale Bangla OCR solution based on character-level recognition. We tested our method on two types of documents, handwritten and printed, applying our proposed attention-based multi-headed CNN architecture to handwritten documents and to three subdomains of the printed domain: computer-composed, letterpress, and typewritten documents. Extensive testing shows that our method provides state-of-the-art performance on both handwritten and printed texts.
Pages: 1061-1067
Number of pages: 7