A Convolutional Recurrent Neural-Network-Based Machine Learning for Scene Text Recognition Application

Cited by: 6
Authors
Liu, Yiyi [1 ]
Wang, Yuxin [1 ]
Shi, Hongjian [1 ]
Affiliations
[1] Beijing Normal Univ Hong Kong Baptist Univ United, Guangdong Prov Key Lab Interdisciplinary Res & App, Zhuhai 519087, Peoples R China
Source
SYMMETRY-BASEL | 2023 / Vol. 15 / Issue 04
Keywords
CRNN; DBNet; OCR; Retinex
DOI
10.3390/sym15040849
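Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code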
07 ; 0710 ; 09 ;
Abstract
Optical character recognition (OCR) is the process of extracting text and layout information from image files by analyzing and recognizing the text they contain. It also identifies the geometric location and orientation of text and its symmetrical behavior. OCR usually consists of two steps: text detection and text recognition. Scene text recognition is a subfield of OCR that focuses on text in natural scenes, such as streets, billboards, and license plates. Unlike in traditional photographs of documents, locating and reading text in natural scenes is a challenging task for computer technology. Image-based sequence recognition is a longstanding research subject in computer vision. Great progress has been made in this field; however, most models still struggle to recognize text in images of complex scenes with high accuracy. This paper proposes a new text recognition approach based on the convolutional recurrent neural network (CRNN) to address this issue. It combines real-time scene text detection with differentiable binarization (DBNet) for text detection and segmentation, a text direction classifier, and the Retinex algorithm for image enhancement. To evaluate the proposed method, we analyzed the algorithm experimentally and ran simulations on complex scene image data drawn from the existing literature and on several real datasets designed for a variety of nonstationary environments. Experimental results demonstrate that the proposed model performs better than the baseline methods on three benchmark datasets and achieves on-par performance with other approaches on existing datasets. The model addresses the inability of CRNN to recognize text in complex and multi-oriented scenes. Furthermore, it outperforms the original CRNN model, with higher accuracy across a wider variety of application scenarios.
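The abstract names the Retinex algorithm as the image enhancement step but does not say which variant is used. As an illustration only, the following is a minimal sketch of single-scale Retinex on a grayscale image, written in plain Python. The choice of the single-scale variant, the function names, the `sigma` default, and the list-of-rows image format are all assumptions made for this example, not details taken from the paper.

```python
import math


def gaussian_kernel(sigma):
    """Return a normalized 1-D Gaussian kernel covering about +/- 3 sigma."""
    radius = max(1, int(3 * sigma))
    weights = [math.exp(-(i * i) / (2.0 * sigma * sigma))
               for i in range(-radius, radius + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def blur_rows(image, kernel):
    """Blur each row with the kernel, clamping indices at the borders."""
    radius = len(kernel) // 2
    width = len(image[0])
    blurred = []
    for row in image:
        blurred.append([
            sum(k * row[min(max(x + j - radius, 0), width - 1)]
                for j, k in enumerate(kernel))
            for x in range(width)
        ])
    return blurred


def transpose(image):
    """Swap rows and columns of a list-of-rows image."""
    return [list(col) for col in zip(*image)]


def gaussian_blur(image, sigma):
    """Separable 2-D Gaussian blur: rows first, then columns."""
    kernel = gaussian_kernel(sigma)
    horizontal = blur_rows(image, kernel)
    return transpose(blur_rows(transpose(horizontal), kernel))


def single_scale_retinex(image, sigma=2.0):
    """Return log(I + 1) - log(blur(I) + 1) for a grayscale image.

    The blurred image stands in for the illumination estimate; subtracting
    it in the log domain leaves an approximation of the reflectance.
    """
    illumination = gaussian_blur(image, sigma)
    return [
        [math.log(p + 1.0) - math.log(l + 1.0) for p, l in zip(row, lrow)]
        for row, lrow in zip(image, illumination)
    ]


def rescale_to_uint8(values):
    """Linearly stretch values to the 0..255 range for display."""
    flat = [v for row in values for v in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:
        return [[0 for _ in row] for row in values]
    scale = 255.0 / (hi - lo)
    return [[int(round((v - lo) * scale)) for v in row] for row in values]
```

In a pipeline like the one described, an enhancement of this kind would presumably run before detection and recognition; the abstract does not state the exact ordering, so that placement is also an assumption.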
Pages: 17
Related papers
50 in total
  • [1] Scene text recognition using residual convolutional recurrent neural network
    Lei, Zhengchao
    Zhao, Sanyuan
    Song, Hongmei
    Shen, Jianbing
    MACHINE VISION AND APPLICATIONS, 2018, 29 (05) : 861 - 871
  • [2] Scene text recognition using residual convolutional recurrent neural network
    Zhengchao Lei
    Sanyuan Zhao
    Hongmei Song
    Jianbing Shen
    Machine Vision and Applications, 2018, 29 : 861 - 871
  • [3] Accurate Scene Text Recognition Based on Recurrent Neural Network
    Su, Bolan
    Lu, Shijian
    COMPUTER VISION - ACCV 2014, PT I, 2015, 9003 : 35 - 48
  • [4] An Attention-Based Convolutional Recurrent Neural Networks for Scene Text Recognition
    Alshawi, Adil Abdullah Abdulhussein
    Tanha, Jafar
    Balafar, Mohammad Ali
    IEEE ACCESS, 2024, 12 : 8123 - 8134
  • [5] Cursive Text Recognition in Natural Scene Images Using Deep Convolutional Recurrent Neural Network
    Chandio, Asghar Ali
    Asikuzzaman, MD.
    Pickering, Mark R.
    Leghari, Mehwish
    IEEE ACCESS, 2022, 10 : 10062 - 10078
  • [6] Text Baseline Recognition Using a Recurrent Convolutional Neural Network
    Woedlinger, Matthias
    Sablatnig, Robert
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4673 - 4679
  • [7] Scene Classification with Simple Machine Learning and Convolutional Neural Network
    Yosboon, Simon
    2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, : 616 - 619
  • [8] Convolutional recurrent neural networks with hidden Markov model bootstrap for scene text recognition
    Wang, Fenglei
    Guo, Qiang
    Lei, Jun
    Zhang, Jun
    IET COMPUTER VISION, 2017, 11 (06) : 497 - 504
  • [9] RECURRENT GLOBAL CONVOLUTIONAL NETWORK FOR SCENE TEXT DETECTION
    Mohanty, Sabyasachi
    Dutta, Tanima
    Gupta, Hari Prabhat
    2018 25TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2018, : 2750 - 2754
  • [10] Attention-Based Deep Neural Network and Its Application to Scene Text Recognition
    He, Haizhen
    Li, Jiehan
    2019 IEEE 11TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2019), 2019, : 672 - 677