A Convolutional Recurrent Neural-Network-Based Machine Learning for Scene Text Recognition Application

被引:6
|
作者
Liu, Yiyi [1 ]
Wang, Yuxin [1 ]
Shi, Hongjian [1 ]
机构
[1] Beijing Normal Univ Hong Kong Baptist Univ United, Guangdong Prov Key Lab Interdisciplinary Res & App, Zhuhai 519087, Peoples R China
来源
SYMMETRY-BASEL | 2023年 / 15卷 / 04期
关键词
CRNN; DBNet; OCR; Retinex;
D O I
10.3390/sym15040849
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Optical character recognition (OCR) is the process of acquiring text and layout information through analysis and recognition of text data image files. It is also a process to identify the geometric location and orientation of the texts and their symmetrical behavior. It usually consists of two steps: text detection and text recognition. Scene text recognition is a subfield of OCR that focuses on processing text in natural scenes, such as streets, billboards, license plates, etc. Unlike traditional document category photographs, it is a challenging task to use computer technology to locate and read text information in natural scenes. Imaging sequence recognition is a longstanding subject of research in the field of computer vision. Great progress has been made in this field; however, most models struggled to recognize text in images of complex scenes with high accuracy. This paper proposes a new pattern of text recognition based on the convolutional recurrent neural network (CRNN) as a solution to address this issue. It combines real-time scene text detection with differentiable binarization (DBNet) for text detection and segmentation, text direction classifier, and the Retinex algorithm for image enhancement. To evaluate the effectiveness of the proposed method, we performed experimental analysis of the proposed algorithm, and carried out simulation on complex scene image data based on existing literature data and also on several real datasets designed for a variety of nonstationary environments. Experimental results demonstrated that our proposed model performed better than the baseline methods on three benchmark datasets and achieved on-par performance with other approaches on existing datasets. This model can solve the problem that CRNN cannot identify text in complex and multi-oriented text scenes. Furthermore, it outperforms the original CRNN model with higher accuracy across a wider variety of application scenarios.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] An Improved Convolutional Neural Network-Based Scene Image Recognition Method
    Wang, Pinhe
    Qiao, Jianzhong
    Liu, Nannan
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [32] An Improved Convolutional Neural Network-Based Scene Image Recognition Method
    Wang, Pinhe
    Qiao, Jianzhong
    Liu, Nannan
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [33] Sequence Recognition of Natural Scene House Number Based on Convolutional Neural Network
    Zhong, Juping
    Gao, Jing
    Fang, Guoxin
    Zhao, Huimin
    Li, Jun
    ELEVENTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2019), 2019, 11179
  • [34] Convolutional Neural Network Based on Extreme Learning Machine for Maritime Ships Recognition in Infrared Images
    Khellal, Atmane
    Ma, Hongbin
    Fei, Qing
    SENSORS, 2018, 18 (05)
  • [35] Chip surface character recognition based on convolutional recurrent neural network
    Xiong F.
    Chen T.
    Bian B.-C.
    Liu J.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (05): : 948 - 956
  • [36] A Neural-Network-based Sketch Recognition System
    Su, Mu-Chun
    Hsio, Ting-Huan
    Hsieh, Yi-Zeng
    Lin, Shih-Chieh
    Chou, Chien-Hsing
    IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS SYSTEMS (ISPACS 2012), 2012,
  • [37] Facial Expressions Recognition through Convolutional Neural Network and Extreme Learning Machine
    Jammoussi, Imen
    Ben Nasr, Mounir
    Chtourou, Mohamed
    PROCEEDINGS OF THE 2020 17TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD 2020), 2020, : 162 - 166
  • [38] Scene Recognition from Image Using Convolutional Neural Network
    Masood, Sarfaraz
    Ahsan, Umer
    Munawwar, Fatima
    Rizvi, Danish Raza
    Ahmed, Mumtaz
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND DATA SCIENCE, 2020, 167 : 1005 - 1012
  • [39] Face Recognition Based on Convolutional Neural Network and Support Vector Machine
    Guo, Shanshan
    Chen, Shiyu
    Li, Yanjie
    2016 IEEE INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION (ICIA), 2016, : 1787 - 1792
  • [40] Fully Convolutional Recurrent Network for Handwritten Chinese Text Recognition
    Xie, Zecheng
    Sun, Zenghui
    Jin, Lianwen
    Feng, Ziyong
    Zhang, Shuye
    2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 4011 - 4016