Text localization and recognition of Chinese characters in natural scenes based on improved faster R-CNN

被引:0
|
作者
Li, Yuejie [1 ,2 ]
Liu, Chang'an [3 ]
Li, Shijun [4 ]
机构
[1] Ordos Inst Technol, Ordos, Inner Mongolia, Peoples R China
[2] North China Elect Power Univ, Beijing, Peoples R China
[3] North China Univ Technol, Beijing, Peoples R China
[4] Hunan Inst Engn, Xiangtan, Peoples R China
关键词
Text recognition; deep learning; algorithm optimization; Faster R-CNN;
D O I
10.3233/JIFS-233700
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Text detection and recognition are widely used in daily life. Although it is a very rich market, it has very difficult challenges in practical application. The complex and changeable natural scenes lead to the complex background of the text in the image, which also reflects the research value of text information recognition and extraction in natural scenes. To solve the many problems faced by the recognition of Chinese characters, such as complex shapes and diverse structures, this paper uses VGG16 to extract features and introduces a two-layer bidirectional LSTM network. It improves Faster R-CNN by using a RPN to extract candidate boxes and adjust the position of candidate regions. In this paper, the improved model Faster BLSTM-CNN is tested, and the effectiveness experiment of feature extraction, the difference comparison before and after the improvement of the algorithm, and the comparison experiment with the traditional recognition algorithm are carried out respectively. And it finally carried out an experimental comparison of the combination of text recognition and positioning, and obtained the results. The algorithm Faster BLSTM-CNN in this paper is better in the localization and recognition of Chinese characters in the dataset. In the natural scene, the recognition rate of Faster BLSTM-CNN in this paper is 81.54%, the positioning accuracy is 88.14%, and the detection speed is 86 ms, which has improved performance. Therefore, the improvement of Faster R-CNN is effective. It can effectively locate and recognize Chinese characters in natural scenes.
引用
收藏
页码:8623 / 8636
页数:14
相关论文
共 50 条
  • [1] Method for Detecting Chinese Texts in Natural Scenes Based on Improved Faster R-CNN
    Liu, Shuhua
    Ban, Hua
    Song, Yu
    Zhang, Mengyu
    Yang, Fengqin
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2020, 34 (02)
  • [2] Handwriting Text Recognition Based on Faster R-CNN
    Yang, Junqing
    Ren, Peng
    Kong, Xiaoxiao
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2450 - 2454
  • [3] Improved Localization Accuracy by LocNet for Faster R-CNN Based Text Detection
    Zhong, Zhuoyao
    Sun, Lei
    Huo, Qiang
    2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1, 2017, : 923 - 928
  • [4] Improved localization accuracy by LocNet for Faster R-CNN based text detection in natural scene images
    Zhong, Zhuoyao
    Sun, Lei
    Huo, Qiang
    PATTERN RECOGNITION, 2019, 96
  • [5] Strawberry R-CNN: Recognition and counting model of strawberry based on improved faster R-CNN
    Li, Jiajun
    Zhu, Zifeng
    Liu, Hongxin
    Su, Yurong
    Deng, Limiao
    ECOLOGICAL INFORMATICS, 2023, 77
  • [6] Railway Catenary Insulator Recognition Based on Improved Faster R-CNN
    Ju, Jiang Xiang
    Liang, Du Xiao
    AUTOMATIC CONTROL AND COMPUTER SCIENCES, 2022, 56 (06) : 553 - 563
  • [7] A Mountain Summit Recognition Method Based on Improved Faster R-CNN
    Kong, Yueping
    Wang, Yun
    Guo, Song
    Wang, Jiajing
    COMPLEXITY, 2021, 2021
  • [8] Recognition Method for Potato Buds Based on Improved Faster R-CNN
    Xi R.
    Jiang K.
    Zhang W.
    Lü Z.
    Hou J.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2020, 51 (04): : 216 - 223
  • [9] Railway Catenary Insulator Recognition Based on Improved Faster R-CNN
    Automatic Control and Computer Sciences, 2022, 56 : 553 - 563
  • [10] CAPTCHA Recognition Based on Faster R-CNN
    Du, Feng-Lin
    Li, Jia-Xing
    Yang, Zhi
    Chen, Peng
    Wang, Bing
    Zhang, Jun
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2017, PT II, 2017, 10362 : 597 - 605