Deep Convolution Neural Network Method for Skew Angle Detection in Text Images

被引:0
|
作者
Guo Congzhou [1 ]
Li Ke [1 ]
Zhu Yikun [1 ]
Tong Xiaochong [2 ]
Wang Xiwen [1 ]
机构
[1] Informat Engn Univ, Dept Basic, Zhengzhou 450001, Henan, Peoples R China
[2] Informat Engn Univ, Sch Surveying & Mapping, Zhengzhou 450001, Henan, Peoples R China
关键词
image processing; text image; skew angle; convolution neural network; skew correction; character recognition;
D O I
10.3788/L0P202158.1410007
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Text image skew correction is an important preprocessing step in the front-end of character recognition. To overcome the disadvantage in the limited range of tilt angle detection of the existing methods which is only 90-90, this study transforms the text image skew angle detection problem into a skew angle class detection problem. Several types of skew angle classes of text images are detected using the classification function of deep convolution neural network by selecting the appropriate loss function and designing the detection structures of onestage two classification and multi-stage multi-classification, and then getting the tilt angle range of the text image. The experimental results show that the tilt angle class's detection accuracy, recall, and precision rates are all above 0.93. The classical deep learning method is used to recognize the text image after skew correction. Moreover, the recognition accuracy is greatly improved compared to that before the correction.
引用
收藏
页数:8
相关论文
共 14 条
  • [1] Airplane Detection of Optical Remote Sensing Images Based on Deep Learning
    Dong Yongfeng
    Zhang Changtao
    Wang Peng
    Feng Zhe
    [J]. LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (04)
  • [2] Duan L, 2017, J SOFTWARE, V28
  • [3] [荆雷 Jing Lei], 2010, [激光与红外, Laser and Infrared], V40, P1116
  • [4] Lin M, 2020, NETWORK NETWORK EB O
  • [5] MORAN: A Multi-Object Rectified Attention Network for scene text recognition
    Luo, Canjie
    Jin, Lianwen
    Sun, Zenghui
    [J]. PATTERN RECOGNITION, 2019, 90 : 109 - 118
  • [6] Mask TextSpotter: An End-to-End Trainable Neural Network for Spotting Text with Arbitrary Shapes
    Lyu, Pengyuan
    Liao, Minghui
    Yao, Cong
    Wu, Wenhao
    Bai, Xiang
    [J]. COMPUTER VISION - ECCV 2018, PT XIV, 2018, 11218 : 71 - 88
  • [7] Minghui Liao, 2020, Computer Vision - ECCV 2020 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12356), P706, DOI 10.1007/978-3-030-58621-8_41
  • [8] ASTER: An Attentional Scene Text Recognizer with Flexible Rectification
    Shi, Baoguang
    Yang, Mingkun
    Wang, Xinggang
    Lyu, Pengyuan
    Yao, Cong
    Bai, Xiang
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (09) : 2035 - 2048
  • [9] An End-to-End Trainable Neural Network for Image-Based Sequence Recognition and Its Application to Scene Text Recognition
    Shi, Baoguang
    Bai, Xiang
    Yao, Cong
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (11) : 2298 - 2304
  • [10] [吴一全 WU Yiquan], 2009, [光学技术, Optical Technology], V35, P152