Chinese Character CAPTCHA Recognition and performance estimation via deep neural network

Cited by: 29
Authors
Lin, Dazhen [1, 2, 4]
Lin, Fan [3]
Lv, Yanping [1, 2]
Cai, Feipeng [1, 2]
Cao, Donglin [1, 2, 4]
Affiliations
[1] Xiamen Univ, Cognit Sci Dept, Xiamen, Peoples R China
[2] Fujian Key Lab Machine Intelligence & Robot, Xiamen, Peoples R China
[3] Xiamen Univ, Software Sch, Xiamen, Peoples R China
[4] Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, Fuzhou, Fujian, Peoples R China
Keywords
Chinese Character CAPTCHA Recognition; Web application; Performance estimation; Exponential relationship
DOI
10.1016/j.neucom.2017.02.105
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
To distinguish humans from machines, the Completely Automated Public Turing test to tell Computers and Humans Apart (CAPTCHA) is increasingly used in web applications. Classical CAPTCHAs based on English letters and digits can already be recognized automatically with high accuracy. Because the complexity of Chinese characters makes automatic recognition much harder, a growing number of Chinese websites use Chinese Character CAPTCHAs. To recognize Chinese Character CAPTCHAs, we propose a Convolutional Neural Network (CNN) based approach that learns stroke, radical and character features of Chinese characters, and we show that our network structure is superior to LeNet-5 on this task. Furthermore, we formulate the relation among accuracy, the number of training samples and the number of iterations, which is used to estimate the performance of our approach. First, this approach greatly improves the recognition accuracy of Chinese Character CAPTCHAs with distortion, rotation and background noise. Our experimental results show that it achieves over 95% accuracy for single Chinese characters and 84% accuracy for three types of Chinese Character CAPTCHAs containing four Chinese characters. Second, our experimental results and theoretical analysis show that recognition accuracy has an exponential relationship with the product of the number of training samples and the number of iterations, provided that the training samples are sufficient and representative. Therefore, we can estimate the training time needed to reach a given accuracy. Finally, we demonstrate that our approach is superior to Hanvon, the best-known Chinese Optical Character Recognition (OCR) software, in recognizing Chinese Character CAPTCHAs. (C) 2018 Elsevier B.V. All rights reserved.
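The performance-estimation idea in the abstract can be illustrated with a minimal sketch. The abstract does not state the exact formula, so the saturating form acc = 1 - b * exp(-c * x), with x the product of the number of training samples and the number of iterations, is an assumption made here for illustration only. The function names, the parameters b and c, and the data points are hypothetical; the data are synthetic values generated from the assumed model, not results from the paper.

```python
import math


def fit_saturating_exponential(products, accuracies):
    """Fit acc = 1 - b * exp(-c * x) (an assumed form, not the paper's formula).

    Taking logs gives log(1 - acc) = log(b) - c * x, which is a straight line
    in x, so ordinary least squares recovers b and c.
    """
    ys = [math.log(1.0 - a) for a in accuracies]
    n = len(products)
    mean_x = sum(products) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in products)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(products, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return math.exp(intercept), -slope  # (b, c)


def required_product(b, c, target_accuracy):
    """Solve 1 - b * exp(-c * x) = target_accuracy for x."""
    return math.log(b / (1.0 - target_accuracy)) / c


# Synthetic, illustrative measurements generated from the assumed model.
TRUE_B, TRUE_C = 0.9, 2e-6
xs = [1e5, 5e5, 1e6, 2e6]  # hypothetical samples * iterations values
accs = [1.0 - TRUE_B * math.exp(-TRUE_C * x) for x in xs]

b, c = fit_saturating_exponential(xs, accs)
x_needed = required_product(b, c, 0.95)
print(f"fitted b={b:.3f}, c={c:.3e}")
print(f"samples * iterations needed for 95% accuracy: {x_needed:.3e}")
```

Under this assumed model, dividing the required product by the number of available training samples gives a rough iteration count, which is how a training-time estimate for a target accuracy could be derived.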
Pages: 11-19
Number of pages: 9
Related papers
50 in total
  • [1] Chinese Character CAPTCHA Recognition Based on Convolution Neural Network
    Lv, Yanping
    Cai, Feipeng
    Lin, Dazhen
    Cao, Donglin
    2016 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2016, : 4854 - 4859
  • [2] CAPTCHA recognition based on deep convolutional neural network
    Wang, Jing
    Qin, Jiaohua
    Xiang, Xuyu
    Tan, Yun
    Pan, Nan
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2019, 16 (05) : 5851 - 5861
  • [3] Deep Neural Networks for Handwritten Chinese Character Recognition
    Maidana, Renan G.
    Monteiro, Juarez
    Granada, Roger
    Amory, Alexandre M.
    Barros, Rodrigo C.
    2017 6TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2017, : 192 - 197
  • [4] Deep Matching Network for Handwritten Chinese Character Recognition
    Li, Zhiyuan
    Wu, Qi
    Xiao, Yi
    Jin, Min
    Lu, Huaxiang
    Pattern Recognition, 2020, 107
  • [6] Printed Chinese optical character recognition by neural network
    Zhao, MS
    Wu, YS
    ICONIP'98: THE FIFTH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING JOINTLY WITH JNNS'98: THE 1998 ANNUAL CONFERENCE OF THE JAPANESE NEURAL NETWORK SOCIETY - PROCEEDINGS, VOLS 1-3, 1998, : 1090 - 1093
  • [7] Improved Optical Character Recognition with Deep Neural Network
    Wei, Tan Chiang
    Sheikh, U. U.
    Ab Rahman, Ab Al-Hadi
    2018 IEEE 14TH INTERNATIONAL COLLOQUIUM ON SIGNAL PROCESSING & ITS APPLICATIONS (CSPA 2018), 2018, : 245 - 249
  • [8] Character Recognition via a Compact Convolutional Neural Network
    Zhao, Haifeng
    Hu, Yong
    Zhang, Jinxia
    2017 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING - TECHNIQUES AND APPLICATIONS (DICTA), 2017, : 793 - 798
  • [9] A neural network based classifier for handwritten Chinese character recognition
    Wu, MR
    Zhang, B
    Zhang, L
    15TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS: PATTERN RECOGNITION AND NEURAL NETWORKS, 2000, : 561 - 564
  • [10] Handwritten Chinese Character Recognition Based on Residual Neural Network
    Li, Min
    PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION TECHNOLOGY AND COMPUTER ENGINEERING, EITCE 2023, 2023, : 1715 - 1719