Reading Numbers in Natural Scene Images with Convolutional Neural Networks

被引:0
|
作者
Guo, Qiang [1 ]
Lei, Jun [1 ]
Tu, Dan [1 ]
Li, Guohui [1 ]
机构
[1] Natl Univ Def Technol, Dept Informat Syst & Management, Changsha, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reading text from natural images is a hard computer vision task. We present a method for applying deep convolutional neural networks to recognize numbers in natural scene images. In this paper, we proposed a noval method to eliminating the need of explicit segmentation when deal with multi-digit number recognition in natural scene images. Convolution Neural Network(CNN) requires fixed dimensional input while number images contain unknown amount of digits. Our method integrats CNN with probabilistic graphical model to deal with the problem. We use hidden Markov model(HMM) to model the image and use CNN to model digits appearance. This method combines the advantages of both the two models and make them fit to the problem. By using this method we can perform the training and recognition procedure both at word level. There is no explicit segmentation operation at all which save lots of labour for sophisticated segmentation algorithm design or finegrained character labeling. Experiments show that deep CNN can dramaticly improve the performance compared with using Gaussian Mixture model as the digit model. We obtaied competitive results on the street view house number(SVHN) dataset.
引用
收藏
页码:48 / 53
页数:6
相关论文
共 50 条
  • [41] Semisupervised Scene Classification for Remote Sensing Images: A Method Based on Convolutional Neural Networks and Ensemble Learning
    Dai, Xueyuan
    Wu, Xiaofeng
    Wang, Bin
    Zhang, Liming
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (06) : 869 - 873
  • [42] SAR Target Recognition in Large Scene Images via Region-Based Convolutional Neural Networks
    Cui, Zongyong
    Dang, Sihang
    Cao, Zongjie
    Wang, Sifei
    Liu, Nengyuan
    REMOTE SENSING, 2018, 10 (05)
  • [43] Sparse representation with spike convolutional neural networks for scene classification of remote sensing images of high resolution
    Zhang Z.-Y.
    Cao W.-H.
    Zhu R.
    Hu W.-K.
    Wu M.
    Kongzhi yu Juece/Control and Decision, 2022, 37 (09): : 2305 - 2313
  • [44] AN APPLICATION OF NEURAL NETWORKS TO NATURAL SCENE SEGMENTATION
    VICENS, M
    ALBERT, J
    ARNAU, V
    LECTURE NOTES IN COMPUTER SCIENCE, 1991, 540 : 333 - 339
  • [45] Grayscale images colorization with convolutional neural networks
    Jiancheng An
    Koffi Gagnon Kpeyiton
    Qingnan Shi
    Soft Computing, 2020, 24 : 4751 - 4758
  • [46] Grayscale images colorization with convolutional neural networks
    An, Jiancheng
    Kpeyiton, Koffi Gagnon
    Shi, Qingnan
    SOFT COMPUTING, 2020, 24 (07) : 4751 - 4758
  • [47] Distinguishing Between Natural and Computer-Generated Images Using Convolutional Neural Networks
    Quan, Weize
    Wang, Kai
    Yan, Dong-Ming
    Zhang, Xiaopeng
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2018, 13 (11) : 2772 - 2787
  • [48] Siamese Convolutional Neural Networks for Remote Sensing Scene Classification
    Liu, Xuning
    Zhou, Yong
    Zhao, Jiaqi
    Yao, Rui
    Liu, Bing
    Zheng, Yi
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (08) : 1200 - 1204
  • [49] Outdoor Scene Labeling Using Deep Convolutional Neural Networks
    Wen Jun
    Zhong Chaolliang
    Liu Shirong
    Wang Jian
    2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 3953 - 3958
  • [50] Dance Art Scene Classification Based on Convolutional Neural Networks
    Li, Le
    SCIENTIFIC PROGRAMMING, 2022, 2022