Reading Numbers in Natural Scene Images with Convolutional Neural Networks

被引:0
|
作者
Guo, Qiang [1 ]
Lei, Jun [1 ]
Tu, Dan [1 ]
Li, Guohui [1 ]
机构
[1] Natl Univ Def Technol, Dept Informat Syst & Management, Changsha, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reading text from natural images is a hard computer vision task. We present a method for applying deep convolutional neural networks to recognize numbers in natural scene images. In this paper, we proposed a noval method to eliminating the need of explicit segmentation when deal with multi-digit number recognition in natural scene images. Convolution Neural Network(CNN) requires fixed dimensional input while number images contain unknown amount of digits. Our method integrats CNN with probabilistic graphical model to deal with the problem. We use hidden Markov model(HMM) to model the image and use CNN to model digits appearance. This method combines the advantages of both the two models and make them fit to the problem. By using this method we can perform the training and recognition procedure both at word level. There is no explicit segmentation operation at all which save lots of labour for sophisticated segmentation algorithm design or finegrained character labeling. Experiments show that deep CNN can dramaticly improve the performance compared with using Gaussian Mixture model as the digit model. We obtaied competitive results on the street view house number(SVHN) dataset.
引用
收藏
页码:48 / 53
页数:6
相关论文
共 50 条
  • [1] Reading Text in Natural Scene Images via Deep Neural Networks
    Zhao, Haifeng
    Hu, Yong
    Zhang, Jinxia
    PROCEEDINGS 2017 4TH IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2017, : 43 - 48
  • [2] Text Detection and Recognition for Natural Scene Images Using Deep Convolutional Neural Networks
    Wu, Xianyu
    Luo, Chao
    Zhang, Qian
    Zhou, Jiliu
    Yang, Hao
    Li, Yulian
    CMC-COMPUTERS MATERIALS & CONTINUA, 2019, 61 (01): : 289 - 300
  • [3] Sentiment Prediction in Scene Images via Convolutional Neural Networks
    Yao, Junfeng
    Yu, Yao
    Xue, Xiaoling
    2016 31ST YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2016, : 196 - 200
  • [4] Natural Scene Digit Classification Using Convolutional Neural Networks
    Wang, Ziqin
    Jiang, Peilin
    Zhang, Xuetao
    Wang, Fei
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2016, PT II, 2016, 9772 : 311 - 321
  • [5] Object-Scene Convolutional Neural Networks for Event Recognition in Images
    Wang, Limin
    Wang, Zhe
    Du, Wenbin
    Qiao, Yu
    2015 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2015,
  • [6] Urdu Natural Scene Character Recognition using Convolutional Neural Networks
    Ali, Asghar
    Pickering, Mark
    Shafi, Kamran
    2018 IEEE 2ND INTERNATIONAL WORKSHOP ON ARABIC AND DERIVED SCRIPT ANALYSIS AND RECOGNITION (ASAR), 2018, : 29 - 34
  • [7] Thai Text Localization in Natural Scene Images using Convolutional Neural Network
    Kobchaisawat, Thananop
    Chalidabhongse, Thanarat H.
    2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2014,
  • [8] Scene Classification of Remote Sensing Images Based on Integrated Convolutional Neural Networks
    Zhang Xiaonan
    Zhong Xing
    Zhu Ruifei
    Gao Fang
    Zhang Zuoxing
    Bao Songze
    Li Zhuqiang
    ACTA OPTICA SINICA, 2018, 38 (11)
  • [9] Learning two-pathway convolutional neural networks for categorizing scene images
    Bai, Shuang
    Li, Zhaohong
    Hou, Jianjun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (15) : 16145 - 16162
  • [10] Learning two-pathway convolutional neural networks for categorizing scene images
    Shuang Bai
    Zhaohong Li
    Jianjun Hou
    Multimedia Tools and Applications, 2017, 76 : 16145 - 16162