Reading Numbers in Natural Scene Images with Convolutional Neural Networks

Cited by: 0
Authors
Guo, Qiang [1]
Lei, Jun [1]
Tu, Dan [1]
Li, Guohui [1]
Affiliations
[1] Natl Univ Def Technol, Dept Informat Syst & Management, Changsha, Peoples R China
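Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract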
Reading text from natural images is a hard computer vision task. We present a method for applying deep convolutional neural networks (CNNs) to recognize numbers in natural scene images. We propose a novel method that eliminates the need for explicit segmentation in multi-digit number recognition. A CNN requires fixed-dimensional input, whereas number images contain an unknown number of digits. Our method integrates a CNN with a probabilistic graphical model to address this problem: a hidden Markov model (HMM) models the image, and the CNN models digit appearance. The method combines the advantages of both models and adapts them to the problem. With it, both training and recognition can be performed at the word level. No explicit segmentation is performed, which saves the considerable labor of designing sophisticated segmentation algorithms or producing fine-grained character labels. Experiments show that a deep CNN dramatically improves performance compared with using a Gaussian mixture model as the digit model. We obtained competitive results on the Street View House Numbers (SVHN) dataset.
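The segmentation-free idea in the abstract can be illustrated with a toy decoder. The sketch below is not the authors' implementation: the state set (ten digits plus a hypothetical background state), the transition probabilities, and the hand-made per-frame scores standing in for CNN outputs are all assumptions chosen for illustration. It shows only how Viterbi decoding over per-frame digit scores can read a number without cutting the image into characters first.

```python
import math

BACKGROUND = 10  # hypothetical extra state: "no digit under this window"
NUM_STATES = 11  # digits 0-9 plus the background state


def peaked(state, p=0.9):
    """Toy stand-in for a CNN output: log scores concentrated on one state."""
    rest = (1.0 - p) / (NUM_STATES - 1)
    return [math.log(p if s == state else rest) for s in range(NUM_STATES)]


def viterbi(log_emissions, stay_prob=0.6):
    """Most likely state path given per-frame log scores (assumed transitions)."""
    log_stay = math.log(stay_prob)
    log_move = math.log((1.0 - stay_prob) / (NUM_STATES - 1))
    # Uniform initial distribution over states.
    score = [math.log(1.0 / NUM_STATES) + e for e in log_emissions[0]]
    backptrs = []
    for frame in log_emissions[1:]:
        new_score, ptr = [], []
        for s in range(NUM_STATES):
            # Best predecessor of state s under the stay/move transition model.
            best_prev = max(
                range(NUM_STATES),
                key=lambda p: score[p] + (log_stay if p == s else log_move),
            )
            trans = log_stay if best_prev == s else log_move
            new_score.append(score[best_prev] + trans + frame[s])
            ptr.append(best_prev)
        score = new_score
        backptrs.append(ptr)
    # Trace back from the best final state.
    state = max(range(NUM_STATES), key=lambda s: score[s])
    path = [state]
    for ptr in reversed(backptrs):
        state = ptr[state]
        path.append(state)
    path.reverse()
    return path


def collapse(path):
    """Merge runs of the same state and drop background to get the digit string."""
    digits, prev = [], None
    for s in path:
        if s != prev and s != BACKGROUND:
            digits.append(str(s))
        prev = s
    return "".join(digits)


# Eight sliding-window frames over an imagined image of the number "42".
toy_frames = [peaked(BACKGROUND), peaked(4), peaked(4), peaked(BACKGROUND),
              peaked(2), peaked(2), peaked(2), peaked(BACKGROUND)]

if __name__ == "__main__":
    print(collapse(viterbi(toy_frames)))  # prints 42
```

In this simplified topology a repeated digit such as "11" is recovered only if a background frame separates the two runs; a fuller model would need per-digit sub-states or explicit duration modeling.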
Pages: 48-53
Page count: 6
Related papers
50 in total
  • [21] Augmented Text Character Proposals and Convolutional Neural Networks for Text Spotting from Scene Images
    Zamberletti, Alessandro
    Gallo, Ignazio
    Noce, Lucia
    PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 196 - 200
  • [22] Visualizing Deep Convolutional Neural Networks Using Natural Pre-images
    Aravindh Mahendran
    Andrea Vedaldi
    International Journal of Computer Vision, 2016, 120 : 233 - 255
  • [23] Text Detection in Natural Images with Convolutional Neural Networks and Synthetic Training Data
    Grond, Marco
    Brink, Willie
    Herbst, Ben
    2016 PATTERN RECOGNITION ASSOCIATION OF SOUTH AFRICA AND ROBOTICS AND MECHATRONICS INTERNATIONAL CONFERENCE (PRASA-ROBMECH), 2016,
  • [24] Visualizing Deep Convolutional Neural Networks Using Natural Pre-images
    Mahendran, Aravindh
    Vedaldi, Andrea
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 120 (03) : 233 - 255
  • [25] Applying Convolutional Neural Networks to Detect Natural Gas Leaks in Wellhead Images
    Melo, Roberlanio Oliveira
    Costa, M. G. F.
    Costa Filho, Cicero F. F.
    IEEE ACCESS, 2020, 8 (08) : 191775 - 191784
  • [26] Convolutional Neural Networks and Transfer Learning Based Classification of Natural Landscape Images
    Krstinic, Damir
    Braovic, Maja
    Bozic-Stulic, Dunja
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2020, 26 (02) : 244 - 267
  • [27] DYNAMIC SCENE CLASSIFICATION USING CONVOLUTIONAL NEURAL NETWORKS
    Gangopadhyay, Aalok
    Tripathi, Shivam Mani
    Jindal, Ishan
    Raman, Shanmuganathan
    2016 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2016, : 1255 - 1259
  • [28] Shallow Convolutional Neural Networks for Acoustic Scene Classification
    LU Lu
    YANG Yuhong
    JIANG Yuzhi
    AI Haojun
    TU Weiping
    Wuhan University Journal of Natural Sciences, 2018, 23 (02) : 178 - 184
  • [29] Scene text detection with fully convolutional neural networks
    Zhandong Liu
    Wengang Zhou
    Houqiang Li
    Multimedia Tools and Applications, 2019, 78 : 18205 - 18227
  • [30] Acoustic Scene Recognition Based on Convolutional Neural Networks
    Sun, Fengjiao
    Wang, Mingjiang
    Xu, Qihang
    Xuan, Xiaogung
    Zhang, Xin
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2019), 2019, : 122 - 126