Reading Numbers in Natural Scene Images with Convolutional Neural Networks

被引:0
|
作者
Guo, Qiang [1 ]
Lei, Jun [1 ]
Tu, Dan [1 ]
Li, Guohui [1 ]
机构
[1] Natl Univ Def Technol, Dept Informat Syst & Management, Changsha, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reading text from natural images is a hard computer vision task. We present a method for applying deep convolutional neural networks to recognize numbers in natural scene images. In this paper, we proposed a noval method to eliminating the need of explicit segmentation when deal with multi-digit number recognition in natural scene images. Convolution Neural Network(CNN) requires fixed dimensional input while number images contain unknown amount of digits. Our method integrats CNN with probabilistic graphical model to deal with the problem. We use hidden Markov model(HMM) to model the image and use CNN to model digits appearance. This method combines the advantages of both the two models and make them fit to the problem. By using this method we can perform the training and recognition procedure both at word level. There is no explicit segmentation operation at all which save lots of labour for sophisticated segmentation algorithm design or finegrained character labeling. Experiments show that deep CNN can dramaticly improve the performance compared with using Gaussian Mixture model as the digit model. We obtaied competitive results on the street view house number(SVHN) dataset.
引用
收藏
页码:48 / 53
页数:6
相关论文
共 50 条
  • [31] Depth in convolutional neural networks solves scene segmentation
    Seijdel, Noor
    Tsakmakidis, Nikos
    de Haan, Edward H. F.
    Bohte, Sander M.
    Scholte, H. Steven
    PLOS COMPUTATIONAL BIOLOGY, 2020, 16 (07)
  • [32] SCENE TEXT RECOGNITION WITH DEEPER CONVOLUTIONAL NEURAL NETWORKS
    Zhang, Yuqi
    Wang, Wei
    Wang, Liang
    Wang, Liuan
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 2384 - 2388
  • [33] Scene text detection with fully convolutional neural networks
    Liu, Zhandong
    Zhou, Wengang
    Li, Houqiang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (13) : 18205 - 18227
  • [34] Lip reading with Hahn Convolutional Neural Networks
    Mesbah, Abderrahim
    Berrahou, Aissam
    Hammouchi, Hicham
    Berbia, Hassan
    Qjidaa, Hassan
    Daoudi, Mohamed
    IMAGE AND VISION COMPUTING, 2019, 88 : 76 - 83
  • [35] Reading Text in the Wild with Convolutional Neural Networks
    Max Jaderberg
    Karen Simonyan
    Andrea Vedaldi
    Andrew Zisserman
    International Journal of Computer Vision, 2016, 116 : 1 - 20
  • [36] Reading Text in the Wild with Convolutional Neural Networks
    Jaderberg, Max
    Simonyan, Karen
    Vedaldi, Andrea
    Zisserman, Andrew
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2016, 116 (01) : 1 - 20
  • [37] Convolutional neural networks for automatic meter reading
    Laroca, Rayson
    Barroso, Victor
    Diniz, Matheus A.
    Goncalves, Gabriel R.
    Schwartz, William Robson
    Menotti, David
    JOURNAL OF ELECTRONIC IMAGING, 2019, 28 (01)
  • [38] Combining Multilevel Contexts of Superpixel Using Convolutional Neural Networks to Perform Natural Scene Labeling
    Das, Aritra
    Ghosh, Swarnendu
    Sarkhel, Ritesh
    Choudhuri, Sandipan
    Das, Nibaran
    Nasipuri, Mita
    RECENT DEVELOPMENTS IN MACHINE LEARNING AND DATA ANALYTICS, 2019, 740 : 297 - 306
  • [39] CRF based text detection for natural scene images using convolutional neural network and context information
    Wang, Yanna
    Shi, Cunzhao
    Xiao, Baihua
    Wang, Chunheng
    Qi, Chengzuo
    NEUROCOMPUTING, 2018, 295 : 46 - 58
  • [40] Scene Classification of Remotely Sensed Images via Densely Connected Convolutional Neural Networks and an Ensemble Classifier
    Cheng, Qimin
    Xu, Yuan
    Fu, Peng
    Li, Jinling
    Wang, Wei
    Ren, Yingchao
    PHOTOGRAMMETRIC ENGINEERING AND REMOTE SENSING, 2021, 87 (04): : 295 - 308