Using generative models for handwritten digit recognition

被引:96
|
作者
Revow, M [1 ]
Williams, CKI [1 ]
Hinton, GE [1 ]
机构
[1] ASTON UNIV, DEPT COMP SCI & APPL MATH, BIRMINGHAM B4 7ET, W MIDLANDS, ENGLAND
关键词
deformable model; elastic net; optical character recognition; generative model; probabilistic model; mixture model;
D O I
10.1109/34.506410
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We describe a method of recognizing handwritten digits by fitting generative models that are built from deformable B-splines with Gaussian ''ink generators'' spaced along the length of the spline. The splines are adjusted using a novel elastic matching procedure based on the Expectation Maximization (EM) algorithm that maximizes the likelihood of the model generating the data. This approach has many advantages. 1) After identifying the model most likely to have generated the data, the system not only produces a classification of the digit but also a rich description of the instantiation parameters which can yield information such as the writing style. 2) During the process of explaining the image, generative models can perform recognition driven segmentation. 3) The method involves a relatively small number or parameters and hence training is relatively easy and fast. 4) Unlike many other recognition schemes, if does not rely on some form of pre-normalization of input images, but can handle arbitrary scalings, translations and a limited degree of image rotation. We have demonstrated our method of fitting models to images does not get trapped in poor local minima. The main disadvantage of the method is it requires much more computation than more standard OCR techniques.
引用
收藏
页码:592 / 606
页数:15
相关论文
共 50 条
  • [1] Data augmentation for handwritten digit recognition using generative adversarial networks
    Ganesh Jha
    Hubert Cecotti
    [J]. Multimedia Tools and Applications, 2020, 79 : 35055 - 35068
  • [2] Data augmentation for handwritten digit recognition using generative adversarial networks
    Jha, Ganesh
    Cecotti, Hubert
    [J]. Multimedia Tools and Applications, 2020, 79 (47-48): : 35055 - 35068
  • [3] Data augmentation for handwritten digit recognition using generative adversarial networks
    Jha, Ganesh
    Cecotti, Hubert
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (47-48) : 35055 - 35068
  • [4] Handwritten digit recognition based on conditional generative adversarial network
    Wang Ai-li
    Xue Dong
    Wu Hai-bin
    Wang Min-hui
    [J]. CHINESE JOURNAL OF LIQUID CRYSTALS AND DISPLAYS, 2020, 35 (12) : 1284 - 1290
  • [5] Discriminative Bernoulli Mixture Models for Handwritten Digit Recognition
    Gimenez, Adria
    Andres-Ferrer, J.
    Juan, Alfons
    Serrano, Nicolas
    [J]. 11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 558 - 562
  • [6] Handwritten Digit Recognition Using Bayesian ResNet
    Mhasakar P.
    Trivedi P.
    Mandal S.
    Mitra S.K.
    [J]. SN Computer Science, 2021, 2 (5)
  • [7] The recognition of handwritten digit strings of unknown length using hidden Markov models
    Procter, S
    Illingworth, J
    Elms, AJ
    [J]. FOURTEENTH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1 AND 2, 1998, : 1515 - 1517
  • [8] Handwritten Digit Recognition using DCT and HMMs
    Ali, Syed Salman
    Ghani, Muhammad Usman
    [J]. PROCEEDINGS OF 2014 12TH INTERNATIONAL CONFERENCE ON FRONTIERS OF INFORMATION TECHNOLOGY, 2014, : 303 - 306
  • [9] Using Random Forests for handwritten digit recognition
    Bernard, Simon
    Heutte, Laurent
    Adam, Sebastien
    [J]. ICDAR 2007: NINTH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION, VOLS I AND II, PROCEEDINGS, 2007, : 1043 - 1047
  • [10] Separation and recognition of overlapping handwritten digit images based on generative adversarial networks
    Wei, Jiacheng
    Dong, Ran
    Cai, Chengtao
    Lin, Xiaozhu
    Song, Huijia
    Wang, Xiangyu
    [J]. Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 45 (11): : 2226 - 2234