Metrics and models for handwritten character recognition

被引:1
|
作者
Hastie, T [1 ]
Simard, PY
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[2] AT&T Res Labs, Red Bank, NJ 07701 USA
关键词
nearest neighbor classification; invariance;
D O I
暂无
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A digitized handwritten numeral can be represented as a binary or greyscale image. An important pattern recognition task that has received much attention lately is to automatically determine the digit, given the image. While many different techniques have been pushed very hard to solve this task, the most successful and intuitively appropriate is due to Simard, Le Cun and Denker (1993). Their approach combined nearest-neighbor classification with a subject-specific invariant metric that allows for small rotations, translations and other natural transformations. We report on Simard's classifier and compare it to other approaches. One important negative aspect of near-neighbor classification is that all the work gets done at lookup time, and with around 10,000 training images in high dimensions this can be exorbitant. In this paper we develop rich models for representing large subsets of the prototypes. One example is a low-dimensional hyperplane defined by a point and a set of basis or tangent vectors. The components of these models are learned from the training set, chosen to minimize the average tangent distance from a subset of the training images-as such they are similar in flavor to the singular value decomposition (SVD), which finds closest hyperplanes in Euclidean distance. These models are either used singly per class or used as basic building blocks in conjunction with the K-means clustering algorithm.
引用
收藏
页码:54 / 65
页数:12
相关论文
共 50 条
  • [11] Online continues Vietnamese handwritten character recognition based on microsoft handwritten character recognition library
    Tao, Ngo Quoc
    Van Hung, Pham
    [J]. 2006 IEEE ASIA PACIFIC CONFERENCE ON CIRCUITS AND SYSTEMS, 2006, : 2024 - +
  • [12] Optical character recognition of handwritten Arabic using hidden Markov models
    Aulama, Mohannad M.
    Natsheh, Asem M.
    Abandah, Gheith A.
    Olama, Mohammed M.
    [J]. OPTICAL PATTERN RECOGNITION XXII, 2011, 8055
  • [13] Handwritten Hindi character recognition: a review
    Yadav, Madhuri
    Purwar, Ravindra Kumar
    Mittal, Mamta
    [J]. IET IMAGE PROCESSING, 2018, 12 (11) : 1919 - 1933
  • [14] High-order statistics based distance metrics and its application in handwritten character recognition
    Ren, JL
    Wang, CQ
    Guo, J
    [J]. ISTM/2005: 6th International Symposium on Test and Measurement, Vols 1-9, Conference Proceedings, 2005, : 7880 - 7883
  • [15] A new algorithm for handwritten character recognition
    Zhu, XY
    Shi, YF
    [J]. 2001 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL I, PROCEEDINGS, 2001, : 1130 - 1133
  • [16] A Survey on Arabic Handwritten Character Recognition
    Ali A.A.A.
    Suresha M.
    Ahmed H.A.M.
    [J]. SN Computer Science, 2020, 1 (3)
  • [17] Invariant handwritten Chinese character recognition
    Liu, JNK
    Lee, RST
    [J]. ICONIP'98: THE FIFTH INTERNATIONAL CONFERENCE ON NEURAL INFORMATION PROCESSING JOINTLY WITH JNNS'98: THE 1998 ANNUAL CONFERENCE OF THE JAPANESE NEURAL NETWORK SOCIETY - PROCEEDINGS, VOLS 1-3, 1998, : 275 - 278
  • [18] Holistic recognition of handwritten character pairs
    Wang, X
    Govindaraju, V
    Srihari, S
    [J]. PATTERN RECOGNITION, 2000, 33 (12) : 1967 - 1973
  • [19] Handwritten English Character and Digit Recognition
    Al-Mahmud
    Tanvin, Asnuva
    Rahman, Sazia
    [J]. PROCEEDINGS OF INTERNATIONAL CONFERENCE ON ELECTRONICS, COMMUNICATIONS AND INFORMATION TECHNOLOGY 2021 (ICECIT 2021), 2021,
  • [20] Experimenting with Assamese Handwritten Character Recognition
    Singh, Jaisal
    Natesan, Srinivasan
    Paprzycki, Marcin
    Ganzha, Maria
    [J]. BIG-DATA-ANALYTICS IN ASTRONOMY, SCIENCE, AND ENGINEERING, BDA 2021, 2022, 13167 : 219 - 229