Metrics and models for handwritten character recognition

Cited by: 1
Authors
Hastie, T [1]
Simard, PY
Affiliations
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
[2] AT&T Res Labs, Red Bank, NJ 07701 USA
Keywords
nearest neighbor classification; invariance
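DOI
Not available
Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714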
Abstract
A digitized handwritten numeral can be represented as a binary or greyscale image. An important pattern recognition task that has received much attention lately is to automatically determine the digit, given the image. While many different techniques have been pushed very hard to solve this task, the most successful and intuitively appropriate is due to Simard, Le Cun and Denker (1993). Their approach combined nearest-neighbor classification with a subject-specific invariant metric that allows for small rotations, translations and other natural transformations. We report on Simard's classifier and compare it to other approaches. One important negative aspect of nearest-neighbor classification is that all the work gets done at lookup time, and with around 10,000 training images in high dimensions the cost can be exorbitant. In this paper we develop rich models for representing large subsets of the prototypes. One example is a low-dimensional hyperplane defined by a point and a set of basis or tangent vectors. The components of these models are learned from the training set, chosen to minimize the average tangent distance from a subset of the training images; as such, they are similar in flavor to the singular value decomposition (SVD), which finds closest hyperplanes in Euclidean distance. These models are either used singly per class or used as basic building blocks in conjunction with the K-means clustering algorithm.
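The idea of a model given by "a point and a set of basis or tangent vectors" can be illustrated with a minimal sketch. It is not the authors' method: it uses plain Euclidean distance from a query to each model rather than the tangent distance of Simard, Le Cun and Denker, and the function names (`orthonormalize`, `distance_to_model`, `classify`), the toy three-dimensional "images", and the class labels are all assumptions made up for the illustration.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def orthonormalize(vectors, tol=1e-12):
    """Gram-Schmidt: return an orthonormal basis for the span of `vectors`."""
    basis = []
    for v in vectors:
        w = list(v)
        for q in basis:
            c = dot(w, q)
            w = [wi - c * qi for wi, qi in zip(w, q)]
        norm = math.sqrt(dot(w, w))
        if norm > tol:  # skip (near-)linearly dependent directions
            basis.append([wi / norm for wi in w])
    return basis

def distance_to_model(x, point, tangents):
    """Euclidean distance from x to {point + sum_i a_i * tangents[i]}."""
    # Residual of x relative to the model's anchor point.
    r = [xi - pi for xi, pi in zip(x, point)]
    # Remove the part of the residual lying in the tangent span.
    for q in orthonormalize(tangents):
        c = dot(r, q)
        r = [ri - c * qi for ri, qi in zip(r, q)]
    return math.sqrt(dot(r, r))

def classify(x, models):
    """models maps label -> (point, tangents); return the closest model's label."""
    return min(models, key=lambda label: distance_to_model(x, *models[label]))

# Hypothetical toy models: one line and one plane in 3 dimensions.
models = {
    "a": ([0.0, 0.0, 0.0], [[1.0, 0.0, 0.0]]),                    # the x-axis
    "b": ([0.0, 5.0, 0.0], [[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]]),   # the plane y = 5
}
query = [10.0, 1.0, 3.0]

for label in models:
    print(label, distance_to_model(query, *models[label]))
print("predicted:", classify(query, models))
```

With one model per class, lookup costs one projection per class instead of one distance computation per training image, which is the saving the abstract is after. Fitting the point and tangent vectors from training data (for example by SVD in the Euclidean case) is not shown here.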
Pages: 54-65
Page count: 12