HANDWRITTEN CHARACTER CLASSIFICATION USING NEAREST-NEIGHBOR IN LARGE DATABASES

被引:46
|
作者
SMITH, SJ [1 ]
BOURGOIN, MO [1 ]
SIMS, K [1 ]
VOORHEES, HL [1 ]
机构
[1] TASC,READING,MA 01867
关键词
D O I
10.1109/34.310689
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We show that systems built on a simple statistical technique and a large training database can be automatically optimized to produce classification accuracies of 99% in the domain of handwritten digits. It is also shown that the performance of these systems scale consistently with the size of the training database, where the error rate is cut by more than half for every tenfold increase in the size of the training set from 10 to 100,000 examples. Three distance metrics for the standard Nearest Neighbor classification system are investigated: a simple Hamming distance metric, a pixel distance metric, and a metric based on the extraction of penstroke features. Systems employing these metrics were trained and tested on a standard, publicly available, database of nearly 225,000 digits provided by the National Institute of Standards and Technology. Additionally, a confidence metric is both introduced by the authors and also discovered and optimized by the system. The new confidence measure proves to be superior to the commonly used Nearest Neighbor distance.
引用
收藏
页码:915 / 919
页数:5
相关论文
共 50 条
  • [1] Comparative Study of Devanagari Handwritten and printed Character & Numerals Recognition using Nearest-Neighbor Classifiers
    Holambe, Anilkumar N.
    Holambe, Sushilkumar N.
    Thool, Ravinder C.
    [J]. PROCEEDINGS 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, (ICCSIT 2010), VOL 1, 2010, : 426 - 430
  • [2] HANDWRITTEN DIGIT RECOGNITION USING AN OPTIMIZED NEAREST-NEIGHBOR CLASSIFIER
    YAN, H
    [J]. PATTERN RECOGNITION LETTERS, 1994, 15 (02) : 207 - 211
  • [3] Evaluation of prototype learning algorithms for nearest-neighbor classifier in application to handwritten character recognition
    Liu, CL
    Nakagawa, M
    [J]. PATTERN RECOGNITION, 2001, 34 (03) : 601 - 615
  • [4] Fast Nearest-Neighbor Classification Using RNN in Domains with Large Number of Classes
    Singh, Gautam
    Dasgupta, Gargi
    Deng, Yu
    [J]. SERVICE-ORIENTED COMPUTING, ICSOC 2018, 2019, 11434 : 309 - 321
  • [5] CHOICE OF NEIGHBOR ORDER IN NEAREST-NEIGHBOR CLASSIFICATION
    Hall, Peter
    Park, Byeong U.
    Samworth, Richard J.
    [J]. ANNALS OF STATISTICS, 2008, 36 (05): : 2135 - 2152
  • [6] Adaptive κ-nearest-neighbor classification using a dynamic number of nearest neighbors
    Ougiaroglou, Stefanos
    Nanopoulos, Alexandros
    Papadopoulos, Apostolos N.
    Manolopoulos, Yannis
    Welzer-Druzovec, Tatjana
    [J]. ADVANCES IN DATABASES AND INFORMATION SYSTEMS, PROCEEDINGS, 2007, 4690 : 66 - +
  • [7] Prototype optimization for nearest-neighbor classification
    Huang, YS
    Chiang, CC
    Shieh, JW
    Grimson, E
    [J]. PATTERN RECOGNITION, 2002, 35 (06) : 1237 - 1245
  • [8] A Bayesian Reassessment of Nearest-Neighbor Classification
    Cucala, Lionel
    Marin, Jean-Michel
    Robert, Christian P.
    Titterington, D. M.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2009, 104 (485) : 263 - 273
  • [9] Nearest-neighbor classification with categorical variables
    Buttrey, SE
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1998, 28 (02) : 157 - 169
  • [10] Nearest-neighbor classification for facies delineation
    Tartakovsky, Daniel M.
    Wohlberg, Brendt
    Guadagnini, Alberto
    [J]. WATER RESOURCES RESEARCH, 2007, 43 (07)