Affine-invariant Recognition of Handwritten Characters via Accelerated KL Divergence Minimization

被引:0
|
作者
Wakahara, Toru [1 ]
Yamashita, Yukihiko [2 ]
机构
[1] Hosei Univ, Fac Comp & Informat Sci, 3-7-2 Kajino Cho, Koganei, Tokyo 1848584, Japan
[2] Tokyo Inst Technol, Grad Sch Engn & Sci, Meguro Ku, Tokyo 1528550, Japan
关键词
affine-invariant image matching; Gaussian kernel density estimation; KL divergence; character recognition; NUMERAL RECOGNITION;
D O I
10.1109/ICDAR.2011.221
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new, affine-invariant image matching technique via accelerated KL (Kullback-Leibler) divergence minimization. First, we represent an image as a probability distribution by setting the sum of pixel values at one. Second, we introduce affine parameters into either of the two images' probability distributions using the Gaussian kernel density estimation. Finally, we determine optimal affine parameters that minimize KL divergence via an iterative method. In particular, without using such conventional nonlinear optimization techniques as the Levenberg-Marquardt method we devise an accelerated iterative method adapted to the KL divergence minimization problem through effective linear approximation. Recognition experiments using the handwritten numeral database IPTP CDROM1B show that the proposed method achieves a much higher recognition rate of 91.5% at suppressed computational cost than that of 83.7% obtained by a simple image matching method based on a normal KL divergence.
引用
收藏
页码:1095 / 1099
页数:5
相关论文
共 41 条
  • [21] Affine-invariant pattern recognition using momentums in log-polar images
    Son, YH
    You, BJ
    Oh, SR
    Park, GT
    SIXTH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION, PROCEEDINGS, 2004, : 797 - 802
  • [22] Affine-invariant visual features contain supplementary information to enhance speech recognition
    Gurbuz, S
    Patterson, E
    Tufekci, Z
    Gowdy, JN
    AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2001, 2091 : 175 - 181
  • [23] Novel affine-invariant curve descriptor for curve matching and occluded object recognition
    Fu, Huijing
    Tian, Zheng
    Ran, Maohua
    Fan, Ming
    IET COMPUTER VISION, 2013, 7 (04) : 279 - 292
  • [24] Robust feature matching for geospatial images via an affine-invariant coordinate system
    Li, Jiayuan
    Hu, Qingwu
    Ai, Mingyao
    PHOTOGRAMMETRIC RECORD, 2017, 32 (159): : 317 - 331
  • [25] Towards Learning Affine-Invariant Representations via Data-Efficient CNNs
    Xu, Wenju
    Wang, Guanghui
    Sullivan, Alan
    Zhang, Ziming
    2020 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV), 2020, : 893 - 902
  • [26] Application of affine-invariant Fourier descriptors to lipreading for audio-visual speech recognition
    Gurbuz, S
    Tufekci, Z
    Patterson, E
    Gowdy, JN
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 177 - 180
  • [27] A novel algorithm using affine-invariant features for pose-variant face recognition
    Zhao, Youen
    Li, Li
    Liu, Zhaoguang
    COMPUTERS & ELECTRICAL ENGINEERING, 2015, 46 : 217 - 230
  • [28] Robust feature matching via support-line voting and affine-invariant ratios
    Li, Jiayuan
    Hu, Qingwu
    Ai, Mingyao
    Zhong, Ruofei
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2017, 132 : 61 - 76
  • [29] RECOGNITION OF HANDWRITTEN CHINESE CHARACTERS VIA SHORT LINE SEGMENTS
    LEE, HJ
    CHEN, B
    PATTERN RECOGNITION, 1992, 25 (05) : 543 - 552
  • [30] Gaussian kernels for affine-invariant iconic representation and object recognition by multi-dimensional indexing
    BenArie, J
    Wang, ZQ
    Rao, KR
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING '96, 1996, 2727 : 156 - 167