Manchu Word Recognition Based on Convolutional Neural Network with Spatial Pyramid Pooling

Cited by: 0
Authors
Li, Min [1]
Zheng, Ruirui [1]
Xu, Shuang [1]
Fu, Yu [1]
Huang, Di [2]
Affiliations
[1] Dalian Minzu Univ, Coll Informat & Commun Engn, Dalian, Peoples R China
[2] Northern Univ Nationalities, Coll Math & Informat Sci, Yinchuan, Peoples R China
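Funding
National Natural Science Foundation of China
Keywords
Manchu word recognition; convolutional neural network; spatial pyramid pooling; optical character recognition
DOI
Not available
Chinese Library Classification number
R318 [Biomedical engineering]
Discipline classification code
0831
Abstract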
Manchu character recognition is important for preserving and studying Manchu culture and history. Previous Manchu character recognition methods are mainly based on conventional machine learning with shallow, manually selected features, so their recognition results are unsatisfactory. Convolutional neural networks achieve high accuracy in optical character recognition because the convolution operators automatically extract deep structural features. A conventional convolutional neural network requires input images of a fixed size, but Manchu is written in a phonemic script, so a Manchu word can have arbitrary length. Applying a conventional convolutional neural network directly to Manchu word recognition therefore requires normalizing the image size, and this normalization limits further improvement in recognition accuracy. This paper replaces the last max-pooling layer of a convolutional neural network with a spatial pyramid pooling layer and proposes a classifier that recognizes Manchu words of arbitrary size without segmenting the word. Because input images of arbitrary size need no normalization during recognition, the proposed model obtains better recognition accuracy. Experiments indicate that the proposed Manchu word recognition models reach a highest accuracy of 0.9768, higher than that of the conventional convolutional neural network. The proposed models thus outperform their conventional counterparts in both accuracy and flexibility.
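The role of spatial pyramid pooling described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the pyramid levels `(1, 2, 4)`, the single-channel feature maps, and the helper names `spp_max_pool` and `make_map` are assumptions chosen for illustration. The sketch only shows why feature maps of different widths, such as those produced by short and long words, yield a vector of the same length.

```python
def spp_max_pool(feature_map, levels=(1, 2, 4)):
    """Max-pool a 2D feature map (list of rows) into a fixed-length vector.

    For each pyramid level n, the map is divided into an n x n grid of bins
    and the maximum of each bin is kept, so the output length is
    sum(n * n for n in levels) regardless of the map's height and width.
    """
    h, w = len(feature_map), len(feature_map[0])
    pooled = []
    for n in levels:
        for i in range(n):
            # Row range of bin i: floor at the start, ceiling at the end,
            # so every bin contains at least one row.
            r0 = (i * h) // n
            r1 = ((i + 1) * h + n - 1) // n
            for j in range(n):
                c0 = (j * w) // n
                c1 = ((j + 1) * w + n - 1) // n
                pooled.append(max(feature_map[r][c]
                                  for r in range(r0, r1)
                                  for c in range(c0, c1)))
    return pooled


def make_map(h, w):
    """Hypothetical feature map whose entries are just increasing numbers."""
    return [[float(r * w + c) for c in range(w)] for r in range(h)]


# Two maps of equal height but different width (a short and a long word).
vec_short = spp_max_pool(make_map(6, 5))
vec_long = spp_max_pool(make_map(6, 13))

# Both vectors have 1 + 4 + 16 = 21 entries.
print(len(vec_short), len(vec_long))
```

Because the output length depends only on the pyramid levels, the fully connected layers that follow can keep a fixed input size while the word image itself is left unnormalized.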
Pages: 6