Manchu Word Recognition Based on Convolutional Neural Network with Spatial Pyramid Pooling

被引:0
|
作者
Li, Min [1 ]
Zheng, Ruirui [1 ]
Xu, Shuang [1 ]
Fu, Yu [1 ]
Huang, Di [2 ]
机构
[1] Dalian Minzu Univ, Coll Informat & Commun Engn, Dalian, Peoples R China
[2] Northern Univ Nationalities, Coll Math & Informat Sci, Yinchuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Manchu word recognition; convolutional neural network; spatial pyramid pooling; optical character recognition;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Manchu character recognition is important in protecting and researching Manchu culture and history. Previous methods of Manchu character recognition are mainly based on conventional machine learning using shallow artificial selection features, thus recognition results are unsatisfactory. The method with convolutional neural networks achieves high accuracy on optical character recognition as the convolution operators can automatically extract deep structure features. The convolutional neural network needs input images with the fixed size, but as a kind of phonemic language, the Manchu word has an arbitrary length. So it is needed to normalize the size of images if applying conventional convolutional neural network directly on Manchu word recognition. This normalization process will restrain the promotion of Manchu character recognition accuracy. This paper utilizes the spatial pyramid pooling layer instead of the last max-pooling layer in a convolutional neural network, and proposes a classifier for recognizing the arbitrary size Manchu word without segmenting the word. Without need of normalizing image sizes, the proposed model obtains the better recognition accuracy. The experiments indicate that the proposed Manchu word recognition models achieve the highest accuracy of 0.9768, higher than the conventional convolutional neural network. Furthermore there is no normalization on input images with arbitrary sizes in recognizing process. The proposed Manchu word recognition models outperform conventional counterparts in both accuracy and flexibility.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Attributes and Action Recognition Based on Convolutional Neural Networks and Spatial Pyramid VLAD Encoding
    Yan, Shiyang
    Smith, Jeremy S.
    Zhang, Bailing
    COMPUTER VISION - ACCV 2016 WORKSHOPS, PT III, 2017, 10118 : 500 - 514
  • [32] Classification of Foods Using Spatial Pyramid Convolutional Neural Network
    Heravi, Elnaz J.
    Aghdam, Hamed H.
    Puig, Domenec
    ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2016, 288 : 163 - 168
  • [33] Pyramid Pooling Dense Convolutional Neural Network for Multi-focus image Fusion
    Li, Yi
    Shen, Xuanjing
    Chen, Haipeng
    PROCEEDINGS OF 2019 6TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2019, : 164 - 168
  • [34] SAR IMAGE CHANGE DETECTION METHOD VIA A PYRAMID POOLING CONVOLUTIONAL NEURAL NETWORK
    Wang, Rongfang
    Ding, Fan
    Chen, Jia-Wei
    Liu, Bo
    Zhang, Jie
    Jiao, Licheng
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 312 - 315
  • [35] Household tools classification recognition based on spatial pyramid pooling features
    Wu P.-L.
    He B.
    Hou Z.-G.
    Kongzhi yu Juece/Control and Decision, 2019, 34 (07): : 1481 - 1486
  • [36] Mode Recognition of Orbital Angular Momentum Based on Attention Pyramid Convolutional Neural Network
    Qu, Tan
    Zhao, Zhiming
    Zhang, Yan
    Wu, Jiaji
    Wu, Zhensen
    REMOTE SENSING, 2022, 14 (18)
  • [37] POOLING MAP ADAPTATION IN CONVOLUTIONAL NEURAL NETWORK FOR FACIAL EXPRESSION RECOGNITION
    Li, Zhiyuan
    Han, Shizhong
    Khan, Ahmed Shehab
    Cai, Jie
    Meng, Zibo
    O'Reilly, James
    Tong, Yan
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2019, : 1108 - 1113
  • [38] Gesture recognition of graph convolutional neural network based on spatial domain
    Chen, Hong
    Zhao, Hongdong
    Qi, Baoqiang
    Zhang, Shuai
    Yu, Zhanghong
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (03): : 2157 - 2167
  • [39] Gesture recognition of graph convolutional neural network based on spatial domain
    Hong Chen
    Hongdong Zhao
    Baoqiang Qi
    Shuai Zhang
    Zhanghong Yu
    Neural Computing and Applications, 2023, 35 : 2157 - 2167
  • [40] Traffic Sign Recognition in Harsh Environment Using Attention Based Convolutional Pooling Neural Network
    Jun Ho Chung
    Dong Won Kim
    Tae Koo Kang
    Myo Taeg Lim
    Neural Processing Letters, 2020, 51 : 2551 - 2573