Learning image by-parts using early and late fusion of auto-encoder features

被引:6
|
作者
Susan, Seba [1 ]
Malhotra, Jatin [1 ]
机构
[1] Delhi Technol Univ, Dept Informat Technol, Delhi 110042, India
关键词
Handwritten numeral recognition; Sub-part learning; Convolutional auto-encoder; Early fusion; Late fusion; Early-cum-late fusion; RECOGNITION; GRADIENT;
D O I
10.1007/s11042-021-11092-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A novel sub-part learning scheme is introduced in our work for the purpose of recognizing handwritten numeral images. The idea is borrowed from the concept of visual perception and part-wise integration of visual information by the cortical regions of the brain. In this context, each numeral image is divided into four half-parts: top-half, bottom-half, left-half and right-half; the other half of the image being kept masked. An efficient data representation is derived in an unsupervised manner, from each image part, using convolutional auto-encoders (CAE), for our learning scheme that involves both early and late fusion of features. The chief advantage of the features derived from convolutional auto-encoders is the preservation of 2D spatial locality while the features are being filtered layer-by-layer through the convolutional architecture. The features derived from each individual CAE are fused by concatenation in our early fusion scheme, and learnt using an appropriate classifier. The late fusion strategy involves learning the probability density pertaining to the predicted values emanating from the four base classifiers using a meta-learner classifier. The early-cum-late fusion is proposed in the later stage of our work to combine the goodness of both schemes and enhance the performance. The support vector machine is used in all the classification stages. Experiments on the benchmark MNIST dataset of handwritten English numerals prove that our method competes favorably to the state of the art, as inferred from the high classification scores achieved. Our method thus provides a computationally simple and effective methodology for sub-part learning and part-wise integration of information from different parts of the image. The method also contributes to saving in computational expense since, at a time, only a small part of the image is processed, speeding up the inferencing process.
引用
收藏
页码:29601 / 29615
页数:15
相关论文
共 50 条
  • [31] Reinforcement Learning on Robot with Variational Auto-Encoder
    Chen, Yiwen
    Yang, Chenguang
    Feng, Ying
    PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION AND CONTROL (ICMIC2019), 2020, 582 : 675 - 684
  • [32] Infrared and Visible Image Fusion Method Based on Convolutional Auto-Encoder and Residual Block
    Jiang Zetao
    He Yuting
    ACTA OPTICA SINICA, 2019, 39 (10)
  • [33] Infrared and visible image fusion based on variational auto-encoder and infrared feature compensation
    Ren, Long
    Pan, Zhibin
    Cao, Jianzhong
    Liao, Jiawen
    INFRARED PHYSICS & TECHNOLOGY, 2021, 117
  • [34] Image-To-Image Translation Using a Cross-Domain Auto-Encoder and Decoder
    Yoo, Jaechang
    Eom, Heesong
    Choi, Yong Suk
    APPLIED SCIENCES-BASEL, 2019, 9 (22):
  • [35] DEEP AUTO-ENCODER NETWORK FOR HYPERSPECTRAL IMAGE UNMIXING
    Su, Yuanchao
    Li, Jun
    Plaza, Antonio
    Marinoni, Andrea
    Gamba, Paolo
    Huang, Yuancheng
    IGARSS 2018 - 2018 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2018, : 6400 - 6403
  • [36] A deep learning method for lincRNA detection using auto-encoder algorithm
    Yu, Ning
    Yu, Zeng
    Pan, Yi
    BMC BIOINFORMATICS, 2017, 18 : 511
  • [37] Path Tracking Control Using Imitation Learning with Variational Auto-Encoder
    Lee, Su-Jin
    Chun, Tae Yoon
    Lim, Hyoung Woo
    Lee, Sang-Ho
    2019 19TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2019), 2019, : 501 - 505
  • [38] Image enhancement algorithm with convolutional auto-encoder network
    Wang W.-L.
    Yang X.-H.
    Zhao Y.-W.
    Gao N.
    Lv C.
    Zhang Z.-J.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2019, 53 (09): : 1728 - 1740
  • [39] A selectional auto-encoder approach for document image binarization
    Calvo-Zaragoza, Jorge
    Gallego, Antonio-Javier
    PATTERN RECOGNITION, 2019, 86 : 37 - 47
  • [40] Target Classification Using Convolutional Deep Learning and Auto-Encoder Models
    Zaied, Sarra
    Toumi, Abdelmalek
    Khenchaf, Ali
    2018 4TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2018,