Learning image by-parts using early and late fusion of auto-encoder features

被引:6
|
作者
Susan, Seba [1 ]
Malhotra, Jatin [1 ]
机构
[1] Delhi Technol Univ, Dept Informat Technol, Delhi 110042, India
关键词
Handwritten numeral recognition; Sub-part learning; Convolutional auto-encoder; Early fusion; Late fusion; Early-cum-late fusion; RECOGNITION; GRADIENT;
D O I
10.1007/s11042-021-11092-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A novel sub-part learning scheme is introduced in our work for the purpose of recognizing handwritten numeral images. The idea is borrowed from the concept of visual perception and part-wise integration of visual information by the cortical regions of the brain. In this context, each numeral image is divided into four half-parts: top-half, bottom-half, left-half and right-half; the other half of the image being kept masked. An efficient data representation is derived in an unsupervised manner, from each image part, using convolutional auto-encoders (CAE), for our learning scheme that involves both early and late fusion of features. The chief advantage of the features derived from convolutional auto-encoders is the preservation of 2D spatial locality while the features are being filtered layer-by-layer through the convolutional architecture. The features derived from each individual CAE are fused by concatenation in our early fusion scheme, and learnt using an appropriate classifier. The late fusion strategy involves learning the probability density pertaining to the predicted values emanating from the four base classifiers using a meta-learner classifier. The early-cum-late fusion is proposed in the later stage of our work to combine the goodness of both schemes and enhance the performance. The support vector machine is used in all the classification stages. Experiments on the benchmark MNIST dataset of handwritten English numerals prove that our method competes favorably to the state of the art, as inferred from the high classification scores achieved. Our method thus provides a computationally simple and effective methodology for sub-part learning and part-wise integration of information from different parts of the image. The method also contributes to saving in computational expense since, at a time, only a small part of the image is processed, speeding up the inferencing process.
引用
收藏
页码:29601 / 29615
页数:15
相关论文
共 50 条
  • [21] Learning a good representation with unsymmetrical auto-encoder
    Sun, Yanan
    Mao, Hua
    Guo, Quan
    Yi, Zhang
    NEURAL COMPUTING & APPLICATIONS, 2016, 27 (05): : 1361 - 1367
  • [22] CNN Auto-Encoder Network Using Dilated Inception for Image Steganography
    Kich, Ismail
    Ameur, El Bachir
    Taouil, Youssef
    INTERNATIONAL JOURNAL OF FUZZY LOGIC AND INTELLIGENT SYSTEMS, 2021, 21 (04) : 358 - 368
  • [23] Online deep learning based on auto-encoder
    Zhang, Si-si
    Liu, Jian-wei
    Zuo, Xin
    Lu, Run-kun
    Lian, Si-ming
    APPLIED INTELLIGENCE, 2021, 51 (08) : 5420 - 5439
  • [24] Discriminative Representation Learning with Supervised Auto-encoder
    Fang Du
    Jiangshe Zhang
    Nannan Ji
    Junying Hu
    Chunxia Zhang
    Neural Processing Letters, 2019, 49 : 507 - 520
  • [25] Auto-Encoder based Structured Dictinoary Learning
    Liu, Deyin
    Wu, Yuanbo Lin
    Liu, Liangchen
    Hu, Qichang
    Qi, Lin
    2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020,
  • [26] INFORMATION THEORETIC-LEARNING AUTO-ENCODER
    Santana, Eder
    Emigh, Matthew
    Principe, Jose C.
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3296 - 3301
  • [27] Learning a good representation with unsymmetrical auto-encoder
    Yanan Sun
    Hua Mao
    Quan Guo
    Zhang Yi
    Neural Computing and Applications, 2016, 27 : 1361 - 1367
  • [28] Discriminative Representation Learning with Supervised Auto-encoder
    Du, Fang
    Zhang, Jiangshe
    Ji, Nannan
    Hu, Junying
    Zhang, Chunxia
    NEURAL PROCESSING LETTERS, 2019, 49 (02) : 507 - 520
  • [29] Online deep learning based on auto-encoder
    Si-si Zhang
    Jian-wei Liu
    Xin Zuo
    Run-kun Lu
    Si-ming Lian
    Applied Intelligence, 2021, 51 : 5420 - 5439
  • [30] Infrared and Visible Image Fusion Based on Residual Dense Block and Auto-Encoder Network
    Wang J.
    Xu H.
    Wang H.
    Yu Z.
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2021, 41 (10): : 1077 - 1083