Learning image by-parts using early and late fusion of auto-encoder features

被引:6
|
作者
Susan, Seba [1 ]
Malhotra, Jatin [1 ]
机构
[1] Delhi Technol Univ, Dept Informat Technol, Delhi 110042, India
关键词
Handwritten numeral recognition; Sub-part learning; Convolutional auto-encoder; Early fusion; Late fusion; Early-cum-late fusion; RECOGNITION; GRADIENT;
D O I
10.1007/s11042-021-11092-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A novel sub-part learning scheme is introduced in our work for the purpose of recognizing handwritten numeral images. The idea is borrowed from the concept of visual perception and part-wise integration of visual information by the cortical regions of the brain. In this context, each numeral image is divided into four half-parts: top-half, bottom-half, left-half and right-half; the other half of the image being kept masked. An efficient data representation is derived in an unsupervised manner, from each image part, using convolutional auto-encoders (CAE), for our learning scheme that involves both early and late fusion of features. The chief advantage of the features derived from convolutional auto-encoders is the preservation of 2D spatial locality while the features are being filtered layer-by-layer through the convolutional architecture. The features derived from each individual CAE are fused by concatenation in our early fusion scheme, and learnt using an appropriate classifier. The late fusion strategy involves learning the probability density pertaining to the predicted values emanating from the four base classifiers using a meta-learner classifier. The early-cum-late fusion is proposed in the later stage of our work to combine the goodness of both schemes and enhance the performance. The support vector machine is used in all the classification stages. Experiments on the benchmark MNIST dataset of handwritten English numerals prove that our method competes favorably to the state of the art, as inferred from the high classification scores achieved. Our method thus provides a computationally simple and effective methodology for sub-part learning and part-wise integration of information from different parts of the image. The method also contributes to saving in computational expense since, at a time, only a small part of the image is processed, speeding up the inferencing process.
引用
收藏
页码:29601 / 29615
页数:15
相关论文
共 50 条
  • [41] Shape cognition in map space using deep auto-encoder learning
    Yan X.
    Ai T.
    Yang M.
    Zheng J.
    Ai, Tinghua (tinghuaai@whu.edu.cn), 2021, SinoMaps Press (50): : 757 - 765
  • [42] A deep learning method for lincRNA detection using auto-encoder algorithm
    Ning Yu
    Zeng Yu
    Yi Pan
    BMC Bioinformatics, 18
  • [43] A Deep Learning Method for lincRNA Identification Using Auto-encoder Algorithm
    Yu, Ning
    Yu, Zeng
    Pan, Yi
    2016 IEEE 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL ADVANCES IN BIO AND MEDICAL SCIENCES (ICCABS), 2016,
  • [44] Deep Supervised Auto-encoder Hashing for Image Retrieval
    Tang, Sanli
    Chi, Haoyuan
    Yang, Jie
    Huang, Xiaolin
    Zareapoor, Masoumeh
    PATTERN RECOGNITION AND COMPUTER VISION, PT II, 2018, 11257 : 193 - 205
  • [45] Semantic image representation for image recognition and retrieval using multilayer variational auto-encoder, InceptionNet and low-level image features
    Giveki, Davar
    Esfandyari, Sajad
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
  • [46] Nonlinear Dimensionality Reduction for Intrusion Detection Using Auto-Encoder Bottleneck Features
    Abolhasanzadeh, Bahareh
    2015 7TH CONFERENCE ON INFORMATION AND KNOWLEDGE TECHNOLOGY (IKT), 2015,
  • [47] Feature Extraction of Lathe Tool Crater Wear Image Using Auto-Encoder
    Choi, Jae Uk
    Heo, Hyo Beom
    Park, Seung Hwan
    Extraction, Feature
    TRANSACTIONS OF THE KOREAN SOCIETY OF MECHANICAL ENGINEERS A, 2023, 47 (03) : 273 - 281
  • [48] Unsupervised Dimension Reduction for Image Classification Using Regularized Convolutional Auto-Encoder
    Xu, Chaoyang
    Wu, Ling
    Wang, Shiping
    ADVANCES IN COMPUTER VISION, CVC, VOL 1, 2020, 943 : 99 - 108
  • [49] Sparse auto-encoder based feature learning for human body detection in depth image
    Su, Song-Zhi
    Liu, Zhi-Hui
    Xu, Su-Ping
    Li, Shao-Zi
    Ji, Rongrong
    SIGNAL PROCESSING, 2015, 112 : 43 - 52
  • [50] Image label transfer: Short video labelling by using frame auto-encoder
    Lü Chaohui
    Huang Yiyang
    TheJournalofChinaUniversitiesofPostsandTelecommunications, 2020, 27 (01) : 92 - 99