Learning image by-parts using early and late fusion of auto-encoder features

被引:6
|
作者
Susan, Seba [1 ]
Malhotra, Jatin [1 ]
机构
[1] Delhi Technol Univ, Dept Informat Technol, Delhi 110042, India
关键词
Handwritten numeral recognition; Sub-part learning; Convolutional auto-encoder; Early fusion; Late fusion; Early-cum-late fusion; RECOGNITION; GRADIENT;
D O I
10.1007/s11042-021-11092-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A novel sub-part learning scheme is introduced in our work for the purpose of recognizing handwritten numeral images. The idea is borrowed from the concept of visual perception and part-wise integration of visual information by the cortical regions of the brain. In this context, each numeral image is divided into four half-parts: top-half, bottom-half, left-half and right-half; the other half of the image being kept masked. An efficient data representation is derived in an unsupervised manner, from each image part, using convolutional auto-encoders (CAE), for our learning scheme that involves both early and late fusion of features. The chief advantage of the features derived from convolutional auto-encoders is the preservation of 2D spatial locality while the features are being filtered layer-by-layer through the convolutional architecture. The features derived from each individual CAE are fused by concatenation in our early fusion scheme, and learnt using an appropriate classifier. The late fusion strategy involves learning the probability density pertaining to the predicted values emanating from the four base classifiers using a meta-learner classifier. The early-cum-late fusion is proposed in the later stage of our work to combine the goodness of both schemes and enhance the performance. The support vector machine is used in all the classification stages. Experiments on the benchmark MNIST dataset of handwritten English numerals prove that our method competes favorably to the state of the art, as inferred from the high classification scores achieved. Our method thus provides a computationally simple and effective methodology for sub-part learning and part-wise integration of information from different parts of the image. The method also contributes to saving in computational expense since, at a time, only a small part of the image is processed, speeding up the inferencing process.
引用
收藏
页码:29601 / 29615
页数:15
相关论文
共 50 条
  • [1] Learning image by-parts using early and late fusion of auto-encoder features
    Seba Susan
    Jatin Malhotra
    Multimedia Tools and Applications, 2021, 80 : 29601 - 29615
  • [2] Multimodal Medical Image Fusion Using Stacked Auto-encoder in NSCT Domain
    Nahed Tawfik
    Heba A. Elnemr
    Mahmoud Fakhr
    Moawad I. Dessouky
    Fathi E. Abd El-Samie
    Journal of Digital Imaging, 2022, 35 : 1308 - 1325
  • [3] Multimodal Medical Image Fusion Using Stacked Auto-encoder in NSCT Domain
    Tawfik, Nahed
    Elnemr, Heba A.
    Fakhr, Mahmoud
    Dessouky, Moawad I.
    Abd El-Samie, Fathi E.
    JOURNAL OF DIGITAL IMAGING, 2022, 35 (05) : 1308 - 1325
  • [4] Generating adversarial samples by manipulating image features with auto-encoder
    Jianxin Yang
    Mingwen Shao
    Huan Liu
    Xinkai Zhuang
    International Journal of Machine Learning and Cybernetics, 2023, 14 : 2499 - 2509
  • [5] Generating adversarial samples by manipulating image features with auto-encoder
    Yang, Jianxin
    Shao, Mingwen
    Liu, Huan
    Zhuang, Xinkai
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2023, 14 (07) : 2499 - 2509
  • [6] Tire Pattern Image Classification using Variational Auto-Encoder with Contrastive Learning
    Yang, Jianning
    Xue, Jiahao
    Feng, Xiaodong
    Song, Chaoqi
    Hao, Yu
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [7] Layered Image Compression using Scalable Auto-encoder
    Jia, Chuanmin
    Liu, Zhaoyi
    Wang, Yao
    Ma, Siwei
    Gao, Wen
    2019 2ND IEEE CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2019), 2019, : 431 - 436
  • [8] Underwater image reconstruction using convolutional auto-encoder
    Yasukawa, Shinsuke
    Raghura, Sreeraman Srinivasa
    Nishida, Yuya
    Ishii, Kazuo
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB 2021), 2021, : P86 - P86
  • [9] Underwater image reconstruction using convolutional auto-encoder
    Yasukawa, Shinsuke
    Raghura, Sreeraman Srinivasa
    Nishida, Yuya
    Ishii, Kazuo
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON ARTIFICIAL LIFE AND ROBOTICS (ICAROB 2021), 2021, : 262 - 265
  • [10] A joint convolution auto-encoder network for infrared and visible image fusion
    Zhang, Zhancheng
    Gao, Yuanhao
    Xiong, Mengyu
    Luo, Xiaoqing
    Wu, Xiao-Jun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (19) : 29017 - 29035