Learning image by-parts using early and late fusion of auto-encoder features

被引：6

作者：

Susan, Seba ^{[1
]}

Malhotra, Jatin ^{[1
]}

机构：

[1] Delhi Technol Univ, Dept Informat Technol, Delhi 110042, India

来源：

MULTIMEDIA TOOLS AND APPLICATIONS | 2021年 / 80卷 / 19期

关键词：

Handwritten numeral recognition; Sub-part learning; Convolutional auto-encoder; Early fusion; Late fusion; Early-cum-late fusion; RECOGNITION; GRADIENT;

D O I：

10.1007/s11042-021-11092-8

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

A novel sub-part learning scheme is introduced in our work for the purpose of recognizing handwritten numeral images. The idea is borrowed from the concept of visual perception and part-wise integration of visual information by the cortical regions of the brain. In this context, each numeral image is divided into four half-parts: top-half, bottom-half, left-half and right-half; the other half of the image being kept masked. An efficient data representation is derived in an unsupervised manner, from each image part, using convolutional auto-encoders (CAE), for our learning scheme that involves both early and late fusion of features. The chief advantage of the features derived from convolutional auto-encoders is the preservation of 2D spatial locality while the features are being filtered layer-by-layer through the convolutional architecture. The features derived from each individual CAE are fused by concatenation in our early fusion scheme, and learnt using an appropriate classifier. The late fusion strategy involves learning the probability density pertaining to the predicted values emanating from the four base classifiers using a meta-learner classifier. The early-cum-late fusion is proposed in the later stage of our work to combine the goodness of both schemes and enhance the performance. The support vector machine is used in all the classification stages. Experiments on the benchmark MNIST dataset of handwritten English numerals prove that our method competes favorably to the state of the art, as inferred from the high classification scores achieved. Our method thus provides a computationally simple and effective methodology for sub-part learning and part-wise integration of information from different parts of the image. The method also contributes to saving in computational expense since, at a time, only a small part of the image is processed, speeding up the inferencing process.

引用

页码：29601 / 29615

页数：15

共 50 条

[21] Learning a good representation with unsymmetrical auto-encoder
Sun, Yanan
Mao, Hua
Guo, Quan
Yi, Zhang
NEURAL COMPUTING & APPLICATIONS, 2016, 27 (05): : 1361 - 1367
[22] CNN Auto-Encoder Network Using Dilated Inception for Image Steganography
Kich, Ismail
Ameur, El Bachir
Taouil, Youssef
INTERNATIONAL JOURNAL OF FUZZY LOGIC AND INTELLIGENT SYSTEMS, 2021, 21 (04) : 358 - 368
[23] Online deep learning based on auto-encoder
Zhang, Si-si
Liu, Jian-wei
Zuo, Xin
Lu, Run-kun
Lian, Si-ming
APPLIED INTELLIGENCE, 2021, 51 (08) : 5420 - 5439
[24] Discriminative Representation Learning with Supervised Auto-encoder
Fang Du
Jiangshe Zhang
Nannan Ji
Junying Hu
Chunxia Zhang
Neural Processing Letters, 2019, 49 : 507 - 520
[25] Auto-Encoder based Structured Dictinoary Learning
Liu, Deyin
Wu, Yuanbo Lin
Liu, Liangchen
Hu, Qichang
Qi, Lin
2020 IEEE 22ND INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2020,
[26] INFORMATION THEORETIC-LEARNING AUTO-ENCODER
Santana, Eder
Emigh, Matthew
Principe, Jose C.
2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3296 - 3301
[27] Learning a good representation with unsymmetrical auto-encoder
Yanan Sun
Hua Mao
Quan Guo
Zhang Yi
Neural Computing and Applications, 2016, 27 : 1361 - 1367
[28] Discriminative Representation Learning with Supervised Auto-encoder
Du, Fang
Zhang, Jiangshe
Ji, Nannan
Hu, Junying
Zhang, Chunxia
NEURAL PROCESSING LETTERS, 2019, 49 (02) : 507 - 520
[29] Online deep learning based on auto-encoder
Si-si Zhang
Jian-wei Liu
Xin Zuo
Run-kun Lu
Si-ming Lian
Applied Intelligence, 2021, 51 : 5420 - 5439
[30] Infrared and Visible Image Fusion Based on Residual Dense Block and Auto-Encoder Network
Wang J.
Xu H.
Wang H.
Yu Z.
Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2021, 41 (10): : 1077 - 1083

← 1 2 3 4 5 →