AMFF-net: adaptive multi-modal feature fusion network for image classification

被引:0
|
作者
Wei Liu
Xiaobo Lu
Yun Wei
机构
[1] Southeast University,School of Automation
[2] Ministry of Education,Key Laboratory of Measurement and Control of Complex Systems of Engineering
[3] Beijing Mass Transit Railway Operation Corporation Limited,undefined
来源
关键词
Convolutional neural networks; AMFF network; CNN local-global features; Traditional global features;
D O I
暂无
中图分类号
学科分类号
摘要
Convolutional neural networks(CNNs) have been applied to different computer vision tasks such as image classification and recognition, object detection, and segmentation due to the excellent capability of feature extraction and strong generalization ability in recent years. However, CNNs mainly represent the semantic information of images by aggregating local features. It is proved that some global features, such as histograms of oriented gradients, color information, and local binary pattern features, are useful for image recognition. Nonetheless, some researchers simply concatenate these features together, overlooking the differences between features, which leads to the inability to obtain desired performance or even worse results. To better integrate multi-modal features, in this paper a novel feature fusion module is proposed, named AMFF Network, which can adaptively fuse CNNs’ local-global features and traditional global features. That’s to say, the high-level semantic characteristic of objects and the low-level detailed information and appearance features can be combined dynamically by this network. It is convenient to embed the network in various architectures and can generalize effectively in various datasets. Furtherly, we show that the AMFF module brings obvious performance improvements for current state-of-the-art methods at some additional calculation cost. Experiments performed on multiple benchmark datasets, such as Fashion-MNIST, CIFAR10, CIFAR100, Tiny-Imagenet-200, and Market1501, demonstrate that the proposed AMFF-Net module can bring significant promotion in different datasets for image classification.
引用
收藏
页码:17069 / 17091
页数:22
相关论文
共 50 条
  • [1] AMFF-net: adaptive multi-modal feature fusion network for image classification
    Liu, Wei
    Lu, Xiaobo
    Wei, Yun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (06) : 17069 - 17091
  • [2] AMFF-Net: An attention-based multi-scale feature fusion network for allergic pollen detection
    Li, Jianqiang
    Wang, Quanzeng
    Xiong, Chengyao
    Zhao, Linna
    Cheng, Wenxiu
    Xu, Xi
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 235
  • [3] AMFF-Net: An attention-based multi-scale feature fusion network for allergic pollen detection
    Li, Jianqiang
    Wang, Quanzeng
    Xiong, Chengyao
    Zhao, Linna
    Cheng, Wenxiu
    Xu, Xi
    Expert Systems with Applications, 2024, 235
  • [4] AMFF-Net: An Effective 3D Object Detector Based on Attention and Multi-Scale Feature Fusion
    Li, Guangping
    Mo, Zuanfang
    Ling, Bingo Wing-Kuen
    SENSORS, 2023, 23 (23)
  • [5] Adaptive Feature Fusion for Multi-modal Entity Alignment
    Guo H.
    Li X.-Y.
    Tang J.-Y.
    Guo Y.-M.
    Zhao X.
    Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (04): : 758 - 770
  • [6] Multi-modal feature fusion for geographic image annotation
    Li, Ke
    Zou, Changqing
    Bu, Shuhui
    Liang, Yun
    Zhang, Jian
    Gong, Minglun
    PATTERN RECOGNITION, 2018, 73 : 1 - 14
  • [7] Image and Encoded Text Fusion for Multi-Modal Classification
    Gallo, I.
    Calefati, A.
    Nawaz, S.
    Janjua, M. K.
    2018 INTERNATIONAL CONFERENCE ON DIGITAL IMAGE COMPUTING: TECHNIQUES AND APPLICATIONS (DICTA), 2018, : 203 - 209
  • [8] FuseNet: a multi-modal feature fusion network for 3D shape classification
    Zhao, Xin
    Chen, Yinhuang
    Yang, Chengzhuan
    Fang, Lincong
    VISUAL COMPUTER, 2024, : 2973 - 2985
  • [9] Disease Classification Model Based on Multi-Modal Feature Fusion
    Wan, Zhengyu
    Shao, Xinhui
    IEEE ACCESS, 2023, 11 : 27536 - 27545
  • [10] Feature Disentanglement and Adaptive Fusion for Improving Multi-modal Tracking
    Li, Zheng
    Cai, Weibo
    Dong, Junhao
    Lai, Jianhuang
    Xie, Xiaohua
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT XII, 2024, 14436 : 68 - 80