AMFF-net: adaptive multi-modal feature fusion network for image classification

被引:0
|
作者
Wei Liu
Xiaobo Lu
Yun Wei
机构
[1] Southeast University,School of Automation
[2] Ministry of Education,Key Laboratory of Measurement and Control of Complex Systems of Engineering
[3] Beijing Mass Transit Railway Operation Corporation Limited,undefined
来源
关键词
Convolutional neural networks; AMFF network; CNN local-global features; Traditional global features;
D O I
暂无
中图分类号
学科分类号
摘要
Convolutional neural networks(CNNs) have been applied to different computer vision tasks such as image classification and recognition, object detection, and segmentation due to the excellent capability of feature extraction and strong generalization ability in recent years. However, CNNs mainly represent the semantic information of images by aggregating local features. It is proved that some global features, such as histograms of oriented gradients, color information, and local binary pattern features, are useful for image recognition. Nonetheless, some researchers simply concatenate these features together, overlooking the differences between features, which leads to the inability to obtain desired performance or even worse results. To better integrate multi-modal features, in this paper a novel feature fusion module is proposed, named AMFF Network, which can adaptively fuse CNNs’ local-global features and traditional global features. That’s to say, the high-level semantic characteristic of objects and the low-level detailed information and appearance features can be combined dynamically by this network. It is convenient to embed the network in various architectures and can generalize effectively in various datasets. Furtherly, we show that the AMFF module brings obvious performance improvements for current state-of-the-art methods at some additional calculation cost. Experiments performed on multiple benchmark datasets, such as Fashion-MNIST, CIFAR10, CIFAR100, Tiny-Imagenet-200, and Market1501, demonstrate that the proposed AMFF-Net module can bring significant promotion in different datasets for image classification.
引用
收藏
页码:17069 / 17091
页数:22
相关论文
共 50 条
  • [41] Multi-modal hierarchical fusion network for fine-grained paper classification
    Yue, Tan
    Li, Yong
    Qin, Jiedong
    Hu, Zonghai
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (11) : 31527 - 31543
  • [42] A Multi-modal SPM Model for Image Classification
    Zheng, Peng
    Zhao, Zhong-Qiu
    Gao, Jun
    INTELLIGENT COMPUTING METHODOLOGIES, ICIC 2017, PT III, 2017, 10363 : 525 - 535
  • [43] Multi-modal Learning for Social Image Classification
    Liu, Chunyang
    Zhang, Xu
    Li, Xiong
    Li, Rui
    Zhang, Xiaoming
    Chao, Wenhan
    2016 12TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2016, : 1174 - 1179
  • [44] Multi-modal hierarchical fusion network for fine-grained paper classification
    Tan Yue
    Yong Li
    Jiedong Qin
    Zonghai Hu
    Multimedia Tools and Applications, 2024, 83 : 31527 - 31543
  • [45] MAFF-Net: Filter False Positive for 3D Vehicle Detection with Multi-modal Adaptive Feature Fusion
    Zhang, Zehan
    Shen, Yuxi
    Li, Hao
    Zhao, Xian
    Yang, Ming
    Tan, Wenming
    Pu, ShiLiang
    Mao, Hui
    2022 IEEE 25TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2022, : 369 - 376
  • [46] MFF-Net: Towards Efficient Monocular Depth Completion With Multi-Modal Feature Fusion
    Liu, Lina
    Song, Xibin
    Sun, Jiadai
    Lyu, Xiaoyang
    Li, Lin
    Liu, Yong
    Zhang, Liangjun
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (02) : 920 - 927
  • [47] MDFF-Net: A multi-dimensional feature fusion network for breast histopathology image classification
    Xu, Cheng
    Yi, Ke
    Jiang, Nan
    Li, Xiong
    Zhong, Meiling
    Zhang, Yuejin
    COMPUTERS IN BIOLOGY AND MEDICINE, 2023, 165
  • [48] MIA-Net: Multi-Modal Interactive Attention Network for Multi-Modal Affective Analysis
    Li, Shuzhen
    Zhang, Tong
    Chen, Bianna
    Chen, C. L. Philip
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (04) : 2796 - 2809
  • [49] DDIFN: A Dual-discriminator Multi-modal Medical Image Fusion Network
    Liu, Hui
    Li, Shanshan
    Zhu, Jicheng
    Deng, Kai
    Liu, Meng
    Nie, Liqiang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (04)
  • [50] Multi-Modal Image Fusion via Deep Laplacian Pyramid Hybrid Network
    Luo, Xing
    Fu, Guizhong
    Yang, Jiangxin
    Cao, Yanlong
    Cao, Yanpeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7354 - 7369