A deep multimodal generative and fusion framework for class-imbalanced multimodal data

被引:0
|
作者
Qing Li
Guanyuan Yu
Jun Wang
Yuehao Liu
机构
[1] Southwestern University of Finance and Economics,Fintech Innovation Center and School of Economic Information Engineering
来源
关键词
Multimodal classification; Class-imbalanced data; Deep multimodal generative adversarial network; Deep multimodal hybrid fusion network;
D O I
暂无
中图分类号
学科分类号
摘要
The purpose of multimodal classification is to integrate features from diverse information sources to make decisions. The interactions between different modalities are crucial to this task. However, common strategies in previous studies have been to either concatenate features from various sources into a single compound vector or input them separately into several different classifiers that are then assembled into a single robust classifier to generate the final prediction. Both of these approaches weaken or even ignore the interactions among different feature modalities. In addition, in the case of class-imbalanced data, multimodal classification becomes troublesome. In this study, we propose a deep multimodal generative and fusion framework for multimodal classification with class-imbalanced data. This framework consists of two modules: a deep multimodal generative adversarial network (DMGAN) and a deep multimodal hybrid fusion network (DMHFN). The DMGAN is used to handle the class imbalance problem. The DMHFN identifies fine-grained interactions and integrates different information sources for multimodal classification. Experiments on a faculty homepage dataset show the superiority of our framework compared to several start-of-the-art methods.
引用
收藏
页码:25023 / 25050
页数:27
相关论文
共 50 条
  • [41] Deep Imbalanced Learning for Multimodal Emotion Recognition in Conversations
    Meng, Tao
    Shou, Yuntao
    Ai, Wei
    Yin, Nan
    Li, Keqin
    IEEE Transactions on Artificial Intelligence, 2024, 5 (12): : 6472 - 6487
  • [42] CHALLENGES IN MULTIMODAL DATA FUSION
    Lahat, Dana
    Adali, Tulay
    Jutten, Christian
    2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 101 - 105
  • [43] A generative adaptive convolutional neural network with attention mechanism for driver fatigue detection with class-imbalanced and insufficient data
    He, Le
    Zhang, Li
    Sun, Qiang
    Lin, XiangTian
    BEHAVIOURAL BRAIN RESEARCH, 2024, 464
  • [44] A Multimodal Framework for Unsupervised Feature Fusion
    Li, Xiaoyi
    Gao, Jing
    Li, Hui
    Yang, Le
    Srihari, Rohini K.
    PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 897 - 902
  • [45] MedFusionGAN: multimodal medical image fusion using an unsupervised deep generative adversarial network
    Safari, Mojtaba
    Fatemi, Ali
    Archambault, Louis
    BMC MEDICAL IMAGING, 2023, 23 (01)
  • [46] MedFusionGAN: multimodal medical image fusion using an unsupervised deep generative adversarial network
    Mojtaba Safari
    Ali Fatemi
    Louis Archambault
    BMC Medical Imaging, 23
  • [47] Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data
    Rangwani, Harsh
    Aithal, Sumukh K.
    Mishra, Mayank
    Babu, R. Venkatesh
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [48] OligoIS: Scalable Instance Selection for Class-Imbalanced Data Sets
    Garcia-Pedrajas, Nicolas
    Perez-Rodriguez, Javier
    de Haro-Garcia, Aida
    IEEE TRANSACTIONS ON CYBERNETICS, 2013, 43 (01) : 332 - 346
  • [49] An Improved Deep Learning Framework for Multimodal Medical Data Analysis
    Kumar, Sachin
    Sharma, Shivani
    Big Data and Cognitive Computing, 2024, 8 (10)
  • [50] A novel graph oversampling framework for node classification in class-imbalanced graphs
    Riting XIA
    Chunxu ZHANG
    Yan ZHANG
    Xueyan LIU
    Bo YANG
    Science China(Information Sciences), 2024, 67 (06) : 214 - 229