A Two-Stage Multi-Modal Multi-Label Emotion Recognition Decision System Based on GCN

被引:0
|
作者
Wu, Weiwei [1 ]
Chen, Daomin [2 ]
Li, Qingping [1 ]
机构
[1] Zhejiang Yuying Coll Vocat Technol, Hangzhou, Peoples R China
[2] Guangdong Univ Sci & Technol, Guangzhou, Peoples R China
关键词
Multiple Modalities; Multi-Label Classification; Graph Convolutional Networks; Emotion Detection;
D O I
10.4018/IJDSST.352398
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Compared with single-modal methods, emotion recognition research is increasingly focusing on the use of multi-modal methods to improve accuracy. Despite the advantages of multimodality, challenges such as feature fusion and redundancy remain. In this study, we propose a multi-modal multi-label emotion recognition decision system based on graph convolution. Our approach utilizes text, speech, and video data for feature extraction, while combining tag attention to capture fine-grained modal dependencies. The two-stage feature reconstruction module facilitates complementary feature fusion while preserving mode-specific information. Emotional decisions are made using a fully connected layer to optimize performance without adding complexity to the model. Experimental results on IEMOCAP, CMU-MOSEI and MELD datasets show that our algorithm has higher accuracy than existing models, highlighting the effectiveness and innovation of our proposed algorithm.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] A multi-modal deep learning system for Arabic emotion recognition
    Abu Shaqra F.
    Duwairi R.
    Al-Ayyoub M.
    International Journal of Speech Technology, 2023, 26 (01) : 123 - 139
  • [22] GCN-BASED MULTI-MODAL MULTI-LABEL ATTRIBUTE CLASSIFICATION IN ANIME ILLUSTRATION USING DOMAIN-SPECIFIC SEMANTIC FEATURES
    Lan, Ziwen
    Maeda, Keisuke
    Ogawa, Takahiro
    Haseyama, Miki
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2021 - 2025
  • [23] Multi-modal multi-label semantic indexing of images based on hybrid ensemble learning
    Li, Wei
    Sun, Maosong
    Habel, Christopher
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2007, 2007, 4810 : 744 - +
  • [24] MHM: Multi-modal Clinical Data based Hierarchical Multi-label Diagnosis Prediction
    Qiao, Zhi
    Zhang, Zhen
    Wu, Xian
    Ge, Shen
    Fan, Wei
    PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20), 2020, : 1841 - 1844
  • [25] Semantic Enhancement Network Integrating Label Knowledge for Multi-modal Emotion Recognition
    Zheng, HongFeng
    Miao, ShengFa
    Yu, Qian
    Mu, YongKang
    Jin, Xin
    Yan, KeShan
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT V, ICIC 2024, 2024, 14879 : 473 - 484
  • [26] Common and Discriminative Semantic Pursuit for Multi-Modal Multi-Label Learning
    Zhang, Yi
    Shen, Jundong
    Zhang, Zhecheng
    Wang, Chongjun
    ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 1666 - 1673
  • [27] Multi-modal Attention for Speech Emotion Recognition
    Pan, Zexu
    Luo, Zhaojie
    Yang, Jichen
    Li, Haizhou
    INTERSPEECH 2020, 2020, : 364 - 368
  • [28] Towards Efficient Multi-Modal Emotion Recognition
    Dobrisek, Simon
    Gajsek, Rok
    Mihelic, France
    Pavesic, Nikola
    Struc, Vitomir
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2013, 10
  • [29] Evaluation and Discussion of Multi-modal Emotion Recognition
    Rabie, Ahmad
    Wrede, Britta
    Vogt, Thurid
    Hanheide, Marc
    SECOND INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING, VOL 1, PROCEEDINGS, 2009, : 598 - +
  • [30] Emotion Recognition from Multi-Modal Information
    Wu, Chung-Hsien
    Lin, Jen-Chun
    Wei, Wen-Li
    Cheng, Kuan-Chun
    2013 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2013,