DBT: multimodal emotion recognition based on dual-branch transformer

Cited by: 0
Authors
Yufan Yi
Yan Tian
Cong He
Yajing Fan
Xinli Hu
Yiping Xu
Affiliations
[1] Huazhong University of Science and Technology, School of Electronic Information and Communications
[2] Chinese Academy of Sciences, Aerospace Information Research Institute
Source
Keywords
wav2vec2.0; model fine-tuning; adaptive interlayer fusion; weighted label smoothing; weighted DS strategy
DOI
Not available
CLC number
Subject classification number
Abstract
Labeled datasets for speech emotion recognition are scarce, because emotion is subjective and labeling experts need considerable time to identify emotion categories. The wav2vec2.0 model, by contrast, is a general-purpose model that learns speech representations through self-supervised training, so we apply it to the speech emotion recognition task. We propose a multimodal dual-branch transformer network. In the speech branch, we first extract speech features with wav2vec2.0, applying a fine-tuning strategy and a self-attention-based interlayer feature fusion strategy; a fully convolutional classification network then performs emotion classification. In the text branch, we use RoBERTa for text emotion recognition. The two modalities are fused with an improved weighted Dempster–Shafer (DS) strategy. In addition, we propose an accuracy-weighted label smoothing method that improves recognition accuracy. We run comprehensive experiments on two benchmarks, IEMOCAP (English) and CASIA (Chinese). The results show that the proposed method achieves higher accuracy than state-of-the-art methods.
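The abstract does not spell out the improved weighted DS strategy, so the following is only a minimal sketch of the general idea, assuming classic Dempster's rule of combination with Shafer discounting as a stand-in for the weighting. The function names, the reliability values, and the three-class example are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple


def discount(probs: List[float], reliability: float) -> Tuple[List[float], float]:
    """Shafer discounting (assumed weighting scheme, not the paper's).

    Scales each class mass by `reliability` and assigns the remaining
    mass to the whole frame Theta, which represents "don't know".
    """
    singletons = [reliability * p for p in probs]
    theta = 1.0 - sum(singletons)
    return singletons, theta


def dempster_combine(
    m1: List[float], theta1: float, m2: List[float], theta2: float
) -> Tuple[List[float], float]:
    """Dempster's rule for masses on single classes plus Theta."""
    # Class i is supported when both sources agree on i, or when one
    # source says i and the other is ignorant (Theta).
    joint = [a * b + a * theta2 + theta1 * b for a, b in zip(m1, m2)]
    theta = theta1 * theta2
    # Everything else is conflict K; `total` equals 1 - K.
    total = sum(joint) + theta
    if total <= 0.0:
        raise ValueError("sources are in total conflict")
    return [v / total for v in joint], theta / total


def fuse(
    speech_probs: List[float],
    text_probs: List[float],
    speech_reliability: float,
    text_reliability: float,
) -> Tuple[List[float], float]:
    """Fuse two per-class probability vectors into combined masses."""
    m1, t1 = discount(speech_probs, speech_reliability)
    m2, t2 = discount(text_probs, text_reliability)
    return dempster_combine(m1, t1, m2, t2)


if __name__ == "__main__":
    # Hypothetical outputs for three emotion classes from each branch.
    speech = [0.6, 0.3, 0.1]
    text = [0.2, 0.7, 0.1]
    masses, ignorance = fuse(speech, text, 0.7, 0.9)
    print(masses, ignorance)
```

In this sketch a branch with lower reliability moves more of its mass to Theta, so it has less influence on the fused decision; one plausible choice, again an assumption, is to derive the reliabilities from each branch's validation accuracy.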
Pages: 8611-8633
Page count: 22
Citation
Yi, Yufan; Tian, Yan; He, Cong; Fan, Yajing; Hu, Xinli; Xu, Yiping. DBT: multimodal emotion recognition based on dual-branch transformer [J]. Journal of Supercomputing, 2023, 79(08): 8611-8633.