C-BGA: Multimodal Speech Emotion Recognition Network Combining Contrastive Learning

被引:0
|
作者
Miao, Borui [1 ]
Xu, Yunfeng [1 ]
Zhao, Shaojie [1 ]
Wang, Jialin [1 ]
机构
[1] School of Information Science and Engineering, Hebei University of Science and Technology, Shijiazhuang,050000, China
来源
关键词
Emotion Recognition - Modal analysis - Speech recognition;
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
页码:168 / 176
相关论文
共 50 条
  • [1] FCAN : Speech emotion recognition network based on focused contrastive learning
    Kang, Hong
    Xu, Yunfeng
    Jin, Guowei
    Wang, Jialin
    Miao, Borui
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 96
  • [2] CONTRASTIVE UNSUPERVISED LEARNING FOR SPEECH EMOTION RECOGNITION
    Li, Mao
    Yang, Bo
    Levy, Joshua
    Stolcke, Andreas
    Rozgic, Viktor
    Matsoukas, Spyros
    Papayiannis, Constantinos
    Bone, Daniel
    Wang, Chao
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6329 - 6333
  • [3] Multimodal Knowledge-enhanced Interactive Network with Mixed Contrastive Learning for Emotion Recognition in Conversation
    Shen, Xudong
    Huang, Xianying
    Zou, Shihao
    Gan, Xinyi
    Neurocomputing, 2024, 582
  • [4] Multimodal Knowledge-enhanced Interactive Network with Mixed Contrastive Learning for Emotion Recognition in Conversation
    Shen, Xudong
    Huang, Xianying
    Zou, Shihao
    Gan, Xinyi
    NEUROCOMPUTING, 2024, 582
  • [5] Learning Alignment for Multimodal Emotion Recognition from Speech
    Xu, Haiyang
    Zhang, Hui
    Han, Kun
    Wang, Yun
    Peng, Yiping
    Li, Xiangang
    INTERSPEECH 2019, 2019, : 3569 - 3573
  • [6] Multimodal Prompt Transformer with Hybrid Contrastive Learning for Emotion Recognition in Conversation
    Zou, Shihao
    Huang, Xianying
    Shen, Xudong
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 5994 - 6003
  • [7] Temporal Relation Inference Network for Multimodal Speech Emotion Recognition
    Dong, Guan-Nan
    Pun, Chi-Man
    Zhang, Zheng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6472 - 6485
  • [8] ScSer: Supervised Contrastive Learning for Speech Emotion Recognition using Transformers
    Alaparthi, Varun Sai
    Pasam, Tejeswara Reddy
    Inagandla, Deepak Abhiram
    Prakash, Jay
    Singh, Pramod Kumar
    2022 15TH INTERNATIONAL CONFERENCE ON HUMAN SYSTEM INTERACTION (HSI), 2022,
  • [9] Learning Mutual Correlation in Multimodal Transformer for Speech Emotion Recognition
    Wang, Yuhua
    Shen, Guang
    Xu, Yuezhu
    Li, Jiahang
    Zhao, Zhengdao
    INTERSPEECH 2021, 2021, : 4518 - 4522
  • [10] Combining Multimodal Features within a Fusion Network for Emotion Recognition in the Wild
    Sun, Bo
    Li, Liandong
    Zhou, Guoyan
    Wu, Xuewen
    He, Jun
    Yu, Lejun
    Li, Dongxue
    Wei, Qinglan
    ICMI'15: PROCEEDINGS OF THE 2015 ACM INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2015, : 497 - 502