Multimodal Conversation Emotion Recognition Combining Multi- Level Attention and Multi-Stream Graph Neural Networks

被引:0
|
作者
Feng, Hongqi [1 ]
Guo, Yongxiang [1 ]
Zhang, Denghui [2 ]
Yang, Xinli [2 ]
机构
[1] School of Computer Science and Artificial Intelligence, Changzhou University, Jiangsu, Changzhou,213100, China
[2] College of Information Technology, Zhejiang Shuren University, Hangzhou,310000, China
关键词
Graph neural networks;
D O I
10.3778/j.issn.1002-8331.2307-0196
中图分类号
学科分类号
摘要
引用
收藏
页码:154 / 163
相关论文
共 50 条
  • [1] Multivariate, Multi-frequency and Multimodal: Rethinking Graph Neural Networks for Emotion Recognition in Conversation
    Chen, Feiyu
    Shao, Jie
    Zhu, Shuyuan
    Shen, Heng Tao
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10761 - 10770
  • [2] Multi-Stream Convolution-Recurrent Neural Networks Based on Attention Mechanism Fusion for Speech Emotion Recognition
    Tao, Huawei
    Geng, Lei
    Shan, Shuai
    Mai, Jingchao
    Fu, Hongliang
    ENTROPY, 2022, 24 (08)
  • [3] MLGAT: multi-layer graph attention networks for multimodal emotion recognition in conversations
    Wu, Jun
    Wu, Junwei
    Zheng, Yu
    Zhan, Pengfei
    Han, Min
    Zuo, Gan
    Yang, Li
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2024, : 375 - 394
  • [4] Multimodal Gesture Recognition Using Multi-stream Recurrent Neural Network
    Nishida, Noriki
    Nakayama, Hideki
    IMAGE AND VIDEO TECHNOLOGY, PSIVT 2015, 2016, 9431 : 682 - 694
  • [5] Multi-stream graph attention network for recommendation with knowledge graph
    Hu, Zhifei
    Xia, Feng
    JOURNAL OF WEB SEMANTICS, 2024, 82
  • [6] Fusing multi-stream deep neural networks for facial expression recognition
    Fatima Zahra Salmam
    Abdellah Madani
    Mohamed Kissi
    Signal, Image and Video Processing, 2019, 13 : 609 - 616
  • [7] Fusing multi-stream deep neural networks for facial expression recognition
    Zahra Salmam, Fatima
    Madani, Abdellah
    Kissi, Mohamed
    SIGNAL IMAGE AND VIDEO PROCESSING, 2019, 13 (03) : 609 - 616
  • [8] Multi-stream Attention-based BLSTM with Feature Segmentation for Speech Emotion Recognition
    Chiba, Yuya
    Nose, Takashi
    Ito, Akinori
    INTERSPEECH 2020, 2020, : 3301 - 3305
  • [9] FrameERC: Framelet Transform Based Multimodal Graph Neural Networks for Emotion Recognition in Conversation
    Li, Ming
    Shi, Jiandong
    Bai, Lu
    Huang, Changqin
    Jiang, Yunliang
    Lu, Ke
    Wang, Shijin
    Hancock, Edwin R.
    PATTERN RECOGNITION, 2025, 161
  • [10] Multimodal Egocentric Activity Recognition Using Multi-stream CNN
    Imran, Javed
    Raman, Balasubramanian
    ELEVENTH INDIAN CONFERENCE ON COMPUTER VISION, GRAPHICS AND IMAGE PROCESSING (ICVGIP 2018), 2018,