Enhancing Cross-Language Multimodal Emotion Recognition With Dual Attention Transformers

被引:0
|
作者
Muhammad Zaidi, Syed Aun [1 ]
Latif, Siddique [2 ]
Qadir, Junaid [3 ]
机构
[1] Information Technology University (ITU), Lahore,54700, Pakistan
[2] Queensland University of Technology (QUT), Brisbane,QLD,4000, Australia
[3] Computer Science and Engineering Department, College of Engineering, Qatar University, Doha, Qatar
关键词
D O I
10.1109/OJCS.2024.3486904
中图分类号
学科分类号
摘要
65
引用
收藏
页码:684 / 693
相关论文
共 50 条
  • [41] Speech Emotion Recognition Using Dual-Stream Representation and Cross-Attention Fusion
    Yu, Shaode
    Meng, Jiajian
    Fan, Wenqing
    Chen, Ye
    Zhu, Bing
    Yu, Hang
    Xie, Yaoqin
    Sun, Qiuirui
    ELECTRONICS, 2024, 13 (11)
  • [42] Attentive to Individual: A Multimodal Emotion Recognition Network with Personalized Attention Profile
    Li, Jeng-Lin
    Lee, Chi-Chun
    INTERSPEECH 2019, 2019, : 211 - 215
  • [43] Enhancing Cross-Language Question Answering by combining multiple question translations
    Aceves-Perez, Rita M.
    Montes-y-Gomez, Manuel
    Villasenor-Pineda, Luis
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2007, 4394 : 485 - 493
  • [44] Audio-Video Fusion with Double Attention for Multimodal Emotion Recognition
    Mocanu, Bogdan
    Tapu, Ruxandra
    2022 IEEE 14TH IMAGE, VIDEO, AND MULTIDIMENSIONAL SIGNAL PROCESSING WORKSHOP (IVMSP), 2022,
  • [45] Hierarchical Attention Approach in Multimodal Emotion Recognition for Human Robot Interaction
    Abdullah, Muhammad
    Ahmad, Mobeen
    Han, Dongil
    2021 36TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC), 2021,
  • [46] Multimodal Emotion Recognition using Cross-Modal Attention and 1D Convolutional Neural Networks
    Krishna, D. N.
    Patil, Ankita
    INTERSPEECH 2020, 2020, : 4243 - 4247
  • [47] Multimodal emotion recognition using cross modal audio-video fusion with attention and deep metric learning
    Mocanu, Bogdan
    Tapu, Ruxandra
    Zaharia, Titus
    IMAGE AND VISION COMPUTING, 2023, 133
  • [48] Cross-language multimodal scene semantic guidance and leap sampling for video captioning
    Sun, Bo
    Wu, Yong
    Zhao, Yijia
    Hao, Zhuo
    Yu, Lejun
    He, Jun
    VISUAL COMPUTER, 2023, 39 (01): : 9 - 25
  • [49] Cross-language multimodal scene semantic guidance and leap sampling for video captioning
    Bo Sun
    Yong Wu
    Yijia Zhao
    Zhuo Hao
    Lejun Yu
    Jun He
    The Visual Computer, 2023, 39 : 9 - 25
  • [50] CROSS-CULTURE MULTIMODAL EMOTION RECOGNITION WITH ADVERSARIAL LEARNING
    Liang, Jingjun
    Chen, Shizhe
    Zhao, Jinming
    Jin, Qin
    Liu, Haibo
    Lu, Li
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 4000 - 4004