Combining audio and video by dominance in bimodal emotion recognition

被引:0
|
作者
Huang, Lixing [1 ]
Xin, Le [1 ]
Zhao, Liyue [1 ]
Tao, Jianhua [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, NLPR, Beijing 100864, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel bimodal emotion recognition approach by using the boosting-based framework, in which we can automatically determine the adaptive weights for audio and visual features. In this way, we balance the dominances of audio and visual features dynamically in feature-level to obtain better performance.
引用
收藏
页码:729 / 730
页数:2
相关论文
共 50 条
  • [1] Audio and Video Bimodal Emotion Recognition in Social Networks Based on Improved AlexNet Network and Attention Mechanism
    Liu, Min
    Tang, Jun
    [J]. JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2021, 17 (04): : 754 - 771
  • [2] Ubiquitous Emotion Recognition Using Audio and Video Data
    Jannat, Rahatul
    Tynes, Iyonna
    LaLime, Lott
    Adorno, Juan
    Canavan, Shaun
    [J]. PROCEEDINGS OF THE 2018 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2018 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS (UBICOMP/ISWC'18 ADJUNCT), 2018, : 956 - 959
  • [3] Emotion Recognition Using Fusion of Audio and Video Features
    Ortega, Juan D. S.
    Cardinal, Patrick
    Koerich, Alessandro L.
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 3847 - 3852
  • [4] Audio-Visual Emotion Recognition in Video Clips
    Noroozi, Fatemeh
    Marjanovic, Marina
    Njegus, Angelina
    Escalera, Sergio
    Anbarjafari, Gholamreza
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2019, 10 (01) : 60 - 75
  • [5] Deep neural networks for emotion recognition combining audio and transcripts
    Cho, Jaejin
    Pappagari, Raghavendra
    Kulkarni, Purva
    Villalba, Jesus
    Carmiel, Yishay
    Dehak, Najim
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 247 - 251
  • [6] Bimodal Emotion Recognition
    Paleari, Marco
    Chellali, Ryad
    Huet, Benoit
    [J]. SOCIAL ROBOTICS, ICSR 2010, 2010, 6414 : 305 - 314
  • [7] Exploring Emotion Features and Fusion Strategies for Audio-Video Emotion Recognition
    Zhou, Hengshun
    Meng, Debin
    Zhang, Yuanyuan
    Peng, Xiaojiang
    Du, Jun
    Wang, Kai
    Qiao, Yu
    [J]. ICMI'19: PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, 2019, : 562 - 566
  • [8] Multi-Modal Emotion Recognition Fusing Video and Audio
    Xu, Chao
    Du, Pufeng
    Feng, Zhiyong
    Meng, Zhaopeng
    Cao, Tianyi
    Dong, Caichao
    [J]. APPLIED MATHEMATICS & INFORMATION SCIENCES, 2013, 7 (02): : 455 - 462
  • [9] Emotion Recognition of College Students Based on Audio and Video Image
    Zhu, Chenjie
    Ding, Ting
    Min, Xue
    [J]. TRAITEMENT DU SIGNAL, 2022, 39 (05) : 1475 - 1481
  • [10] Audio-Video Fusion with Double Attention for Multimodal Emotion Recognition
    Mocanu, Bogdan
    Tapu, Ruxandra
    [J]. 2022 IEEE 14TH IMAGE, VIDEO, AND MULTIDIMENSIONAL SIGNAL PROCESSING WORKSHOP (IVMSP), 2022,