Emotion recognition using multimodal deep learning in multiple psychophysiological signals and video

Cited by: 38
Authors
Wang, Zhongmin [1,2]
Zhou, Xiaoxiao [1]
Wang, Wenlang [1,2]
Liang, Chen [1,2]
Affiliations
[1] Xian Univ Posts & Telecommun, Sch Comp Sci & Technol, 618 West Changan St, Xian 710121, Shaanxi, Peoples R China
[2] Xian Univ Posts & Telecommun, Shaanxi Key Lab Network Data Anal & Intelligent P, 618 West Changan St, Xian 710121, Shaanxi, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Emotion recognition; Psychophysiological signals; Video streams; Multimodal features; Deep belief networks
DOI
10.1007/s13042-019-01056-8
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Emotion recognition has attracted great interest. Numerous emotion recognition approaches have been proposed, most of which focus on visual, acoustic or psychophysiological information individually. Although more recent research has considered multimodal approaches, individual modalities are often combined only by simple fusion or are directly fused with deep learning networks at the feature level. In this paper, we propose an approach to training several specialist networks that employs deep learning techniques to fuse the features of individual modalities. This approach includes a multimodal deep belief network (MDBN), which optimizes and fuses unified psychophysiological features derived from the features of multiple psychophysiological signals, a bimodal deep belief network (BDBN) that focuses on representative visual features among the features of a video stream, and another BDBN that focuses on the high multimodal features in the unified features obtained from two modalities. Experiments are conducted on the BioVid Emo DB database and 80.89% accuracy is achieved, which outperforms the state-of-the-art approaches. The results demonstrate that the proposed approach can solve the problems of feature redundancy and lack of key features caused by multimodal fusion.
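The fusion flow described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: each deep belief network is replaced here by a single restricted Boltzmann machine (RBM) trained with one-step contrastive divergence, and the class `RBM`, the helpers `concat` and `fuse_modalities`, the layer sizes, and the random toy data are all hypothetical names and values chosen only for illustration.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class RBM:
    """Tiny Bernoulli RBM trained with CD-1 (stand-in for a full DBN)."""

    def __init__(self, n_visible, n_hidden, rng):
        self.rng = rng
        self.w = [[rng.gauss(0.0, 0.1) for _ in range(n_hidden)]
                  for _ in range(n_visible)]
        self.b_vis = [0.0] * n_visible
        self.b_hid = [0.0] * n_hidden

    def hidden_probs(self, v):
        return [sigmoid(self.b_hid[j] + sum(v[i] * self.w[i][j] for i in range(len(v))))
                for j in range(len(self.b_hid))]

    def visible_probs(self, h):
        return [sigmoid(self.b_vis[i] + sum(h[j] * self.w[i][j] for j in range(len(h))))
                for i in range(len(self.b_vis))]

    def fit(self, data, epochs=5, lr=0.1):
        for _ in range(epochs):
            for v0 in data:
                # Positive phase, then one Gibbs step for the negative phase.
                h0 = self.hidden_probs(v0)
                h0_sample = [1.0 if self.rng.random() < p else 0.0 for p in h0]
                v1 = self.visible_probs(h0_sample)
                h1 = self.hidden_probs(v1)
                for i in range(len(v0)):
                    for j in range(len(h0)):
                        self.w[i][j] += lr * (v0[i] * h0[j] - v1[i] * h1[j])
                    self.b_vis[i] += lr * (v0[i] - v1[i])
                for j in range(len(h0)):
                    self.b_hid[j] += lr * (h0[j] - h1[j])
        return self

    def transform(self, data):
        return [self.hidden_probs(v) for v in data]


def concat(*feature_sets):
    """Concatenate per-sample feature vectors from several feature sets."""
    return [sum((list(row) for row in rows), []) for rows in zip(*feature_sets)]


def fuse_modalities(physio, video, n_hidden=4, seed=0):
    """Hypothetical three-stage fusion mirroring the abstract's description."""
    rng = random.Random(seed)
    # Stage 1 (MDBN stand-in): encode each signal, then fuse into unified features.
    per_signal = [RBM(len(x[0]), n_hidden, rng).fit(x).transform(x) for x in physio]
    joint = concat(*per_signal)
    physio_unified = RBM(len(joint[0]), n_hidden, rng).fit(joint).transform(joint)
    # Stage 2 (first BDBN stand-in): representative features of the video stream.
    video_repr = RBM(len(video[0]), n_hidden, rng).fit(video).transform(video)
    # Stage 3 (second BDBN stand-in): fuse the two modalities.
    both = concat(physio_unified, video_repr)
    return RBM(len(both[0]), n_hidden, rng).fit(both).transform(both)


# Toy data: 8 samples, three made-up physiological feature sets and one video set.
data_rng = random.Random(1)


def toy(n_samples, n_features):
    return [[data_rng.random() for _ in range(n_features)] for _ in range(n_samples)]


physio = [toy(8, 5), toy(8, 6), toy(8, 3)]
video = toy(8, 10)
fused = fuse_modalities(physio, video)
print(len(fused), len(fused[0]))
```

The fused vectors would then be passed to a classifier; the abstract does not say which one, so none is shown here.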
Pages: 923-934
Page count: 12
Related papers
50 in total
  • [1] Emotion recognition using multimodal deep learning in multiple psychophysiological signals and video
    Zhongmin Wang
    Xiaoxiao Zhou
    Wenlang Wang
    Chen Liang
    [J]. International Journal of Machine Learning and Cybernetics, 2020, 11 : 923 - 934
  • [2] Deep Representation Learning for Multimodal Emotion Recognition Using Physiological Signals
    Zubair, Muhammad
    Woo, Sungpil
    Lim, Sunhwan
    Yoon, Changwoo
    [J]. IEEE ACCESS, 2024, 12 : 106605 - 106617
  • [3] Emotion Recognition Using Multimodal Deep Learning
    Liu, Wei
    Zheng, Wei-Long
    Lu, Bao-Liang
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2016, PT II, 2016, 9948 : 521 - 529
  • [4] EmoNets: Multimodal deep learning approaches for emotion recognition in video
    Samira Ebrahimi Kahou
    Xavier Bouthillier
    Pascal Lamblin
    Caglar Gulcehre
    Vincent Michalski
    Kishore Konda
    Sébastien Jean
    Pierre Froumenty
    Yann Dauphin
    Nicolas Boulanger-Lewandowski
    Raul Chandias Ferrari
    Mehdi Mirza
    David Warde-Farley
    Aaron Courville
    Pascal Vincent
    Roland Memisevic
    Christopher Pal
    Yoshua Bengio
    [J]. Journal on Multimodal User Interfaces, 2016, 10 : 99 - 111
  • [5] EmoNets: Multimodal deep learning approaches for emotion recognition in video
    Kahou, Samira Ebrahimi
    Bouthillier, Xavier
    Lamblin, Pascal
    Gulcehre, Caglar
    Michalski, Vincent
    Konda, Kishore
    Jean, Sebastien
    Froumenty, Pierre
    Dauphin, Yann
    Boulanger-Lewandowski, Nicolas
    Ferrari, Raul Chandias
    Mirza, Mehdi
    Warde-Farley, David
    Courville, Aaron
    Vincent, Pascal
    Memisevic, Roland
    Pal, Christopher
    Bengio, Yoshua
    [J]. JOURNAL ON MULTIMODAL USER INTERFACES, 2016, 10 (02) : 99 - 111
  • [6] Audio-Video Based Multimodal Emotion Recognition Using SVMs and Deep Learning
    Sun, Bo
    Xu, Qihua
    He, Jun
    Yu, Lejun
    Li, Liandong
    Wei, Qinglan
    [J]. PATTERN RECOGNITION (CCPR 2016), PT II, 2016, 663 : 621 - 631
  • [7] Multimodal Arabic emotion recognition using deep learning
    Al Roken, Noora
    Barlas, Gerassimos
    [J]. SPEECH COMMUNICATION, 2023, 155
  • [8] Multimodal Emotion Recognition using Deep Learning Architectures
    Ranganathan, Hiranmayi
    Chakraborty, Shayok
    Panchanathan, Sethuraman
    [J]. 2016 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2016), 2016,
  • [9] Automatic Emotion Recognition Using Temporal Multimodal Deep Learning
    Nakisa, Bahareh
    Rastgoo, Mohammad Naim
    Rakotonirainy, Andry
    Maire, Frederic
    Chandran, Vinod
    [J]. IEEE ACCESS, 2020, 8 : 225463 - 225474
  • [10] Multimodal machine learning approach for emotion recognition using physiological signals
    Ramadan, Mohamad A.
    Salem, Nancy M.
    Mahmoud, Lamees N.
    Sadek, Ibrahim
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, 96