Multimodal Emotion Recognition in Response to Videos

被引:457
|
作者
Soleymani, Mohammad [1 ]
Pantic, Maja [2 ,3 ]
Pun, Thierry [1 ]
机构
[1] Univ Geneva, Dept Comp Sci, Comp Vis & Multimedia Lab, CH-1227 Carouge, GE, Switzerland
[2] Univ London Imperial Coll Sci Technol & Med, Dept Comp, London SW7 2AZ, England
[3] Univ Twente, Fac Elect Engn Math & Comp Sci, NL-7522 NB Enschede, Netherlands
基金
欧洲研究理事会; 瑞士国家科学基金会;
关键词
Emotion recognition; EEG; pupillary reflex; pattern classification; affective computing; PUPIL LIGHT REFLEX; CLASSIFICATION; OSCILLATIONS; SYSTEMS;
D O I
10.1109/T-AFFC.2011.37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a user-independent emotion recognition method with the goal of recovering affective tags for videos using electroencephalogram (EEG), pupillary response and gaze distance. We first selected 20 video clips with extrinsic emotional content from movies and online resources. Then, EEG responses and eye gaze data were recorded from 24 participants while watching emotional video clips. Ground truth was defined based on the median arousal and valence scores given to clips in a preliminary study using an online questionnaire. Based on the participants' responses, three classes for each dimension were defined. The arousal classes were calm, medium aroused, and activated and the valence classes were unpleasant, neutral, and pleasant. One of the three affective labels of either valence or arousal was determined by classification of bodily responses. A one-participant-out cross validation was employed to investigate the classification performance in a user-independent approach. The best classification accuracies of 68.5 percent for three labels of valence and 76.4 percent for three labels of arousal were obtained using a modality fusion strategy and a support vector machine. The results over a population of 24 participants demonstrate that user-independent emotion recognition can outperform individual self-reports for arousal assessments and do not underperform for valence assessments.
引用
收藏
页码:211 / 223
页数:13
相关论文
共 50 条
  • [1] Multimodal Emotion Recognition in Response to Videos (Extended Abstract)
    Soleymani, Mohammad
    Pantic, Maja
    Pun, Thierry
    2015 INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2015, : 491 - 497
  • [2] Emotion Recognition in Videos via Fusing Multimodal Features
    Chen, Shizhe
    Dian, Yujie
    Li, Xinrui
    Lin, Xiaozhu
    Jin, Qin
    Liu, Haibo
    Lu, Li
    PATTERN RECOGNITION (CCPR 2016), PT II, 2016, 663 : 632 - 644
  • [3] Exploring the Contextual Factors Affecting Multimodal Emotion Recognition in Videos
    Bhattacharya, Prasanta
    Gupta, Raj Kumar
    Yang, Yinping
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (02) : 1547 - 1557
  • [4] Emotion Recognition from Videos Using Multimodal Large Language Models
    Vaiani, Lorenzo
    Cagliero, Luca
    Garza, Paolo
    FUTURE INTERNET, 2024, 16 (07)
  • [5] MEmoR: A Dataset for Multimodal Emotion Reasoning in Videos
    Shen, Guangyao
    Wang, Xin
    Duan, Xuguang
    Li, Hongzhi
    Zhu, Wenwu
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 493 - 502
  • [6] Emotion recognition and reaction prediction in videos
    Ronghe, Nimish
    Nakashe, Sayali
    Pawar, Ashish
    Bobde, Sarika
    2017 THIRD IEEE INTERNATIONAL CONFERENCE ON RESEARCH IN COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (ICRCICN), 2017, : 26 - 32
  • [7] An Emotion-Space Model of Multimodal Emotion Recognition
    Choe, Kyung-Il
    ADVANCED SCIENCE LETTERS, 2018, 24 (01) : 699 - 702
  • [8] Tandem Modelling Based Emotion Recognition in Videos
    Kasraoui, Salma
    Lachiri, Zied
    Madani, Kurosh
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2019, PT II, 2019, 11507 : 325 - 336
  • [9] Twosome Modelling based Emotion Recognition in Videos
    Kasraoui, Salma
    Lachiri, Zied
    Madani, Kurosh
    PROCEEDINGS OF THE 2019 10TH IEEE INTERNATIONAL CONFERENCE ON INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS - TECHNOLOGY AND APPLICATIONS (IDAACS), VOL. 1, 2019, : 290 - 293
  • [10] Multimodal emotion recognition and expressivity analysis
    Kollias, S
    Karpouzis, K
    2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2, 2005, : 779 - 783