MSP-IMPROV: An Acted Corpus of Dyadic Interactions to Study Emotion Perception

被引:228
|
作者
Busso, Carlos [1 ]
Parthasarathy, Srinivas [1 ]
Burmania, Alec [1 ]
AbdelWahab, Mohammed [1 ]
Sadoughi, Najmeh [1 ]
Provost, Emily Mower [2 ]
机构
[1] Univ Texas Dallas, Erik Jonsson Sch Engn & Comp Sci, Richardson, TX 75080 USA
[2] Univ Michigan, Comp Sci & Engn Dept, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
Emotion elicitation; audiovisual emotional dataset; emotional evaluation; emotion recognition; SPEECH; CLASSIFICATION;
D O I
10.1109/TAFFC.2016.2515617
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present the MSP-IMPROV corpus, a multimodal emotional database, where the goal is to have control over lexical content and emotion while also promoting naturalness in the recordings. Studies on emotion perception often require stimuli with fixed lexical content, but that convey different emotions. These stimuli can also serve as an instrument to understand how emotion modulates speech at the phoneme level, in a manner that controls for coarticulation. Such audiovisual data are not easily available from natural recordings. A common solution is to record actors reading sentences that portray different emotions, which may not produce natural behaviors. We propose an alternative approach in which we define hypothetical scenarios for each sentence that are carefully designed to elicit a particular emotion. Two actors improvise these emotion-specific situations, leading them to utter contextualized, non-read renditions of sentences that have fixed lexical content and convey different emotions. We describe the context in which this corpus was recorded, the key features of the corpus, the areas in which this corpus can be useful, and the emotional content of the recordings. The paper also provides the performance for speech and facial emotion classifiers. The analysis brings novel classification evaluations where we study the performance in terms of inter-evaluator agreement and naturalness perception, leveraging the large size of the audiovisual database.
引用
收藏
页码:67 / 80
页数:14
相关论文
共 3 条
  • [1] Seeing mixed emotions: Alexithymia, emotion perception bias, and quality in dyadic interactions
    Kafetsios, Konstantinos
    Hess, Ursula
    PERSONALITY AND INDIVIDUAL DIFFERENCES, 2019, 137 : 80 - 85
  • [2] The Demo / Kemo corpus A principled approach to the study of cross-cultural differences in the vocal expression and perception of emotion
    Goudbeek, Martijn
    Broersma, Mirjam
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010,
  • [3] Dynamic Interactions between Emotion Perception and Action Preparation for Reacting to Social Threat: A Combined cTBS-fMRI Study
    Engelen, Tahnee
    Zhan, Minye
    Sack, Alexander T.
    de Gelder, Beatrice
    ENEURO, 2018, 5 (03)