The MSP-Conversation Corpus

被引:20
|
作者
Martinez-Lucas, Luz [1 ]
Abdelwahab, Mohammed [1 ]
Busso, Carlos [1 ]
机构
[1] Univ Texas Dallas, Dept Elect & Comp Engn, Multimodal Signal Proc MSP Lab, Richardson, TX 75080 USA
来源
关键词
Speech emotion recognition; human-computer interaction; time-continuous emotional attributes; EMOTIONAL SPEECH; CORE AFFECT; THINGS;
D O I
10.21437/Interspeech.2020-2444
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Human-computer interactions can be very effective, especially if computers can automatically recognize the emotional state of the user. A key barrier for effective speech emotion recognition systems is the lack of large corpora annotated with emotional labels that reflect the temporal complexity of expressive behaviors, especially during multiparty interactions. This paper introduces the MSP-Conversation corpus, which contains interactions annotated with time-continuous emotional traces for arousal (calm to active), valence (negative to positive), and dominance (weak to strong). Time-continuous annotations offer the flexibility to explore emotional displays at different temporal resolutions while leveraging contextual information. This is an ongoing effort, where the corpus currently contains more than 15 hours of speech annotated by at least five annotators. The data is sourced from the MSP-Podcast corpus, which contains speech data from online audio-sharing websites annotated with sentence-level emotional scores. This data collection scheme is an easy, affordable, and scalable approach to obtain natural data with diverse emotional content from multiple speakers. This study describes the key features of the corpus. It also compares the time-continuous evaluations from the MSP-Conversation corpus with the sentence-level annotations of the MSP-Podcast corpus for the speech segments that overlap between the two corpora.
引用
收藏
页码:1823 / 1827
页数:5
相关论文
共 50 条
  • [21] A CORPUS OF ENGLISH CONVERSATION - SVARTVIK,J, QUIRK,R
    WESTNEY, P
    IRAL-INTERNATIONAL REVIEW OF APPLIED LINGUISTICS IN LANGUAGE TEACHING, 1983, 21 (04): : 336 - 338
  • [22] MSP-IMPROV: An Acted Corpus of Dyadic Interactions to Study Emotion Perception
    Busso, Carlos
    Parthasarathy, Srinivas
    Burmania, Alec
    AbdelWahab, Mohammed
    Sadoughi, Najmeh
    Provost, Emily Mower
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2017, 8 (01) : 67 - 80
  • [23] Ethical and legal aspects in the construction of an oral corpus of conflictual conversation
    Guerrero, A. N. D. R. E. A. CARCELeN
    CULTURA LENGUAJE Y REPRESENTACION-REVISTA DE ESTUDIOS CULTURALES DE LA UNIVERSITAT JAUME I, 2024, 35 : 37 - 51
  • [24] Conversation in Context. A Corpus-driven Approach.
    Karkkainen, Elise
    FUNCTIONS OF LANGUAGE, 2010, 17 (01) : 113 - 125
  • [25] Japanese conversation corpus for training and evaluation of backchannel prediction model
    Noguchi, Hiroaki
    Katagiri, Yasuhiro
    Den, Yasuharu
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 4429 - 4433
  • [26] SLIPS OF THE TONGUE IN THE LONDON-LUND CORPUS OF SPONTANEOUS CONVERSATION
    GARNHAM, A
    SHILLCOCK, RC
    BROWN, GDA
    MILL, AID
    CUTLER, A
    LINGUISTICS, 1981, 19 (7-8) : 805 - 817
  • [27] MSP-GEO Corpus: A Multimodal Database for Understanding Video-Learning Experience
    Salman, Ali N.
    Wang, Ning
    Martinez-Lucas, Luz
    Vidal, Andrea
    Busso, Carlos
    PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2024, 2024, : 488 - 497
  • [28] Working Together: Contributions of Corpus Analyses and Experimental Psycholinguistics to Understanding Conversation
    Meyer, Antje S.
    Alday, Phillip M.
    Decuyper, Caitlin
    Knudsen, Birgit
    FRONTIERS IN PSYCHOLOGY, 2018, 9
  • [29] The CANDOR corpus: Insights from a large multimodal dataset of naturalistic conversation
    Reece, Andrew
    Cooney, Gus
    Bull, Peter
    Chung, Christine
    Dawson, Bryn
    Fitzpatrick, Casey
    Glazer, Tamara
    Knox, Dean
    Liebscher, Alex
    Marin, Sebastian
    SCIENCE ADVANCES, 2023, 9 (13):
  • [30] From the conversation to the corpus, or how to collect and archive spoken language data
    Gocol, Damian
    Zasko-Zielinska, Monika
    Majewska-Tworek, Anna
    Sleziak, Marta
    Tworek, Artur
    WROCLAWSKI ROCZNIK HISTORII MOWIONEJ, 2024, 14 : 306 - 311