Toward detecting emotions in spoken dialogs

Cited by: 541
Authors
Lee, CM [1]
Narayanan, SS
Affiliations
[1] Univ So Calif, Dept Elect Engn, Los Angeles, CA 90089 USA
[2] Univ So Calif, IMSC, Los Angeles, CA 90089 USA
Source
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 2005 / Vol. 13 / No. 2
Funding
National Science Foundation (USA);
Keywords
acoustic correlates; dialog systems; emotion recognition; emotional salience; feature selection; information fusion; principal component analysis; spoken language processing;
DOI
10.1109/TSA.2004.838534
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
The importance of automatically recognizing emotions from human speech has grown with the increasing role of spoken language interfaces in human-computer interaction applications. This paper explores the detection of domain-specific emotions using language and discourse information in conjunction with acoustic correlates of emotion in speech signals. The specific focus is on a case study of detecting negative and non-negative emotions using spoken language data obtained from a call center application. Most previous studies in emotion recognition have used only the acoustic information contained in speech. In this paper, a combination of three sources of information (acoustic, lexical, and discourse) is used for emotion recognition. To capture emotion information at the language level, an information-theoretic notion of emotional salience is introduced. Optimization of the acoustic correlates of emotion with respect to classification error was accomplished by investigating different feature sets obtained from feature selection, followed by principal component analysis. Experimental results on our call center data show that the best results are obtained when acoustic and language information are combined. Results show that combining all the information, rather than using only acoustic information, improves emotion classification by 40.7% for males and 36.4% for females (linear discriminant classifier used for acoustic information).
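The abstract names an information-theoretic notion of emotional salience but this record gives no formula. As a rough illustration only, the sketch below assumes that a word's salience is the Kullback-Leibler divergence between the emotion distribution given the word, P(e|w), and the prior emotion distribution, P(e), both estimated from utterance counts. The function name `emotional_salience`, the input layout, and the toy utterances are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter, defaultdict


def emotional_salience(utterances):
    """Return {word: salience} for a labeled corpus.

    utterances: list of (list_of_words, emotion_label) pairs.
    Assumed definition (not stated in this record):
        sal(w) = sum_e P(e|w) * log(P(e|w) / P(e))
    """
    # Prior P(e), estimated from utterance-level label counts.
    class_counts = Counter(label for _, label in utterances)
    total = sum(class_counts.values())
    prior = {e: c / total for e, c in class_counts.items()}

    # Count, per word, how many utterances of each class contain it.
    word_class = defaultdict(Counter)
    for words, label in utterances:
        for w in set(words):
            word_class[w][label] += 1

    salience = {}
    for w, counts in word_class.items():
        n_w = sum(counts.values())
        s = 0.0
        for e, c in counts.items():
            p = c / n_w  # P(e|w); classes with zero count contribute 0
            s += p * math.log(p / prior[e])
        salience[w] = s
    return salience


# Toy data, invented for illustration only.
toy = [
    (["wrong", "bag"], "negative"),
    (["thanks", "bag"], "non-negative"),
    (["wrong"], "negative"),
    (["thanks"], "non-negative"),
]
sal = emotional_salience(toy)
```

Under this assumed definition, a word seen equally often in both classes (here "bag") scores zero, while a word seen in only one class (here "wrong") scores highest. A practical version would likely need count smoothing and a minimum-frequency cutoff, which the sketch omits.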
Pages: 293 - 303
Page count: 11
Related papers
50 in total
  • [21] NEGATIVE EMOTIONS DETECTION AS AN INDICATOR OF DIALOGS QUALITY IN CALL CENTERS
    Vaudable, Christophe
    Devillers, Laurence
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5109 - 5112
  • [22] A Slot Is Not Built in One Utterance: Spoken Language Dialogs with Sub-Slots
    Zhang, Sai
    Hu, Yuwei
    Wu, Yuchuan
    Wu, Jiaman
    Li, Yongbin
    Sun, Jian
    Yuan, Caixia
    Wang, Xiaojie
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 309 - 321
  • [23] AN INTERACTION-AWARE ATTENTION NETWORK FOR SPEECH EMOTION RECOGNITION IN SPOKEN DIALOGS
    Yeh, Sung-Lin
    Lin, Yun-Shao
    Lee, Chi-Chun
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6685 - 6689
  • [24] DETECTING LEADERSHIP AND COHESION IN SPOKEN INTERACTIONS
    Wang, Wen
    Precoda, Kristin
    Hadsell, Raia
    Kira, Zsolt
    Richey, Colleen
    Jiva, Gabriel
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 5105 - 5108
  • [25] Detecting Sensitive Content in Spoken Language
    Tripathi, Rahul
    Dhamodharaswamy, Balaji
    Jagannathan, Srinivasan
    Nandi, Abhishek
    2019 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2019), 2019, : 374 - 381
  • [26] Detecting Emotions behind the Screen
    Alkaabi, Najla
    Zaki, Nazar
    Ismail, Heba
    Khan, Manzoor
    AI, 2022, 3 (04) : 948 - 960
  • [27] Detecting Emotions in Comments on Forums
    Gifu, D.
    Cioca, M.
    INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2014, 9 (06) : 694 - 702
  • [28] Attitudes Toward Emotions
    Harmon-Jones, Eddie
    Harmon-Jones, Cindy
    Amodio, David M.
    Gable, Philip A.
    JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 2011, 101 (06) : 1332 - 1350
  • [29] Toward a rationality of emotions
    Radden, J
    MIND, 1999, 108 (429) : 203 - 206
  • [30] An emotion space model for recognition of emotions in spoken Chinese
    Jin, XC
    Wang, ZF
    AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 3784 : 397 - 402