Toward detecting emotions in spoken dialogs

Cited by: 541
Authors
Lee, CM [1]
Narayanan, SS
Affiliations
[1] Univ So Calif, Dept Elect Engn, Los Angeles, CA 90089 USA
[2] Univ So Calif, IMSC, Los Angeles, CA 90089 USA
Source
Funding
National Science Foundation (USA);
Keywords
acoustic correlates; dialog systems; emotion recognition; emotional salience; feature selection; information fusion; principal component analysis; spoken language processing;
DOI
10.1109/TSA.2004.838534
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
The importance of automatically recognizing emotions from human speech has grown with the increasing role of spoken language interfaces in human-computer interaction applications. This paper explores the detection of domain-specific emotions using language and discourse information in conjunction with acoustic correlates of emotion in speech signals. The specific focus is on a case study of detecting negative and non-negative emotions using spoken language data obtained from a call center application. Most previous studies in emotion recognition have used only the acoustic information contained in speech. In this paper, a combination of three sources of information (acoustic, lexical, and discourse) is used for emotion recognition. To capture emotion information at the language level, an information-theoretic notion of emotional salience is introduced. Optimization of the acoustic correlates of emotion with respect to classification error was accomplished by investigating different feature sets obtained from feature selection, followed by principal component analysis. Experimental results on our call center data show that the best results are obtained when acoustic and language information are combined. Results show that combining all the information, rather than using only acoustic information, improves emotion classification by 40.7% for males and 36.4% for females (linear discriminant classifier used for acoustic information).
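As an illustration of the "information-theoretic notion of emotional salience" mentioned in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes one plausible formulation: the salience of a word is the sum over emotion classes of P(e|w) * log(P(e|w) / P(e)), estimated from utterance-level counts. The function name `emotional_salience`, the toy utterances, and the label names are hypothetical and chosen only for the example.

```python
import math
from collections import Counter, defaultdict


def emotional_salience(utterances):
    """Return {word: salience} from (words, emotion_label) pairs.

    Assumed formulation (not taken from the record above):
    sal(w) = sum_e P(e|w) * log(P(e|w) / P(e)),
    with probabilities estimated by counting utterances.
    """
    class_counts = Counter()                 # utterances per emotion class
    word_class_counts = defaultdict(Counter)  # utterances containing w, per class
    for words, label in utterances:
        class_counts[label] += 1
        for w in set(words):                 # count each word once per utterance
            word_class_counts[w][label] += 1

    total = sum(class_counts.values())
    salience = {}
    for w, counts in word_class_counts.items():
        n_w = sum(counts.values())
        s = 0.0
        for label, n in counts.items():      # classes with zero count contribute 0
            p_e_given_w = n / n_w
            p_e = class_counts[label] / total
            s += p_e_given_w * math.log(p_e_given_w / p_e)
        salience[w] = s
    return salience


# Hypothetical toy data: two negative and two non-negative utterances.
toy = [
    (["wrong", "bill"], "negative"),
    (["wrong", "again"], "negative"),
    (["thanks", "bill"], "non-negative"),
    (["thanks", "bye"], "non-negative"),
]
scores = emotional_salience(toy)
```

Under this formulation, a word that occurs equally often in both classes (here "bill") receives a salience of zero, while a word that occurs only in one class (here "wrong") receives the maximum value for a two-class balanced corpus, log 2. Words with high salience could then serve as lexical cues to be combined with acoustic features.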
Pages: 293-303
Number of pages: 11
Related papers
50 in total
  • [1] Classifying emotions in human-machine spoken dialogs
    Lee, CM
    Narayanan, SS
    Pieraccini, R
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, : 737 - 740
  • [2] Spoken Dialogs With a Virtual Science Tutor
    Ward, Wayne
    Bolanos, Daniel
    Cole, Ronald
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 758 - 761
  • [3] Negative Emotion Recognition in Spoken Dialogs
    Zhang, Xiaodong
    Wang, Houfeng
    Li, Li
    Zhao, Maoxiang
    Li, Quanzhong
    CHINESE COMPUTATIONAL LINGUISTICS AND NATURAL LANGUAGE PROCESSING BASED ON NATURALLY ANNOTATED BIG DATA (CCL 2015), 2015, 9427 : 103 - 115
  • [4] Spoken Dialogs with Children for Science Learning and Literacy
    Cole, Ron
    Ward, Wayne
    Bolanos, Daniel
    Buchenroth-Martin, Cindy
    Borts, Eric
    TEXT, SPEECH, AND DIALOGUE, TSD 2013, 2013, 8082 : 34 - 34
  • [5] Annotation of Discourse Relations for Conversational Spoken Dialogs
    Tonelli, Sara
    Riccardi, Giuseppe
    Prasad, Rashmi
    Joshi, Aravind
    LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : 2084 - 2090
  • [6] Feature extraction of spoken dialogs for emotion detection
    Ciota, Zygmunt
    2006 8TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-4, 2006, : 700 - 703
  • [7] Emotion detection in task-oriented spoken dialogs
    Devillers, L
    Lamel, L
    Vasilescu, I
    2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS, 2003, : 549 - 552
  • [8] Design and Data Collection for Spoken Polish Dialogs Database
    Marasek, Krzysztof
    Gubrynowicz, Ryszard
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 185 - 189
  • [9] Design and data collection for spoken Polish dialogs database
    Department of Multimedia, Polish-Japanese Institute of Information Technology, Koszykowa st., 86, Warsaw
    02-008, Poland
    Proc. Int. Conf. Lang. Resourc. Eval., LREC, (185-189):
  • [10] Dialogs taking into account experience, emotions and personality
    Bosser, Anne-Gwenn
    Levieux, Guillaume
    Sehaba, Karim
    Buendia, Axel
    Corruble, Vincent
    de Fondaumiere, Guillaume
    Gal, Viviane
    Natkin, Stephane
    Sabouret, Nicolas
    ENTERTAINMENT COMPUTING - ICEC 2007, 2007, 4740 : 356 - +