Speech Emotion Recognition Cross Language Families: Mandarin vs. Western Languages

被引:0
|
作者
Xiao, Zhongzhe [1 ]
Wu, Di [1 ]
Zhang, Xiaojun [1 ]
Tao, Zhi [1 ]
机构
[1] Soochow Univ, Coll Phys Optoelect & Energy, Suzhou, Peoples R China
关键词
emotional speech; cross-language; Mandarin; recognition;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
An investigation on classification of emotional speech cross different language families is proposed in this paper. Datasets on three languages, CDESD in Mandarin, Emo-DB in German, and DES in Danish are analyzed. With 2-D classifications on arousal-appraisal space, better recognition performances are observed in arousal dimension than in appraisal dimension. The classification rates in cross language family test between CDESD and Emo-DB or DES are far higher than chance level, shows that there exist universal mechanisms in human voice emotion independent on languages. Results in test within the same language family between Emo-DB and DES are even better than in cross language family test with CDESD in Mandarin, shows the language and culture also influence the way of expression in speech. The best classification rate in the cross language family test is achieved on male speech samples as 71.62%, when CDESD dataset is used as training set and Emo-DB as testing set.
引用
收藏
页码:253 / 257
页数:5
相关论文
共 50 条
  • [31] Syllable language models for Mandarin speech recognition: Exploiting character language models
    Liu, X. (xl207@eng.cam.ac.uk), 1600, Acoustical Society of America (133):
  • [32] Say Cheese vs. Smile: Reducing Speech-Related Variability for Facial Emotion Recognition
    Kim, Yelin
    Provost, Emily Mower
    PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 27 - 36
  • [33] Model Comparison in Speech Emotion Recognition for Indonesian Language
    Rumagit, Reinert Yosua
    Alexander, Glenn
    Saputra, Irfan Fahmi
    5TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL INTELLIGENCE 2020, 2021, 179 : 789 - 797
  • [34] Enhancing multilingual recognition of emotion in speech by language identification
    Sagha, Hesam
    Matejka, Pavel
    Gavryukova, Maryna
    Povolny, Filip
    Marchi, Erik
    Schuller, Bjoern
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2949 - 2953
  • [35] Hybrid Dataset for Speech Emotion Recognition in Russian Language
    Kondratenko, Vladimir
    Sokolov, Artem
    Karpov, Nikolay
    Kutuzov, Oleg
    Savushkin, Nikita
    Minkin, Fyodor
    INTERSPEECH 2023, 2023, : 4548 - 4552
  • [36] Toward Language-Agnostic Speech Emotion Recognition
    Ntalampiras, Stavros
    JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2020, 68 (1-2): : 7 - 13
  • [37] BanglaSER: A speech emotion recognition dataset for the Bangla language
    Das, Rakesh Kumar
    Islam, Nahidul
    Ahmed, Md. Rayhan
    Islam, Salekul
    Shatabda, Swakkhar
    Islam, A. K. M. Muzahidul
    DATA IN BRIEF, 2022, 42
  • [38] Searle vs. Searle on language, speech, and thought
    Goldberg, Sandy
    Yang, Guiming
    PRAGMATICS & COGNITION, 2014, 22 (03) : 352 - 372
  • [39] Writing vs. Speech in Foreign Language Teaching
    卢允中
    外国语(上海外国语学院学报), 1985, (03)
  • [40] Segment-based emotion recognition from continuous Mandarin Chinese speech
    Yeh, Jun-Heng
    Pao, Tsang-Long
    Lin, Ching-Yi
    Tsai, Yao-Wei
    Chen, Yu-Te
    COMPUTERS IN HUMAN BEHAVIOR, 2011, 27 (05) : 1545 - 1552