Speech Emotion Recognition Cross Language Families: Mandarin vs. Western Languages

Cited by: 0
Authors
Xiao, Zhongzhe [1]
Wu, Di [1]
Zhang, Xiaojun [1]
Tao, Zhi [1]
Affiliations
[1] Soochow Univ, Coll Phys Optoelect & Energy, Suzhou, Peoples R China
Keywords
emotional speech; cross-language; Mandarin; recognition
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper investigates the classification of emotional speech across different language families. Datasets in three languages are analyzed: CDESD in Mandarin, Emo-DB in German, and DES in Danish. With 2-D classification in the arousal-appraisal space, better recognition performance is observed in the arousal dimension than in the appraisal dimension. The classification rates in cross-language-family tests between CDESD and Emo-DB or DES are far higher than chance level, which shows that there are universal mechanisms in human vocal emotion that are independent of language. Results for tests within the same language family, between Emo-DB and DES, are even better than those in cross-language-family tests with CDESD in Mandarin, which shows that language and culture also influence the way emotion is expressed in speech. The best classification rate in the cross-language-family test, 71.62%, is achieved on male speech samples when CDESD is used as the training set and Emo-DB as the testing set.
Pages: 253-257
Page count: 5
Related papers
50 in total
  • [21] Speech-based Emotion Recognition and Speaker Identification: Static vs. Dynamic Mode of Speech Representation
    Sidorov, Maxim
    Minker, Wolfgang
    Semenkin, Eugene S.
    JOURNAL OF SIBERIAN FEDERAL UNIVERSITY-MATHEMATICS & PHYSICS, 2016, 9 (04): : 518 - 523
  • [22] Noisy Speech Emotion Recognition in Romanian Language
    Feraru, S. M.
    Zbancioc, M. D.
    2019 INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS (ISSCS 2019), 2019,
  • [23] Integrating Language and Emotion Features for Multilingual Speech Emotion Recognition
    Heracleous, Panikos
    Mohammad, Yasser
    Yoneyama, Akio
    HUMAN-COMPUTER INTERACTION. MULTIMODAL AND NATURAL INTERACTION, HCI 2020, PT II, 2020, 12182 : 187 - 196
  • [24] CROSS-LINGUAL SPEECH RECOGNITION BETWEEN LANGUAGES FROM THE SAME LANGUAGE FAMILY
    Zgank, Andrej
    PROCEEDINGS OF THE ROMANIAN ACADEMY SERIES A-MATHEMATICS PHYSICS TECHNICAL SCIENCES INFORMATION SCIENCE, 2019, 20 (02): : 184 - 191
  • [25] MANDARIN AUDIO-VISUAL SPEECH RECOGNITION WITH EFFECTS TO THE NOISE AND EMOTION
    Pao, Tsang-Long
    Liao, Wen-Yuan
    Chen, Yu-Te
    Wu, Tsan-Nung
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2010, 6 (02): : 711 - 723
  • [26] Comparison of several classifiers for emotion recognition from noisy mandarin speech
    Pao, Tsang-Long
    Liao, Wen-Yuan
    Chen, Yu-Te
    Yeh, Jun-Heng
    Cheng, Yun-Maw
    Chien, Charles S.
    2007 THIRD INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING, VOL 1, PROCEEDINGS, 2007, : 23 - +
  • [27] Emotion Recognition from Noisy Mandarin Speech Preprocessed by Compressed Sensing
    Jiang, Xiaoqing
    He, Dapeng
    Yang, Xinghai
    Wang, Lingyin
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2017, PT II, 2017, 10362 : 626 - 636
  • [28] Mandarin speech emotion recognition based on high dimensional geometry theory
    Cao Wenming
    He Tiancheng
    CHINESE JOURNAL OF ELECTRONICS, 2006, 15 (4A): : 818 - 821
  • [29] Gender-Aware Speech Emotion Recognition in Multiple Languages
    Nicolini, Marco
    Ntalampiras, Stavros
    PATTERN RECOGNITION APPLICATIONS AND METHODS, ICPRAM 2023, 2024, 14547 : 111 - 123
  • [30] Syllable language models for Mandarin speech recognition: Exploiting character language models
    Liu, Xunying
    Hieronymus, James L.
    Gales, Mark J. F.
    Woodland, Philip C.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 133 (01): : 519 - 528