Speech Emotion Recognition Cross Language Families: Mandarin vs. Western Languages

Cited by: 0

Authors:

Xiao, Zhongzhe [1]

Wu, Di [1]

Zhang, Xiaojun [1]

Tao, Zhi [1]

Affiliations:

[1] Soochow Univ, Coll Phys Optoelect & Energy, Suzhou, Peoples R China

Source:

PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON PROGRESS IN INFORMATICS AND COMPUTING (PIC), VOL 1 | 2016

Keywords:

emotional speech; cross-language; Mandarin; recognition;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

This paper investigates the classification of emotional speech across different language families. Datasets in three languages are analyzed: CDESD in Mandarin, Emo-DB in German, and DES in Danish. With 2-D classification in the arousal-appraisal space, better recognition performance is observed in the arousal dimension than in the appraisal dimension. The classification rates in the cross-language-family tests between CDESD and Emo-DB or DES are far higher than chance level, which shows that there exist universal mechanisms in human vocal emotion that are independent of language. Results in tests within the same language family, between Emo-DB and DES, are even better than in the cross-language-family tests with CDESD in Mandarin, which shows that language and culture also influence the way emotion is expressed in speech. The best classification rate in the cross-language-family tests, 71.62%, is achieved on male speech samples when CDESD is used as the training set and Emo-DB as the testing set.

Pages: 253-257

Page count: 5
