Articulatory-to-Acoustic Conversion of Mandarin Emotional Speech Based on PSO-LSSVM

Cited by: 2

Authors:

Ren, Guofeng ^{[1]}

Fu, Jianmei ^{[1]}

Shao, Guicheng ^{[1]}

Xun, Yanqin ^{[1]}

Affiliation:

[1] Xinzhou Teachers Univ, Dept Elect, Xinzhou 034000, Peoples R China

Source:

COMPLEXITY | 2021 / Vol. 2021

Keywords:

Emotion Recognition;

DOI:

10.1155/2021/8876005

Chinese Library Classification (CLC) number:

O1 [Mathematics];

Discipline classification code:

0701 ; 070101 ;

Abstract:

The production of emotional speech is determined by the movement of the speaker's tongue, lips, and jaw. To combine speakers' articulatory and acoustic data, this paper studies articulatory-to-acoustic conversion of emotional speech. The parameters of a least squares support vector machine (LSSVM) model are optimized with particle swarm optimization (PSO), and the optimized PSO-LSSVM model is applied to the articulatory-to-acoustic conversion. The root mean square error (RMSE) and mean Mel-cepstral distortion (MMCD) are used to evaluate the conversion results: the MMCD of the MFCCs is 1.508 dB, and the RMSE of the second formant (F2) is 25.10 Hz. These results can be further applied to feature fusion in emotional speech recognition to improve the accuracy of emotion recognition.
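As an illustration of the two evaluation metrics named in the abstract, the following is a minimal sketch, not the authors' implementation. It assumes the commonly used frame-wise definition of Mel-cepstral distortion, (10 / ln 10) * sqrt(2 * sum of squared coefficient differences), averaged over frames; the paper's exact formula, coefficient range, and frame alignment may differ. The function names `rmse` and `mean_mel_cepstral_distortion` and the sample values are hypothetical.

```python
import numpy as np


def rmse(reference, predicted):
    """Root mean square error between two equally shaped sequences (e.g. F2 in Hz)."""
    reference = np.asarray(reference, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    return float(np.sqrt(np.mean((reference - predicted) ** 2)))


def mean_mel_cepstral_distortion(ref_cepstra, pred_cepstra):
    """Mean Mel-cepstral distortion in dB over frames.

    Both inputs are (frames, coefficients) arrays that are assumed to be
    time-aligned already. This uses the common definition of MCD; it is an
    assumption that the paper's MMCD follows the same formula.
    """
    ref = np.asarray(ref_cepstra, dtype=float)
    pred = np.asarray(pred_cepstra, dtype=float)
    squared_diff = np.sum((ref - pred) ** 2, axis=1)  # one value per frame
    per_frame_db = (10.0 / np.log(10.0)) * np.sqrt(2.0 * squared_diff)
    return float(np.mean(per_frame_db))


# Made-up values, for illustration only.
f2_true = [1500.0, 1620.0]
f2_pred = [1503.0, 1616.0]
print(rmse(f2_true, f2_pred))

cep_true = np.array([[1.0, 0.5], [0.2, 0.1]])
cep_pred = np.array([[0.0, 0.5], [0.2, 0.1]])
print(mean_mel_cepstral_distortion(cep_true, cep_pred))
```

Lower values of both metrics indicate that the converted acoustic features are closer to the measured ones.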


Pages: 10

50 entries in total

[1] Ultrasound-based Articulatory-to-Acoustic Mapping with WaveGlow Speech Synthesis
Csapo, Tamas Gabor
Zainko, Csaba
Toth, Laszlo
Gosztolya, Gabor
Marko, Alexandra
INTERSPEECH 2020, 2020: 2727-2731
[2] A stochastic articulatory-to-acoustic mapping as a basis for speech recognition
Hogden, J
Valdez, P
IMTC/2001: PROCEEDINGS OF THE 18TH IEEE INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE, VOLS 1-3: REDISCOVERING MEASUREMENT IN THE AGE OF INFORMATICS, 2001: 1105-1110
[3] Autoencoder-Based Articulatory-to-Acoustic Mapping for Ultrasound Silent Speech Interfaces
Gosztolya, Gabor
Pinter, Adam
Toth, Laszlo
Grosz, Tamas
Marko, Alexandra
Csapo, Tamas Gabor
2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019
[4] Articulatory-Acoustic Analyses of Mandarin Words in Emotional Context Speech for Smart Campus
Ren, Guofeng
Zhang, Xueying
Duan, Shufei
IEEE ACCESS, 2018, 6: 48418-48427
[5] Non-Parallel Articulatory-to-Acoustic Conversion Using Multiview-Based Time Warping
Gonzalez-Lopez, Jose A.
Gomez-Alanis, Alejandro
Perez-Cordoba, Jose L.
Green, Phil D.
APPLIED SCIENCES-BASEL, 2022, 12 (03)
[6] Non-linear AVO inversion based on PSO-LSSVM
Xie W.
Wang Y.
Liu J.
Su J.
Mao Q.
He R.
Shiyou Diqiu Wuli Kantan/Oil Geophysical Prospecting, 2016, 51 (06): 1187-1194
[7] Research based on PSO-LSSVM Node Positioning in Wireless Network
Li, Xinliang
Luo, Gexi
INTERNATIONAL JOURNAL OF FUTURE GENERATION COMMUNICATION AND NETWORKING, 2016, 9 (05): 287-294
[8] Continuous Articulatory-to-Acoustic Mapping using Phone-based Trajectory HMM for a Silent Speech Interface
Hueber, Thomas
Bailly, Gerard
Denby, Bruce
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012: 722-725
[9] Forecasting of slope displacement based on PSO-LSSVM with mixed kernel
Zheng Zhi-cheng
Xu Wei-ya
Xu Fei
Liu Zao-bao
ROCK AND SOIL MECHANICS, 2012, 33 (05): 1421-1426
[10] Articulatory-to-acoustic conversion using BLSTM-RNNs with augmented input representation
Liu, Zheng-Chen
Ling, Zhen-Hua
Dai, Li-Rong
SPEECH COMMUNICATION, 2018, 99: 161-172
