Articulatory-to-Acoustic Conversion of Mandarin Emotional Speech Based on PSO-LSSVM

Cited by: 2
Authors
Ren, Guofeng [1]
Fu, Jianmei [1]
Shao, Guicheng [1]
Xun, Yanqin [1]
Affiliations
[1] Xinzhou Teachers Univ, Dept Elect, Xinzhou 034000, Peoples R China
Keywords
Emotion Recognition;
DOI
10.1155/2021/8876005
Chinese Library Classification
O1 [Mathematics];
Subject classification codes
0701 ; 070101 ;
Abstract
The production of emotional speech is determined by the movement of the speaker's tongue, lips, and jaw. To combine speakers' articulatory and acoustic data, this paper studies articulatory-to-acoustic conversion of emotional speech. The parameters of a least squares support vector machine (LSSVM) model are optimized with particle swarm optimization (PSO), and the optimized PSO-LSSVM model is applied to the articulatory-to-acoustic conversion. The root mean square error (RMSE) and mean Mel-cepstral distortion (MMCD) are used to evaluate the conversion results: the MMCD of the MFCCs is 1.508 dB, and the RMSE of the second formant (F2) is 25.10 Hz. These results can be further applied to feature fusion in emotional speech recognition to improve the accuracy of emotion recognition.
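For readers unfamiliar with the two evaluation metrics named in the abstract, the following is a minimal sketch of how RMSE and a mean Mel-cepstral distortion are commonly computed. It is not taken from the paper: the function names are hypothetical, and the MCD formula used here is the widely used definition, (10 / ln 10) * sqrt(2 * sum of squared coefficient differences), averaged over frames. The paper's exact definition (for example, whether the energy coefficient is excluded) may differ.

```python
import math


def rmse(predicted, reference):
    """Root mean square error between two equal-length sequences,
    e.g. predicted and measured F2 values in Hz."""
    if len(predicted) != len(reference) or not predicted:
        raise ValueError("sequences must be non-empty and of equal length")
    squared = [(p - r) ** 2 for p, r in zip(predicted, reference)]
    return math.sqrt(sum(squared) / len(squared))


def mean_mel_cepstral_distortion(predicted_frames, reference_frames):
    """Mean Mel-cepstral distortion in dB over all frames.

    Each frame is a sequence of cepstral coefficients. Assumption: the
    caller has already dropped any coefficient (such as the energy term)
    that should not contribute to the distortion.
    """
    if len(predicted_frames) != len(reference_frames) or not predicted_frames:
        raise ValueError("frame lists must be non-empty and of equal length")
    scale = 10.0 / math.log(10.0)
    total = 0.0
    for pred, ref in zip(predicted_frames, reference_frames):
        squared_sum = sum((p - r) ** 2 for p, r in zip(pred, ref))
        total += scale * math.sqrt(2.0 * squared_sum)
    return total / len(predicted_frames)
```

Under these assumptions, `rmse` would be applied per formant track and `mean_mel_cepstral_distortion` to time-aligned MFCC frames from the converted and the recorded speech.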
Pages: 10
Related papers
50 in total
  • [1] Ultrasound-based Articulatory-to-Acoustic Mapping with WaveGlow Speech Synthesis
    Csapo, Tamas Gabor
    Zainko, Csaba
    Toth, Laszlo
    Gosztolya, Gabor
    Marko, Alexandra
    INTERSPEECH 2020, 2020 : 2727 - 2731
  • [2] A stochastic articulatory-to-acoustic mapping as a basis for speech recognition
    Hogden, J
    Valdez, P
    IMTC/2001: PROCEEDINGS OF THE 18TH IEEE INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE, VOLS 1-3: REDISCOVERING MEASUREMENT IN THE AGE OF INFORMATICS, 2001 : 1105 - 1110
  • [3] Autoencoder-Based Articulatory-to-Acoustic Mapping for Ultrasound Silent Speech Interfaces
    Gosztolya, Gabor
    Pinter, Adam
    Toth, Laszlo
    Grosz, Tamas
    Marko, Alexandra
    Csapo, Tamas Gabor
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019
  • [4] Articulatory-Acoustic Analyses of Mandarin Words in Emotional Context Speech for Smart Campus
    Ren, Guofeng
    Zhang, Xueying
    Duan, Shufei
    IEEE ACCESS, 2018, 6 : 48418 - 48427
  • [5] Non-Parallel Articulatory-to-Acoustic Conversion Using Multiview-Based Time Warping
    Gonzalez-Lopez, Jose A.
    Gomez-Alanis, Alejandro
    Perez-Cordoba, Jose L.
    Green, Phil D.
    APPLIED SCIENCES-BASEL, 2022, 12 (03)
  • [6] Non-linear AVO inversion based on PSO-LSSVM
    Xie W.
    Wang Y.
    Liu J.
    Su J.
    Mao Q.
    He R.
    Shiyou Diqiu Wuli Kantan/Oil Geophysical Prospecting, 2016, 51 (06) : 1187 - 1194
  • [7] Research based on PSO-LSSVM Node Positioning in Wireless Network
    Li, Xinliang
    Luo, Gexi
    INTERNATIONAL JOURNAL OF FUTURE GENERATION COMMUNICATION AND NETWORKING, 2016, 9 (05) : 287 - 294
  • [8] Continuous Articulatory-to-Acoustic Mapping using Phone-based Trajectory HMM for a Silent Speech Interface
    Hueber, Thomas
    Bailly, Gerard
    Denby, Bruce
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012 : 722 - 725
  • [9] Forecasting of slope displacement based on PSO-LSSVM with mixed kernel
    Zheng Zhi-cheng
    Xu Wei-ya
    Xu Fei
    Liu Zao-bao
    ROCK AND SOIL MECHANICS, 2012, 33 (05) : 1421 - 1426
  • [10] Articulatory-to-acoustic conversion using BLSTM-RNNs with augmented input representation
    Liu, Zheng-Chen
    Ling, Zhen-Hua
    Dai, Li-Rong
    SPEECH COMMUNICATION, 2018, 99 : 161 - 172