PROSODIC ANALYSIS AND MODELLING FOR MALAY EMOTIONAL SPEECH SYNTHESIS

被引:0
|
作者
Mustafa, Mumtaz B. [1 ,3 ]
Ainon, Raja N. [1 ,3 ]
Zainuddin, Roziati [1 ,3 ]
Don, Zuraidah M. [2 ,3 ]
Knowles, Gerry [3 ,4 ]
Mokhtar, Salimah [1 ,3 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Kuala Lumpur, Malaysia
[2] Univ Malaya, Fac Language & Linguist, Kuala Lumpur, Malaysia
[3] Univ Malaya, ICT Res Cluster, Computat Speech Grp, Kuala Lumpur, Malaysia
[4] Lingenium Sdn Bhd, Kuala Lumpur, Malaysia
关键词
Emotional speech re-synthesis; Prosody conversion; Rule-based approach; MBROLA;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper discusses an emotional prosody generator for a Malay speech synthesis system that can re-synthesize the selected vocal emotion from neutral synthesized speech output and improve the naturalness by adopting rule-based prosody conversion techniques. The role of prosodic features in emotional expression, particularly fundamental frequency and duration, has been widely investigated in several research projects. This project attempts to improve the naturalness of the synthesized emotional Malay speech by establishing an effective mechanism for the re-synthesis of emotion. Such a mechanism is created by analyzing the variation in the F0 contour of continuous emotional Malay speech against a fixed time period. The emotional prosodic generator for Malay developed in the course of this research makes use of principles of parametric prosody manipulation to synthesize four basic emotions, namely happiness, anger, sadness and fear. Subjective evaluation by means of listening tests was conducted to validate the ability of the emotions generator to generate the necessary prosody to synthesize emotional expression. The evaluation results show an overall recognition rate of between 61% and 85%.
引用
收藏
页码:102 / 110
页数:9
相关论文
共 50 条
  • [1] Analysis of prosodic features: towards modelling of emotional and pragmatic attributes of speech
    Adell, Jordi
    Bonafonte, Antonio
    Escudero, David
    [J]. PROCESAMIENTO DEL LENGUAJE NATURAL, 2005, (35): : 277 - 283
  • [2] Prosodic Processing for the Automatic Synthesis of Emotional Russian Speech
    Kaliyev, Arman
    Matveev, Yuri N.
    Lyakso, Elena E.
    Rybin, Sergey V.
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE QUALITY MANAGEMENT, TRANSPORT AND INFORMATION SECURITY, INFORMATION TECHNOLOGIES (IT&QM&IS), 2018, : 653 - 655
  • [3] Statistical Analysis of Spectral Properties and Prosodic Parameters of Emotional Speech
    Pribil, J.
    Pribilova, A.
    [J]. MEASUREMENT SCIENCE REVIEW, 2009, 9 (04): : 95 - 104
  • [4] RESEARCH ON SYNTHESIS OF SPEECH PARAMETER AND EMOTIONAL SPEECH FOR MALAY LANGUAGE USING LSTM RNN
    Zhai, Jun-Jun
    Wu, Shao-Shuai
    Li, Yi-Bing
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 2, 2018, : 598 - 603
  • [5] THE EMOTIONAL STATE EFFECT OF PROSODIC SPEECH CUES
    BERGMANN, G
    GOLDBECK, T
    SCHERER, KR
    [J]. ZEITSCHRIFT FUR EXPERIMENTELLE UND ANGEWANDTE PSYCHOLOGIE, 1988, 35 (02): : 167 - 200
  • [6] An Analysis of Malay Language Emotional Speech Corpus for Emotion Recognition System
    Apandi, Nurfarihah
    Jamil, Nursuriati
    [J]. 2016 IEEE INDUSTRIAL ELECTRONICS AND APPLICATIONS CONFERENCE (IEACON), 2016, : 225 - 231
  • [7] Integrating Rule and Template-based Approaches for Emotional Malay Speech Synthesis
    Begum, Mumtaz
    Ainon, Raja N.
    Zainuddin, Roziati
    Don, Zuraidah M.
    Knowles, Gerry
    [J]. INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 253 - +
  • [8] Prosodic Breaks on Malay speech corpus: Evaluation of Pitch, Intensity and Duration
    Hanum, Haslizatul Mohamed
    Nasaruddin, Syazwani
    Abu Bakar, Zainab
    [J]. 2016 THIRD INTERNATIONAL CONFERENCE ON INFORMATION RETRIEVAL AND KNOWLEDGE MANAGEMENT (CAMP), 2016, : 43 - 47
  • [9] Exploring Prosodic Features Modelling for Secondary Emotions Needed for Empathetic Speech Synthesis
    James, Jesin
    Balamurali, B. T.
    Watson, Catherine
    Mixdorff, Hansjoerg
    [J]. SENSORS, 2023, 23 (06)
  • [10] Understanding emotional expression using prosodic analysis of natural speech: Refining the methodology
    Cohen, Alex S.
    Hong, S. Lee
    Guevara, Alvaro
    [J]. JOURNAL OF BEHAVIOR THERAPY AND EXPERIMENTAL PSYCHIATRY, 2010, 41 (02) : 150 - 157