Pitch Contour Modelling and Modification for Expressive Marathi Speech Synthesis

被引:0
|
作者
Deo, Rohit S. [1 ]
Deshpande, Pallavi S. [2 ]
机构
[1] SKN Coll Engn, Dept E&TC, Pune, Maharashtra, India
[2] BVDU Coll Engn, Dept E&TC, Pune, Maharashtra, India
关键词
Text-to-speech; Expressive Speech Synthesis; Prosody;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we have measured and analyzed features of speech signal such as fundamental frequency, jitter and shimmer its statistical modeling for Marathi. These models can be used for modifying prosody of the neutral speech further. Jitter and shimmer are measures of cycle-to-cycle variations of fundamental frequency and amplitude respectively. It characterizes the emotion and differs in values as emotion varies. An emotion or target model mentioned here is in the form of interrogate. A pitch target model is developed to model and modify the prosody of the Marathi words. The study comprises the study of existing pitch contour of words whose prosody is to be modified and target pitch contour. Its statistical analysis is done. At the end Gaussian normalization is employed to modify the prosody with help of analyzed data. Result of the subjective experiments satisfies the native listeners.
引用
收藏
页码:2455 / 2458
页数:4
相关论文
共 50 条
  • [41] Modification of Pitch Parameters in Speech Coding for Information Hiding
    Radej, Adrian
    Janicki, Artur
    [J]. TEXT, SPEECH, AND DIALOGUE (TSD 2020), 2020, 12284 : 513 - 523
  • [42] Accurate Pitch Marking for Prosodic Modification of Speech Segments
    Ewender, Thomas
    Pfister, Beat
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 178 - 181
  • [43] On a Pitch Alteration for Speech Synthesis Systems
    JongKuk Kim
    HernSoo Hahn
    Uei-Joong Yoon
    MyungJin Bae
    [J]. Wireless Personal Communications, 2009, 50 : 435 - 446
  • [44] On a Pitch Alteration for Speech Synthesis Systems
    Kim, JongKuk
    Hahn, HernSoo
    Yoon, Uei-Joong
    Bae, MyungJin
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2009, 50 (04) : 435 - 446
  • [45] Introducing pitch modification in residual excited LPC based Tamil text-to-speech synthesis
    Krithiga, MV
    Geetha, TV
    [J]. APPLIED COMPUTING, PROCEEDINGS, 2004, 3285 : 177 - 183
  • [46] Enhanced shape-invariant pitch and time-scale modification for concatenative speech synthesis
    Pollard, MP
    Cheetham, BMG
    Goodyear, CC
    Edgington, MD
    Lowry, A
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1433 - 1436
  • [47] Expressive Speech Synthesis Using Emotion-Specific Speech Inventories
    Zainko, Csaba
    Fek, Mark
    Nemeth, Geza
    [J]. VERBAL AND NONVERBAL FEATURES OF HUMAN-HUMAN AND HUMAN-MACHINE INTERACTIONS, 2008, 5042 : 225 - 234
  • [48] Generating emphatic speech with hidden Markov model for expressive speech synthesis
    Wu, Zhiyong
    Ning, Yishuang
    Zang, Xiao
    Jia, Jia
    Meng, Fanbo
    Meng, Helen
    Cai, Lianhong
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (22) : 9909 - 9925
  • [49] Generating emphatic speech with hidden Markov model for expressive speech synthesis
    Zhiyong Wu
    Yishuang Ning
    Xiao Zang
    Jia Jia
    Fanbo Meng
    Helen Meng
    Lianhong Cai
    [J]. Multimedia Tools and Applications, 2015, 74 : 9909 - 9925
  • [50] Towards Glottal Source Controllability in Expressive Speech Synthesis
    Lorenzo-Trueba, Jaime
    Barra-Chicote, Roberto
    Raitio, Tuomo
    Obin, Nicolas
    Alku, Paavo
    Yamagishi, Junichi
    Montero, Juan M.
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1618 - 1621