Pitch Contour Modelling and Modification for Expressive Marathi Speech Synthesis

被引:0
|
作者
Deo, Rohit S. [1 ]
Deshpande, Pallavi S. [2 ]
机构
[1] SKN Coll Engn, Dept E&TC, Pune, Maharashtra, India
[2] BVDU Coll Engn, Dept E&TC, Pune, Maharashtra, India
关键词
Text-to-speech; Expressive Speech Synthesis; Prosody;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we have measured and analyzed features of speech signal such as fundamental frequency, jitter and shimmer its statistical modeling for Marathi. These models can be used for modifying prosody of the neutral speech further. Jitter and shimmer are measures of cycle-to-cycle variations of fundamental frequency and amplitude respectively. It characterizes the emotion and differs in values as emotion varies. An emotion or target model mentioned here is in the form of interrogate. A pitch target model is developed to model and modify the prosody of the Marathi words. The study comprises the study of existing pitch contour of words whose prosody is to be modified and target pitch contour. Its statistical analysis is done. At the end Gaussian normalization is employed to modify the prosody with help of analyzed data. Result of the subjective experiments satisfies the native listeners.
引用
收藏
页码:2455 / 2458
页数:4
相关论文
共 50 条
  • [1] Pitch and Duration Modification for Expressive Speech Synthesis in Marathi TTS system
    Anil, Manjare Chandraprabha
    Shirbahadurkar, S. D.
    [J]. 2015 INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING (ICPC), 2015,
  • [2] Expressive Speech Synthesis using Prosodic Modification for Marathi Language
    Anil, Manjare Chandraprabha
    Shirbahadurkar, S. D.
    [J]. 2ND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN) 2015, 2015, : 126 - 130
  • [3] Speech Modification for Prosody Conversion in Expressive Marathi Text-to-Speech Synthesis
    Anil, Manjare Chandraprabha
    Shirbahadurkar, S. D.
    [J]. 2014 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2014, : 56 - 58
  • [4] An Iterated Two-Step Sinusoidal Pitch Contour Formulation for Expressive Speech Synthesis
    Ramli, Izzad
    Jamil, Nursuriati
    Seman, Noraini
    [J]. JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY-MALAYSIA, 2021, 20 (04): : 489 - 510
  • [5] Applying pitch target model to convert F0 contour for expressive Mandarin speech synthesis
    Kang, Yongguo
    Tao, Jianhua
    Xu, Bo
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 733 - 736
  • [6] Prosody modelling of Spanish for expressive speech synthesis
    Iriondo, Ignasi
    Socoro, Joan Claudi
    Alias, Francesc
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 821 - +
  • [7] Voice Quality Modelling for Expressive Speech Synthesis
    Monzo, Carlos
    Iriondo, Ignasi
    Socoro, Joan Claudi
    [J]. SCIENTIFIC WORLD JOURNAL, 2014,
  • [8] Pitch Estimation of Marathi Spoken Numbers in Various Speech Signals
    Nimbhore, S. S.
    Ramteke, G. D.
    Ramteke, R. J.
    [J]. 2013 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2013, : 405 - 409
  • [9] Synthesis of unseen context and spectral and pitch contour smoothing in concatenated text to speech synthesis
    Low, PH
    Vaseghi, S
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 469 - 472
  • [10] An HMM Based Pitch-Contour Generation Method for Mandarin Speech Synthesis
    Gu, Hung-Yan
    Yang, Chung-Chieh
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2011, 27 (05) : 1561 - 1580