Pitch Contour Modelling and Modification for Expressive Marathi Speech Synthesis

被引：0

作者：

Deo, Rohit S. ^{[1
]}

Deshpande, Pallavi S. ^{[2
]}

机构：

[1] SKN Coll Engn, Dept E&TC, Pune, Maharashtra, India

[2] BVDU Coll Engn, Dept E&TC, Pune, Maharashtra, India

来源：

2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI) | 2014年

关键词：

Text-to-speech; Expressive Speech Synthesis; Prosody;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In this paper, we have measured and analyzed features of speech signal such as fundamental frequency, jitter and shimmer its statistical modeling for Marathi. These models can be used for modifying prosody of the neutral speech further. Jitter and shimmer are measures of cycle-to-cycle variations of fundamental frequency and amplitude respectively. It characterizes the emotion and differs in values as emotion varies. An emotion or target model mentioned here is in the form of interrogate. A pitch target model is developed to model and modify the prosody of the Marathi words. The study comprises the study of existing pitch contour of words whose prosody is to be modified and target pitch contour. Its statistical analysis is done. At the end Gaussian normalization is employed to modify the prosody with help of analyzed data. Result of the subjective experiments satisfies the native listeners.

引用

页码：2455 / 2458

页数：4

共 50 条

[41] Modification of Pitch Parameters in Speech Coding for Information Hiding
Radej, Adrian
Janicki, Artur
[J]. TEXT, SPEECH, AND DIALOGUE (TSD 2020), 2020, 12284 : 513 - 523
[42] Accurate Pitch Marking for Prosodic Modification of Speech Segments
Ewender, Thomas
Pfister, Beat
[J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 178 - 181
[43] On a Pitch Alteration for Speech Synthesis Systems
JongKuk Kim
HernSoo Hahn
Uei-Joong Yoon
MyungJin Bae
[J]. Wireless Personal Communications, 2009, 50 : 435 - 446
[44] On a Pitch Alteration for Speech Synthesis Systems
Kim, JongKuk
Hahn, HernSoo
Yoon, Uei-Joong
Bae, MyungJin
[J]. WIRELESS PERSONAL COMMUNICATIONS, 2009, 50 (04) : 435 - 446
[45] Introducing pitch modification in residual excited LPC based Tamil text-to-speech synthesis
Krithiga, MV
Geetha, TV
[J]. APPLIED COMPUTING, PROCEEDINGS, 2004, 3285 : 177 - 183
[46] Enhanced shape-invariant pitch and time-scale modification for concatenative speech synthesis
Pollard, MP
Cheetham, BMG
Goodyear, CC
Edgington, MD
Lowry, A
[J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1433 - 1436
[47] Expressive Speech Synthesis Using Emotion-Specific Speech Inventories
Zainko, Csaba
Fek, Mark
Nemeth, Geza
[J]. VERBAL AND NONVERBAL FEATURES OF HUMAN-HUMAN AND HUMAN-MACHINE INTERACTIONS, 2008, 5042 : 225 - 234
[48] Generating emphatic speech with hidden Markov model for expressive speech synthesis
Wu, Zhiyong
Ning, Yishuang
Zang, Xiao
Jia, Jia
Meng, Fanbo
Meng, Helen
Cai, Lianhong
[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (22) : 9909 - 9925
[49] Generating emphatic speech with hidden Markov model for expressive speech synthesis
Zhiyong Wu
Yishuang Ning
Xiao Zang
Jia Jia
Fanbo Meng
Helen Meng
Lianhong Cai
[J]. Multimedia Tools and Applications, 2015, 74 : 9909 - 9925
[50] Towards Glottal Source Controllability in Expressive Speech Synthesis
Lorenzo-Trueba, Jaime
Barra-Chicote, Roberto
Raitio, Tuomo
Obin, Nicolas
Alku, Paavo
Yamagishi, Junichi
Montero, Juan M.
[J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1618 - 1621

← 1 2 3 4 5 →