Pitch Contour Modelling and Modification for Expressive Marathi Speech Synthesis

被引：0

作者：

Deo, Rohit S. ^{[1
]}

Deshpande, Pallavi S. ^{[2
]}

机构：

[1] SKN Coll Engn, Dept E&TC, Pune, Maharashtra, India

[2] BVDU Coll Engn, Dept E&TC, Pune, Maharashtra, India

来源：

2014 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI) | 2014年

关键词：

Text-to-speech; Expressive Speech Synthesis; Prosody;

D O I：

暂无

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

In this paper, we have measured and analyzed features of speech signal such as fundamental frequency, jitter and shimmer its statistical modeling for Marathi. These models can be used for modifying prosody of the neutral speech further. Jitter and shimmer are measures of cycle-to-cycle variations of fundamental frequency and amplitude respectively. It characterizes the emotion and differs in values as emotion varies. An emotion or target model mentioned here is in the form of interrogate. A pitch target model is developed to model and modify the prosody of the Marathi words. The study comprises the study of existing pitch contour of words whose prosody is to be modified and target pitch contour. Its statistical analysis is done. At the end Gaussian normalization is employed to modify the prosody with help of analyzed data. Result of the subjective experiments satisfies the native listeners.

引用

页码：2455 / 2458

页数：4

共 50 条

[1] Pitch and Duration Modification for Expressive Speech Synthesis in Marathi TTS system
Anil, Manjare Chandraprabha
Shirbahadurkar, S. D.
[J]. 2015 INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING (ICPC), 2015,
[2] Expressive Speech Synthesis using Prosodic Modification for Marathi Language
Anil, Manjare Chandraprabha
Shirbahadurkar, S. D.
[J]. 2ND INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN) 2015, 2015, : 126 - 130
[3] Speech Modification for Prosody Conversion in Expressive Marathi Text-to-Speech Synthesis
Anil, Manjare Chandraprabha
Shirbahadurkar, S. D.
[J]. 2014 INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2014, : 56 - 58
[4] An Iterated Two-Step Sinusoidal Pitch Contour Formulation for Expressive Speech Synthesis
Ramli, Izzad
Jamil, Nursuriati
Seman, Noraini
[J]. JOURNAL OF INFORMATION AND COMMUNICATION TECHNOLOGY-MALAYSIA, 2021, 20 (04): : 489 - 510
[5] Applying pitch target model to convert F0 contour for expressive Mandarin speech synthesis
Kang, Yongguo
Tao, Jianhua
Xu, Bo
[J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 733 - 736
[6] Prosody modelling of Spanish for expressive speech synthesis
Iriondo, Ignasi
Socoro, Joan Claudi
Alias, Francesc
[J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 821 - +
[7] Voice Quality Modelling for Expressive Speech Synthesis
Monzo, Carlos
Iriondo, Ignasi
Socoro, Joan Claudi
[J]. SCIENTIFIC WORLD JOURNAL, 2014,
[8] Pitch Estimation of Marathi Spoken Numbers in Various Speech Signals
Nimbhore, S. S.
Ramteke, G. D.
Ramteke, R. J.
[J]. 2013 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2013, : 405 - 409
[9] Synthesis of unseen context and spectral and pitch contour smoothing in concatenated text to speech synthesis
Low, PH
Vaseghi, S
[J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 469 - 472
[10] An HMM Based Pitch-Contour Generation Method for Mandarin Speech Synthesis
Gu, Hung-Yan
Yang, Chung-Chieh
[J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2011, 27 (05) : 1561 - 1580

← 1 2 3 4 5 →