Improving the Flexibility of Dynamic Prosody Modification Using Instants of Significant Excitation

被引:8
|
作者
Govind, D. [1 ]
Joy, Tinu T. [1 ]
机构
[1] Amrita Vishwa Vidyapeetham Univ, Ctr Computat Engn & Networking, Coimbatore, Tamil Nadu, India
关键词
Dynamic prosody modification; Instants of significant excitation; Objective measure; Pitch markers; Jitter; TIME-SCALE MODIFICATION; SPEECH SIGNALS; EPOCH EXTRACTION;
D O I
10.1007/s00034-015-0159-5
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Modification of suprasegmental features such as pitch and duration of original speech by fixed scaling factors is referred to as static prosody modification. In dynamic prosody modification, the prosodic scaling factors (time-varying modification factors) are defined for all the pitch cycles present in the original speech. The present work is focused on improving the naturalness of the prosody modified speech by reducing the generation of piecewise constant segments in the modified pitch contour. The prosody modification is performed by anchoring around the accurate instants of significant excitation estimated from the original speech. The division of longer pitch intervals into many equal intervals over long speech segments introduces step-like discontinuities in the form of piecewise constant segments in the modified pitch contours. The effectiveness of proposed dynamic modification method is initially confirmed from the smooth modified pitch contour plot obtained for finer static prosody scaling factors, waveforms, spectrogram plots and comparison subjective evaluations. Also, the average jitter computed from the pitch segments of each glottal activity region in the modified speech is proposed as an objective measure for the prosody modification. The naturalness of the prosody modified speech using the proposed method is objectively and subjectively compared with that of the existing zero frequency filtered signal-based dynamic prosody modification. Also, the proposed algorithm effectively preserves the dynamics of the prosodic patterns in singing voices where in the parameters rapidly and continuously fluctuate within a higher range.
引用
收藏
页码:2518 / 2543
页数:26
相关论文
共 50 条
  • [21] Improving the performance of keyword spotting system for children's speech through prosody modification
    Shahnawazuddin, S.
    Maity, Karabi
    Pradhan, Gayadhar
    [J]. DIGITAL SIGNAL PROCESSING, 2019, 86 : 11 - 18
  • [22] Improving human scoring of prosody using parametric speech synthesis
    Prafianto, Hafiyan
    Nose, Takashi
    Chiba, Yuya
    Ito, Akinori
    [J]. SPEECH COMMUNICATION, 2019, 111 (14-21) : 14 - 21
  • [23] Improving Mandarin Prosody Generation Using Alternative Smoothing Techniques
    Huang, Yi-Chin
    Wu, Chung-Hsien
    Weng, Si-Ting
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) : 1897 - 1907
  • [24] Improvement of syllable based TTS system in assamese using prosody modification
    Department of Electronics and Electrical Engineering, Indian Institute of Technology Guwahati, Guwahati
    781039, India
    [J]. IEEE Int. Conf. Electron., Energy, Environ., Commun., Comput., Control: (E3-C3), INDICON, 1600,
  • [25] Improvement of Syllable based TTS System in Assamese using Prosody Modification
    Sharma, Bidisha
    Prasanna, S. R. Mahadeva
    [J]. 2015 ANNUAL IEEE INDIA CONFERENCE (INDICON), 2015,
  • [26] Speaker transformation using sentence HMM based alignments and detailed prosody modification
    Arslan, LM
    Talkin, D
    [J]. PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 289 - 292
  • [27] Improving information system flexibility through remote dynamic - Component configuration
    Liu, Lu
    Li, Zongyong
    Li, Ruibo
    [J]. 2006 INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT, VOLS 1 AND 2, PROCEEDINGS, 2006, : 461 - 466
  • [28] Methods and Means of Improving the Dynamic Characteristics of Brushless Excitation Systems
    Komkov A.L.
    Filimonov N.Y.
    Yurganov A.A.
    [J]. Power Technology and Engineering, 2020, 54 (4) : 575 - 580
  • [29] Methods and Means of Improving the Dynamic Characteristics of Brushless Excitation Systems
    Komkov, A.L.
    Filimonov, N. Yu.
    Yurganov, A.A.
    [J]. Komkov, A.L. (Andrey.Komkov@ruselmash.ru), 1600, Springer (54): : 575 - 580
  • [30] A Pitch and Noise Robust Keyword Spotting System Using SMAC Features with Prosody Modification
    Maity, Karabi
    Pradhan, Gayadhar
    Singh, Jyoti Prakash
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2021, 40 (04) : 1892 - 1904