Improving the Flexibility of Dynamic Prosody Modification Using Instants of Significant Excitation

Cited by: 8
Authors
Govind, D. [1]
Joy, Tinu T. [1]
Affiliations
[1] Amrita Vishwa Vidyapeetham University, Center for Computational Engineering and Networking, Coimbatore, Tamil Nadu, India
Keywords
Dynamic prosody modification; Instants of significant excitation; Objective measure; Pitch markers; Jitter; Time-scale modification; Speech signals; Epoch extraction
DOI
10.1007/s00034-015-0159-5
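Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract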
Modification of suprasegmental features such as the pitch and duration of original speech by fixed scaling factors is referred to as static prosody modification. In dynamic prosody modification, the prosodic scaling factors (time-varying modification factors) are defined for all the pitch cycles present in the original speech. The present work focuses on improving the naturalness of prosody-modified speech by reducing the generation of piecewise constant segments in the modified pitch contour. The prosody modification is performed by anchoring around the accurate instants of significant excitation estimated from the original speech. The division of longer pitch intervals into many equal intervals over long speech segments introduces step-like discontinuities in the form of piecewise constant segments in the modified pitch contours. The effectiveness of the proposed dynamic modification method is first confirmed by the smooth modified pitch contours obtained for finer static prosody scaling factors, and by waveforms, spectrogram plots, and comparative subjective evaluations. In addition, the average jitter computed from the pitch segments of each glottal activity region in the modified speech is proposed as an objective measure for prosody modification. The naturalness of the prosody-modified speech produced by the proposed method is compared, objectively and subjectively, with that of the existing zero frequency filtered signal-based dynamic prosody modification. The proposed algorithm also effectively preserves the dynamics of prosodic patterns in singing voices, wherein the parameters fluctuate rapidly and continuously within a higher range.
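The abstract does not state the exact jitter formula used in the paper. As a rough illustration only, the sketch below assumes the common "local jitter" definition (mean absolute difference between consecutive pitch periods, divided by the mean pitch period) and averages it over voiced regions. The function names `local_jitter` and `average_jitter`, and the representation of each glottal activity region as a list of epoch positions, are hypothetical and not taken from the paper.

```python
# Minimal sketch, NOT the paper's implementation.
# Assumption: jitter is the standard "local jitter" ratio; the paper may
# define its objective measure differently.

def local_jitter(epochs):
    """Local jitter for one voiced region.

    epochs: sorted epoch (pitch marker) positions, in samples or seconds.
    Returns mean |T[i+1] - T[i]| divided by mean T, where T are the
    pitch periods between consecutive epochs.
    """
    periods = [b - a for a, b in zip(epochs, epochs[1:])]
    if len(periods) < 2:
        return 0.0  # not enough pitch cycles to measure variation
    diffs = [abs(q - p) for p, q in zip(periods, periods[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_period = sum(periods) / len(periods)
    return mean_diff / mean_period


def average_jitter(regions):
    """Average the local jitter over regions with at least three epochs."""
    values = [local_jitter(r) for r in regions if len(r) >= 3]
    return sum(values) / len(values) if values else 0.0


if __name__ == "__main__":
    # Hypothetical epoch positions in samples for two voiced regions.
    steady = [0, 80, 160, 240]    # identical periods
    varying = [0, 80, 170, 250]   # periods 80, 90, 80
    print(local_jitter(steady))
    print(local_jitter(varying))
    print(average_jitter([steady, varying]))
```

Under this definition, a run of identical pitch periods (a piecewise constant stretch of the pitch contour) contributes zero jitter, whereas natural speech shows small cycle-to-cycle variation.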
Pages: 2518-2543
Page count: 26
Related papers
50 results in total
  • [1] Improving the Flexibility of Dynamic Prosody Modification Using Instants of Significant Excitation
    D. Govind
    Tinu T. Joy
    [J]. Circuits, Systems, and Signal Processing, 2016, 35 : 2518 - 2543
  • [2] Prosody modification using instants of significant excitation
    Rao, KS
    Yegnanarayana, B
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (03): : 972 - 980
  • [3] Unconstrained Pitch Contour Modification Using Instants of Significant Excitation
    Rao, Krothapalli Sreenivasa
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2012, 31 (06) : 2133 - 2152
  • [4] Unconstrained Pitch Contour Modification Using Instants of Significant Excitation
    Krothapalli Sreenivasa Rao
    [J]. Circuits, Systems, and Signal Processing, 2012, 31 : 2133 - 2152
  • [5] Prosodic manipulation using instants of significant excitation
    Rao, KS
    Yegnanarayana, B
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 528 - 531
  • [6] Prosodic manipulation using instants of significant excitation
    Rao, KS
    Yegnanarayana, B
    [J]. 2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 389 - 392
  • [7] LPC VOCODER Using Instants of Significant Excitation and Pole Focusing
    Saranya, A.
    Sripriya, N.
    [J]. ADVANCES IN PARALLEL, DISTRIBUTED COMPUTING, 2011, 203 : 180 - 190
  • [8] Non-uniform time scale modification using instants of significant excitation and vowel onset points
    Rao, K. Sreenivasa
    Vuppala, Anil Kumar
    [J]. SPEECH COMMUNICATION, 2013, 55 (06) : 745 - 756
  • [9] Estimation of Instants of Significant Excitation using Accumulated Energy Function of DCT
    Sripriya, N.
    Nagarajan, T.
    [J]. 2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
  • [10] DETERMINATION OF INSTANTS OF SIGNIFICANT EXCITATION IN SPEECH USING GROUP DELAY FUNCTION
    SMITS, R
    YEGNANARAYANA, B
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1995, 3 (05): : 325 - 333