Enhanced shape-invariant pitch and time-scale modification for concatenative speech synthesis

Cited by: 0
Authors
Pollard, MP
Cheetham, BMG
Goodyear, CC
Edgington, MD
Lowry, A
DOI
Not available
Chinese Library Classification number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
To preserve shape invariance when modifying the pitch or time scale of sinusoidally modelled voiced speech, the phases of the sinusoids used to model the glottal excitation are made to add coherently at estimated excitation points. Previous methods achieve this by estimating excitation phases at synthesis frame boundaries, disregarding the frequency modulation that may occur between the frame boundary and the nearest modified excitation point. This approximation can produce a significant misalignment of the excitation phases, leading to distortion of the temporal structure of the synthetic speech. In this paper, a shape-invariant technique is proposed which aligns the excitation phases at the excitation points themselves, whilst allowing for variations in the frequency of the sinusoidal components.
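The misalignment described in the abstract can be illustrated with a small numerical sketch. This is not the authors' algorithm: it assumes a purely harmonic model, a fundamental frequency that varies linearly between the frame boundary and the excitation point, and an excitation phase of zero at the excitation point. All function names and parameter values are hypothetical.

```python
import math


def wrap(phi):
    """Wrap a phase in radians to the interval [-pi, pi)."""
    return (phi + math.pi) % (2.0 * math.pi) - math.pi


def phase_advance(k, f0_start_hz, f0_slope_hz_per_s, t):
    """Phase accumulated by harmonic k over t seconds when the
    fundamental follows f0(t) = f0_start + slope * t (assumed model)."""
    return 2.0 * math.pi * k * (f0_start_hz * t + 0.5 * f0_slope_hz_per_s * t * t)


def boundary_phase_constant(k, f0_start_hz, t_exc):
    """Frame-boundary phase chosen under the constant-frequency
    approximation, so that the phase would be zero at t_exc if f0 did not change."""
    return -2.0 * math.pi * k * f0_start_hz * t_exc


def boundary_phase_linear(k, f0_start_hz, f0_slope_hz_per_s, t_exc):
    """Frame-boundary phase chosen so that the phase is zero at t_exc
    when the linear variation of f0 is taken into account."""
    return -phase_advance(k, f0_start_hz, f0_slope_hz_per_s, t_exc)


def residual_at_excitation(k, phi0, f0_start_hz, f0_slope_hz_per_s, t_exc):
    """Wrapped phase of harmonic k at the excitation point; zero means
    the harmonic adds coherently with the others there."""
    return wrap(phi0 + phase_advance(k, f0_start_hz, f0_slope_hz_per_s, t_exc))


# Hypothetical values: f0 starts at 100 Hz and rises at 2000 Hz/s;
# the excitation point lies 4 ms after the frame boundary.
F0, SLOPE, T_EXC = 100.0, 2000.0, 0.004

for k in (1, 10, 20):
    err_const = residual_at_excitation(
        k, boundary_phase_constant(k, F0, T_EXC), F0, SLOPE, T_EXC)
    err_lin = residual_at_excitation(
        k, boundary_phase_linear(k, F0, SLOPE, T_EXC), F0, SLOPE, T_EXC)
    print(f"harmonic {k:2d}: constant-f0 error {err_const:+.4f} rad, "
          f"linear-f0 error {err_lin:+.4f} rad")
```

Under these assumptions the residual of the constant-frequency approximation is pi * k * slope * t_exc^2 before wrapping, so it grows with harmonic number, which is consistent with the abstract's remark that the temporal structure of the waveform is distorted.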
Pages: 1433-1436
Number of pages: 4
Related papers
50 in total
  • [1] Shape-invariant pitch and time-scale modification of speech by variable order phase interpolation
    Pollard, MP
    Cheetham, BMG
    Goodyear, CC
    Edgington, MD
    1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 919 - 922
  • [2] SHAPE INVARIANT TIME-SCALE AND PITCH MODIFICATION OF SPEECH
    QUATIERI, TF
    MCAULAY, RJ
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1992, 40 (03) : 497 - 510
  • [3] Shape invariant time-scale modification of speech using a harmonic model
    O'Brien, Darragh
    Monaghan, Alex
    ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 1 : 381 - 384
  • [4] Shape invariant time-scale modification of speech using a harmonic model
    O'Brien, D
    Monaghan, A
    ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 381 - 384
  • [5] NONPARAMETRIC TECHNIQUES FOR PITCH-SCALE AND TIME-SCALE MODIFICATION OF SPEECH
    MOULINES, E
    LAROCHE, J
    SPEECH COMMUNICATION, 1995, 16 (02) : 175 - 205
  • [6] Time-scale and pitch modification for Chinese speech based on sinusoidal model
    Zhou, J.Y.
    Chai, P.Q.
    Tongji Daxue Xuebao/Journal of Tongji University, 2001, 29 (03): : 312 - 316
  • [7] Source-filter models for time-scale pitch-scale modification of speech
    Acero, A
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 881 - 884
  • [8] Speech Time-Scale Modification With GANs
    Cohen, Eyal
    Kreuk, Felix
    Keshet, Joseph
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1067 - 1071
  • [9] Time-scale modification of speech signals
    Ninness, Brett
    Henriksen, Soren John
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2008, 56 (04) : 1479 - 1488
  • [10] Voice Privacy Through Time-Scale and Pitch Modification
    Prajapati, Gauri P.
    Singh, Dipesh K.
    Patil, Hemant A.
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2021, 2024, 13102 : 72 - 80