Enhanced shape-invariant pitch and time-scale modification for concatenative speech synthesis

被引:0
|
作者
Pollard, MP
Cheetham, BMG
Goodyear, CC
Edgington, MD
Lowry, A
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
To preserve shape-invariance when pitch or time-scale modifying sinusoidally modelled voiced speech, the phases of the sinusoids used to model the glottal excitation are made to add coherently at estimated excitation points. Previous methods achieve this by estimating excitation phases at synthesis frame boundaries, disregarding the frequency modulation that may occur between the frame boundary and the nearest modified excitation point This approximation can produce a significant mis-alignment of the excitation phases, leading to distortion of the temporal structure of the synthetic speech. In this paper, a shape-invariant technique is proposed which aligns the excitation phases at excitation points, whilst allowing for variations in the frequency of the sinusoidal components.
引用
收藏
页码:1433 / 1436
页数:4
相关论文
共 50 条
  • [21] A Spectral Variation Function for Variable Time-Scale Modification of Speech
    Kachare, Pramod H.
    Pandey, Prem C.
    2021 NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2021, : 48 - 52
  • [22] EFFECT OF TIME-SCALE MODIFICATION OF SPEECH ON THE SPEECH RECOGNITION THRESHOLD IN NOISE FOR ELDERLY LISTENERS
    STOLLMAN, MHP
    KAPTEYN, TS
    AUDIOLOGY, 1994, 33 (05): : 280 - 290
  • [23] TIME-SCALE MODIFICATION OF SPEECH SIGNALS FOR SUPPORTING HEARING IMPAIRED SCHOOLCHILDREN
    Kupryjanow, Adam
    Czyzewski, Andrzej
    SPA 2009: SIGNAL PROCESSING ALGORITHMS, ARCHITECTURES, ARRANGEMENTS, AND APPLICATIONS CONFERENCE PROCEEDINGS, 2009, : 159 - 162
  • [24] Speech Packet Concealment Techniques Based on Time-Scale Modification for VoIP
    Bhute, Vijaya P.
    Shrawankar, Urmila N.
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, 2008, : 825 - 828
  • [25] TIME-SCALE MODIFICATION OF SPEECH BASED ON SHORT-TIME FOURIER-ANALYSIS
    PORTNOFF, MR
    IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1981, 29 (03): : 374 - 390
  • [26] Time-Scale and Pitch-Scale Modification by the Phase Vocoder without Occurring the Phase Unwrapping Problem
    Yoneguchi, Ryoichi
    Murakami, Takahiro
    2017 22ND INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2017,
  • [27] Time-scale modification of speech signals, for language-learning impaired children
    Erogul, O
    Karagoz, I
    PROCEEDINGS OF THE 1998 2ND INTERNATIONAL CONFERENCE BIOMEDICAL ENGINEERING DAYS, 1998, : 33 - 35
  • [28] Audio watermarking by time-scale modification
    Mansour, MF
    Tewfik, AH
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING, 2001, : 1353 - 1356
  • [29] Frequency Dependent Time-Scale Modification
    Roberts, Timothy
    Paliwal, Kuldip K.
    2018 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2018,
  • [30] Time-scale modification of music signals
    Grofit, S
    Lavner, Y
    22ND CONVENTION OF ELECTRICAL AND ELECTRONICS ENGINEERS IN ISRAEL, PROCEEDINGS, 2002, : 254 - 256