Enhanced shape-invariant pitch and time-scale modification for concatenative speech synthesis

被引:0
|
作者
Pollard, MP
Cheetham, BMG
Goodyear, CC
Edgington, MD
Lowry, A
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
To preserve shape-invariance when pitch or time-scale modifying sinusoidally modelled voiced speech, the phases of the sinusoids used to model the glottal excitation are made to add coherently at estimated excitation points. Previous methods achieve this by estimating excitation phases at synthesis frame boundaries, disregarding the frequency modulation that may occur between the frame boundary and the nearest modified excitation point This approximation can produce a significant mis-alignment of the excitation phases, leading to distortion of the temporal structure of the synthetic speech. In this paper, a shape-invariant technique is proposed which aligns the excitation phases at excitation points, whilst allowing for variations in the frequency of the sinusoidal components.
引用
收藏
页码:1433 / 1436
页数:4
相关论文
共 50 条
  • [41] A new technique for improving quality of speech in voice over IP using time-scale modification
    Agnihotri, S
    Aravindhan, K
    Jamadagni, HS
    Pawate, BI
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 2085 - 2088
  • [42] Low Bit Rate Speech Coding Using Lattice Vector Quantization and Time-Scale Modification
    Xiao, Qiang
    Chen, Liang
    Geng, Chao
    MANUFACTURING SCIENCE AND TECHNOLOGY, PTS 1-8, 2012, 383-390 : 5111 - +
  • [43] Speaking rate control based on time-scale modification and its effects on the performance of speech recognition
    Kang, Jin Ah
    Choi, Seung Ho
    INTERNATIONAL JOURNAL OF ENGINEERING SYSTEMS MODELLING AND SIMULATION, 2014, 6 (1-2) : 31 - 36
  • [44] Wavelet speech enhancement based on time-scale adaptation
    Bahoura, Mohammed
    Rouat, Jean
    SPEECH COMMUNICATION, 2006, 48 (12) : 1620 - 1637
  • [45] A novel quality measure for time-scale modified speech
    Chen, Fu-Kun
    Jou, Yue-Dar
    2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 208 - +
  • [46] Time-Scale Feature Extractions for Emotional Speech Characterization
    Chetouani, Mohamed
    Mahdhaoui, Ammar
    Ringeval, Fabien
    COGNITIVE COMPUTATION, 2009, 1 (02) : 194 - 201
  • [47] Speech compensation to time-scale modified auditory feedback
    Graduate School of Sport Sciences, Waseda University, 2-579-15, Mikajima, Tokorozawa, Saitama, Japan
    不详
    Int. Semin. Speech Prod., ISSP, (321-328):
  • [48] Mel-Scale Sub-band Modelling for Perceptually Improved Time-Scale Modification of Speech and Audio Signals
    Sharma, Neeraj
    Potadar, Shreepad
    Chetupalli, Srikanth Raj
    Sreenivas, T. V.
    2017 TWENTY-THIRD NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2017,
  • [49] Objective quality measurement for audio time-scale modification
    Liu, F
    Lee, JJ
    Kuo, CCJ
    INTERNET MULTIMEDIA MANAGEMENT SYSTEMS IV, 2003, 5242 : 208 - 216
  • [50] A time-scale modification dataset with subjective quality labels
    Roberts, Timothy
    Paliwal, Kuldip K.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2020, 148 (01): : 201 - 210