Enhanced shape-invariant pitch and time-scale modification for concatenative speech synthesis

被引：0

作者：

Pollard, MP

Cheetham, BMG

Goodyear, CC

Edgington, MD

Lowry, A

机构：

来源：

ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4 | 1996年

关键词：

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

To preserve shape-invariance when pitch or time-scale modifying sinusoidally modelled voiced speech, the phases of the sinusoids used to model the glottal excitation are made to add coherently at estimated excitation points. Previous methods achieve this by estimating excitation phases at synthesis frame boundaries, disregarding the frequency modulation that may occur between the frame boundary and the nearest modified excitation point This approximation can produce a significant mis-alignment of the excitation phases, leading to distortion of the temporal structure of the synthetic speech. In this paper, a shape-invariant technique is proposed which aligns the excitation phases at excitation points, whilst allowing for variations in the frequency of the sinusoidal components.

引用

页码：1433 / 1436

页数：4

共 50 条

[41] A new technique for improving quality of speech in voice over IP using time-scale modification
Agnihotri, S
Aravindhan, K
Jamadagni, HS
Pawate, BI
2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 2085 - 2088
[42] Low Bit Rate Speech Coding Using Lattice Vector Quantization and Time-Scale Modification
Xiao, Qiang
Chen, Liang
Geng, Chao
MANUFACTURING SCIENCE AND TECHNOLOGY, PTS 1-8, 2012, 383-390 : 5111 - +
[43] Speaking rate control based on time-scale modification and its effects on the performance of speech recognition
Kang, Jin Ah
Choi, Seung Ho
INTERNATIONAL JOURNAL OF ENGINEERING SYSTEMS MODELLING AND SIMULATION, 2014, 6 (1-2) : 31 - 36
[44] Wavelet speech enhancement based on time-scale adaptation
Bahoura, Mohammed
Rouat, Jean
SPEECH COMMUNICATION, 2006, 48 (12) : 1620 - 1637
[45] A novel quality measure for time-scale modified speech
Chen, Fu-Kun
Jou, Yue-Dar
2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 208 - +
[46] Time-Scale Feature Extractions for Emotional Speech Characterization
Chetouani, Mohamed
Mahdhaoui, Ammar
Ringeval, Fabien
COGNITIVE COMPUTATION, 2009, 1 (02) : 194 - 201
[47] Speech compensation to time-scale modified auditory feedback
Graduate School of Sport Sciences, Waseda University, 2-579-15, Mikajima, Tokorozawa, Saitama, Japan
不详
Int. Semin. Speech Prod., ISSP, (321-328):
[48] Mel-Scale Sub-band Modelling for Perceptually Improved Time-Scale Modification of Speech and Audio Signals
Sharma, Neeraj
Potadar, Shreepad
Chetupalli, Srikanth Raj
Sreenivas, T. V.
2017 TWENTY-THIRD NATIONAL CONFERENCE ON COMMUNICATIONS (NCC), 2017,
[49] Objective quality measurement for audio time-scale modification
Liu, F
Lee, JJ
Kuo, CCJ
INTERNET MULTIMEDIA MANAGEMENT SYSTEMS IV, 2003, 5242 : 208 - 216
[50] A time-scale modification dataset with subjective quality labels
Roberts, Timothy
Paliwal, Kuldip K.
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2020, 148 (01): : 201 - 210

← 1 2 3 4 5 →