On a modified cepstral pitch control technique for the high quality text-to-speech type system

被引：0

作者：

Kim, J ^{[1
]}

Bae, M ^{[1
]}

机构：

[1] Soongsil Univ, Dept Telecommun Engn, Seoul 156743, South Korea

来源：

1998 MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, PROCEEDINGS | 1999年

关键词：

D O I：

10.1109/MWSCAS.1998.759568

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

In the area of speech synthesis, the waveform coding methods are mainly used to maintain intelligibility and naturalness of synthetic speech, However, it is difficult to apply the waveform coding methods to the synthesis by rule since this methods do not separate both the excitation information and vocal tract information from a speech signal. This paper proposes a modified pitch alteration method that can reduce the spectrum distortion by reconstructing the pitch altered speech signal using both the formant component in the quefrency domain and the phase component in the time domain. This has little spectrum distortion of 1.18% for 50% pitch change.

引用

页码：616 / 619

页数：4

共 50 条

[1] On a cepstral technique for pitch control in the high quality text-to-speech type system
Bae, MJ
Lee, SH
PROCEEDINGS OF THE 39TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS I-III, 1996, : 803 - 806
[2] On a spectral scaling technique for pitch control in the high quality text-to-speech type system
Chung, HG
Bae, MJ
40TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1 AND 2, 1998, : 1430 - 1433
[3] Pitch models of Mandarin text-to-speech
邵艳秋
穗志方
韩纪庆
Journal of Harbin Institute of Technology(New series), 2009, 16 (02) : 179 - 184
[4] High-quality prosody generation in Mandarin text-to-speech system
Guo, Qing
Zhang, Jie
Katae, Nobuyuki
Yu, Hao
Fujitsu Scientific and Technical Journal, 2010, 46 (01): : 40 - 46
[5] High-Quality Prosody Generation in Mandarin Text-to-Speech System
Guo, Qing
Zhang, Jie
Katae, Nobuyuki
Yu, Hao
FUJITSU SCIENTIFIC & TECHNICAL JOURNAL, 2010, 46 (01): : 40 - 46
[6] A high quality text-to-speech system composed of multiple neural networks
Karaali, O
Corrigan, G
Massey, N
Miller, C
Schnurr, O
Mackie, A
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 1237 - 1240
[7] FASTPITCH: PARALLEL TEXT-TO-SPEECH WITH PITCH PREDICTION
Lancucki, Adrian
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6588 - 6592
[8] High-quality text-to-speech synthesis: An overview
Dutoit, T.
Journal of Electrical and Electronics Engineering, Australia, 1997, 17 (01): : 25 - 36
[9] Slovenian text-to-speech system
Sef, T
ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL V: EMERGING TECHNOLOGIES FOR THE 21ST CENTURY, 2000, : 41 - 44
[10] A Hakka text-to-speech system
Yu, Hsiu-Min
Hwang, Hsin-Te
Lin, Dong-Yi
Chen, Sin-Horng
CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 241 - +

← 1 2 3 4 5 →