On a modified cepstral pitch control technique for the high quality text-to-speech type system

被引:0
|
作者
Kim, J [1 ]
Bae, M [1 ]
机构
[1] Soongsil Univ, Dept Telecommun Engn, Seoul 156743, South Korea
关键词
D O I
10.1109/MWSCAS.1998.759568
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In the area of speech synthesis, the waveform coding methods are mainly used to maintain intelligibility and naturalness of synthetic speech, However, it is difficult to apply the waveform coding methods to the synthesis by rule since this methods do not separate both the excitation information and vocal tract information from a speech signal. This paper proposes a modified pitch alteration method that can reduce the spectrum distortion by reconstructing the pitch altered speech signal using both the formant component in the quefrency domain and the phase component in the time domain. This has little spectrum distortion of 1.18% for 50% pitch change.
引用
收藏
页码:616 / 619
页数:4
相关论文
共 50 条
  • [1] On a cepstral technique for pitch control in the high quality text-to-speech type system
    Bae, MJ
    Lee, SH
    PROCEEDINGS OF THE 39TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS I-III, 1996, : 803 - 806
  • [2] On a spectral scaling technique for pitch control in the high quality text-to-speech type system
    Chung, HG
    Bae, MJ
    40TH MIDWEST SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1 AND 2, 1998, : 1430 - 1433
  • [3] Pitch models of Mandarin text-to-speech
    邵艳秋
    穗志方
    韩纪庆
    Journal of Harbin Institute of Technology(New series), 2009, 16 (02) : 179 - 184
  • [4] High-quality prosody generation in Mandarin text-to-speech system
    Guo, Qing
    Zhang, Jie
    Katae, Nobuyuki
    Yu, Hao
    Fujitsu Scientific and Technical Journal, 2010, 46 (01): : 40 - 46
  • [5] High-Quality Prosody Generation in Mandarin Text-to-Speech System
    Guo, Qing
    Zhang, Jie
    Katae, Nobuyuki
    Yu, Hao
    FUJITSU SCIENTIFIC & TECHNICAL JOURNAL, 2010, 46 (01): : 40 - 46
  • [6] A high quality text-to-speech system composed of multiple neural networks
    Karaali, O
    Corrigan, G
    Massey, N
    Miller, C
    Schnurr, O
    Mackie, A
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 1237 - 1240
  • [7] FASTPITCH: PARALLEL TEXT-TO-SPEECH WITH PITCH PREDICTION
    Lancucki, Adrian
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6588 - 6592
  • [8] High-quality text-to-speech synthesis: An overview
    Dutoit, T.
    Journal of Electrical and Electronics Engineering, Australia, 1997, 17 (01): : 25 - 36
  • [9] Slovenian text-to-speech system
    Sef, T
    ISCAS 2000: IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS - PROCEEDINGS, VOL V: EMERGING TECHNOLOGIES FOR THE 21ST CENTURY, 2000, : 41 - 44
  • [10] A Hakka text-to-speech system
    Yu, Hsiu-Min
    Hwang, Hsin-Te
    Lin, Dong-Yi
    Chen, Sin-Horng
    CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 241 - +