On a modified cepstral pitch control technique for the high quality text-to-speech type system

被引:0
|
作者
Kim, J [1 ]
Bae, M [1 ]
机构
[1] Soongsil Univ, Dept Telecommun Engn, Seoul 156743, South Korea
关键词
D O I
10.1109/MWSCAS.1998.759568
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In the area of speech synthesis, the waveform coding methods are mainly used to maintain intelligibility and naturalness of synthetic speech, However, it is difficult to apply the waveform coding methods to the synthesis by rule since this methods do not separate both the excitation information and vocal tract information from a speech signal. This paper proposes a modified pitch alteration method that can reduce the spectrum distortion by reconstructing the pitch altered speech signal using both the formant component in the quefrency domain and the phase component in the time domain. This has little spectrum distortion of 1.18% for 50% pitch change.
引用
收藏
页码:616 / 619
页数:4
相关论文
共 50 条
  • [41] High quality Arabic text-to-speech synthesis using unit selection
    Abdelmalek, Raja
    Mnasri, Zied
    2016 13TH INTERNATIONAL MULTI-CONFERENCE ON SYSTEMS, SIGNALS & DEVICES (SSD), 2016, : 1 - 5
  • [42] An Advanced NLP Framework for High-Quality Text-to-Speech Synthesis
    Ungurean, Catalin
    Burileanu, Dragos
    2011 6TH CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2011,
  • [43] Enhancing the Quality of Nepali Text-to-Speech Systems
    Ghimire, Rupak Raj
    Bal, Bal Krishna
    CREATIVITY IN INTELLIGENT TECHNOLOGIES AND DATA SCIENCE, (CIT&DS), 2017, 754 : 187 - 197
  • [44] Part of Speech Tagging for Romanian Text-to-Speech System
    Teodorescu, Lucian Radu
    Boldizsar, Razvan
    Ordean, Mihai
    Duma, Melania
    Detesan, Laura
    Ordean, Mihaela
    13TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2011), 2012, : 153 - 159
  • [45] Perceptual Quality Dimensions of Text-to-Speech Systems
    Hinterleitner, Florian
    Moeller, Sebastian
    Norrenbrock, Christoph
    Heute, Ulrich
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2188 - 2191
  • [46] Enhanced quality text-to-speech for restricted domains
    不详
    BELL LABS TECHNICAL JOURNAL, 1997, 2 (04) : 169 - 170
  • [47] On a speech synthesis technique with high quality by the pitch alteration of speech waveform
    Kim, D
    Bae, MJ
    Ieem, JS
    ISSPA 96 - FOURTH INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND ITS APPLICATIONS, PROCEEDINGS, VOLS 1 AND 2, 1996, : 714 - 717
  • [48] Including Pitch Accent Optionality in Unit Selection Text-to-Speech Synthesis
    Badino, Leonardo
    Clark, Robert A. J.
    Strom, Volker
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2118 - 2121
  • [49] Explicit Intensity Control for Accented Text-to-speech
    Liu, Rui
    Zuo, Haolin
    Hu, De
    Gao, Guanglai
    Li, Haizhou
    INTERSPEECH 2023, 2023, : 22 - 26
  • [50] FOCUS AND ACCENT IN A DUTCH TEXT-TO-SPEECH SYSTEM
    BAART, JLG
    FOURTH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 1989, : 111 - 115