A novel prosodic-information synthesizer based on recurrent fuzzy neural network for the Chinese TTS system

被引:11
|
作者
Lin, CT [1 ]
Wu, RC [1 ]
Chang, JY [1 ]
Liang, SF [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Elect & Control Engn, Hsinchu 300, Taiwan
关键词
Chinese text-to-speech system; fuzzy inference engine; prosodic information; recurrent neural network; sandhi rules; speech synthesizer;
D O I
10.1109/TSMCB.2003.811518
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a new technique for the Chinese text-to-speech (TTS) system is proposed. Our major effort focuses on the prosodic information generation. New methodologies for constructing fuzzy rules in a prosodic model simulating human's pronouncing rules are developed. The proposed Recurrent Fuzzy Neural Network (RFNN) is a multilayer recurrent neural network (RNN) which integrates a Self-constructing Neural Fuzzy Inference Network (SONFIN) into a recurrent connectionist structure. The RFNN can be functionally divided into two,parts. The first part adopts the SONFIN as a prosodic model to explore the relationship between high-level linguistic features and prosodic information based on fuzzy inference rules. As compared to conventional neural networks, the SONFIN can always construct itself with an economic network size in high learning speed. The second part employs a five-layer network to generate all prosodic parameters by directly using the prosodic fuzzy rules inferred from the first part as well as other important features of syllables. The TTS system combined with the proposed method can behave not only sandhi rules but also the other prosodic phenomena existing in the traditional TTS systems. Moreover, the proposed scheme can even find out some new rules about prosodic phrase structure. The performance of the proposed RFNN-based prosodic model is verified by imbedding it into a Chinese TTS system with a Chinese monosyllable database based on the time-domain pitch synchronous overlap add (TD-PSOLA) method. Our experimental results show that the proposed RFNN can generate proper prosodic parameters including pitch means, pitch shapes, maximum energy levels, syllable duration, and pause duration. Some synthetic sounds are on-line available for demonstration.
引用
下载
收藏
页码:309 / 324
页数:16
相关论文
共 50 条
  • [31] The information search system using neural network and fuzzy clustering based on mobile agent
    Ko, J
    Gerardo, BD
    Lee, J
    Hwang, JJ
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2005, PT 2, 2005, 3481 : 205 - 214
  • [32] The Information-enhanced BIT Design of Avionics System Based on Fuzzy Neural Network
    Yao Guo-Ping
    Hou Wen-Kui
    Shi Long
    Shi Jun-You
    PROCEEDINGS 2013 INTERNATIONAL CONFERENCE ON MECHATRONIC SCIENCES, ELECTRIC ENGINEERING AND COMPUTER (MEC), 2013, : 2950 - 2954
  • [33] A block-diagonal recurrent fuzzy neural network for system identification
    Paris A. Mastorocostas
    Constantinos S. Hilas
    Neural Computing and Applications, 2009, 18 : 707 - 717
  • [34] A Multilayer Recurrent Fuzzy Neural Network for Accurate Dynamic System Modeling
    柳贺
    黄道
    Journal of Donghua University(English Edition), 2008, 25 (04) : 373 - 378
  • [35] Dynamic system identification using a recurrent compensatory fuzzy neural network
    Lee, Chi-Yung
    Lin, Cheng-Jian
    Chen, G-Hung
    Chang, Chun-Lung
    2008, Institute of Control, Robotics and Systems (06)
  • [36] Dynamic system identification using a recurrent compensatory fuzzy neural network
    Lee, Chi-Yung
    Lin, Cheng-Jian
    Chen, Cheng-Hung
    Chang, Chun-Lung
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2008, 6 (05) : 755 - 766
  • [37] A block-diagonal recurrent fuzzy neural network for system identification
    Mastorocostas, Paris A.
    Hilas, Constantinos S.
    NEURAL COMPUTING & APPLICATIONS, 2009, 18 (07): : 707 - 717
  • [38] Modified PSO Algorithm on Recurrent Fuzzy Neural Network for System Identification
    Hung, Chung Wen
    Mao, Wei Lung
    Huang, Han Yi
    INTELLIGENT AUTOMATION AND SOFT COMPUTING, 2019, 25 (02): : 329 - 341
  • [39] A Novel Rough Neural Network Based on Fuzzy Partition
    Xu Xiang
    Zhang Dongbo
    Wang Yaonanr
    Liu Ziwen
    2010 CHINESE CONTROL AND DECISION CONFERENCE, VOLS 1-5, 2010, : 2350 - +
  • [40] A novel recurrent neural network-based prediction system for option trading and hedging
    C. Quek
    M. Pasquier
    N. Kumar
    Applied Intelligence, 2008, 29 : 138 - 151