A novel prosodic-information synthesizer based on recurrent fuzzy neural network for the Chinese TTS system

被引:11
|
作者
Lin, CT [1 ]
Wu, RC [1 ]
Chang, JY [1 ]
Liang, SF [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Elect & Control Engn, Hsinchu 300, Taiwan
关键词
Chinese text-to-speech system; fuzzy inference engine; prosodic information; recurrent neural network; sandhi rules; speech synthesizer;
D O I
10.1109/TSMCB.2003.811518
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a new technique for the Chinese text-to-speech (TTS) system is proposed. Our major effort focuses on the prosodic information generation. New methodologies for constructing fuzzy rules in a prosodic model simulating human's pronouncing rules are developed. The proposed Recurrent Fuzzy Neural Network (RFNN) is a multilayer recurrent neural network (RNN) which integrates a Self-constructing Neural Fuzzy Inference Network (SONFIN) into a recurrent connectionist structure. The RFNN can be functionally divided into two,parts. The first part adopts the SONFIN as a prosodic model to explore the relationship between high-level linguistic features and prosodic information based on fuzzy inference rules. As compared to conventional neural networks, the SONFIN can always construct itself with an economic network size in high learning speed. The second part employs a five-layer network to generate all prosodic parameters by directly using the prosodic fuzzy rules inferred from the first part as well as other important features of syllables. The TTS system combined with the proposed method can behave not only sandhi rules but also the other prosodic phenomena existing in the traditional TTS systems. Moreover, the proposed scheme can even find out some new rules about prosodic phrase structure. The performance of the proposed RFNN-based prosodic model is verified by imbedding it into a Chinese TTS system with a Chinese monosyllable database based on the time-domain pitch synchronous overlap add (TD-PSOLA) method. Our experimental results show that the proposed RFNN can generate proper prosodic parameters including pitch means, pitch shapes, maximum energy levels, syllable duration, and pause duration. Some synthetic sounds are on-line available for demonstration.
引用
下载
收藏
页码:309 / 324
页数:16
相关论文
共 50 条
  • [21] AN INTELLIGENT CONTROL SYSTEM BASED ON RECURRENT NEURAL FUZZY NETWORK AND ITS APPLICATION TO CSTR
    JIA Li YU Jinshou (Research Institute of Automation
    Journal of Systems Science & Complexity, 2005, (01) : 43 - 54
  • [22] Dynamic system modeling with multilayer recurrent fuzzy neural network
    Liu, He
    Huang, Dao
    Jia, Li
    CIS: 2007 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PROCEEDINGS, 2007, : 570 - +
  • [23] System fuzzy modeling based on fuzzy neural network
    Harbin Gongye Daxue Xuebao, 5 (79-81, 85):
  • [24] A nonlinear ANC system with a SPSA-based recurrent fuzzy neural network controller
    Zhang, Qizhi
    Zhou, Yali
    Liu, Xiaohe
    Li, Xiaodong
    Gan, Woonseng
    ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 176 - +
  • [25] An induction generator system using fuzzy modeling and recurrent fuzzy neural network
    Lin, Faa-Jeng
    Huang, Po-Kai
    Wang, Hin-Chien
    Teng, Li-Tao
    IEEE TRANSACTIONS ON POWER ELECTRONICS, 2007, 22 (01) : 260 - 271
  • [26] Fault diagnosis based on the fuzzy-recurrent neural network
    Zhao, X.
    Xiao, D.Y.
    Asian Journal of Control, 2001, 3 (02) : 89 - 95
  • [27] A Controller for Robotic Manipulators Based on Recurrent Fuzzy Neural Network
    Zhang, Hongmin
    Dai, Xuefeng
    2011 3RD WORLD CONGRESS IN APPLIED COMPUTING, COMPUTER SCIENCE, AND COMPUTER ENGINEERING (ACC 2011), VOL 3, 2011, 3 : 654 - +
  • [28] A Direct Feedback Control Based on Fuzzy Recurrent Neural Network
    李明
    马小平
    International Journal of Mining Science and Technology, 2002, (02) : 102 - 105
  • [29] Fuzzy time series prediction method based on fuzzy recurrent neural network
    Aliev, Rafik
    Fazlollahi, Bijan
    Aliev, Rashad
    Guirimov, Babek
    NEURAL INFORMATION PROCESSING, PT 2, PROCEEDINGS, 2006, 4233 : 860 - 869
  • [30] Nonlinear System Identification Based on a Novel Adaptive Fuzzy Wavelet Neural Network
    Salimifard, Maryam
    Safavi, Ali Akbar
    2013 21ST IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2013,