A System Design of English Speech Synthesis

被引:0
|
作者
Li, Su-Mei [1 ,2 ]
Liu, Cong-Cong [3 ]
Yang, Yuan-Cheng [3 ]
Li, Xin-Guang [1 ]
Ma, Shan-Xian [3 ]
机构
[1] Guangdong Univ Foreign Studies, Lab Language Engn & Comp, Guangzhou, Peoples R China
[2] Guangdong Univ Foreign Studies, Expt Teaching Ctr, Guangzhou, Peoples R China
[3] Guangdong Univ Foreign Studies, Sch Informat Sci & Technol, Guangzhou, Peoples R China
关键词
WaveNet; WaveRNN; English Speech Synthesis; Aritificial Intelligence;
D O I
10.1109/CISP-BMEI53629.2021.9624390
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Speech synthesis, also known as text to speech (TTS), is a technology to convert text into sound, which is also an important technology to realize the communication between human and machine. English is an international language. It is necessary to study English speech synthesis technology. Aiming at English speech synthesis, an end-to-end speech synthesis method and system based on convolutional neural network WaveRNN and WaveNet is explained used in this paper. Experiments show that the Mean Opinion Score (MOS) of the synthesized speech is 3.32, and the speech quality is better than that of the general parametric speech synthesis system.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Text to Speech Synthesis System in Indian English
    Mahanta, Deepshikha
    Sharma, Bidisha
    Sarmah, Priyankoo
    Prasanna, S. R. Mahadeva
    [J]. PROCEEDINGS OF THE 2016 IEEE REGION 10 CONFERENCE (TENCON), 2016, : 2614 - 2618
  • [2] Iterative English accent adaptation in a speech synthesis system
    Olinsky, C
    Cummins, F
    [J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 79 - 82
  • [3] Text to Speech Synthesis System for English to Malayalam Translation
    Anto, Ancy
    Nisha, K. K.
    [J]. IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGICAL TRENDS IN COMPUTING, COMMUNICATIONS AND ELECTRICAL ENGINEERING (ICETT), 2016,
  • [4] Design of a Speaking Training System for English Speech Education using Speech Recognition Technology
    He, Hengheng
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (11) : 450 - 455
  • [5] Design of a Speaking Training System for English Speech Education using Speech Recognition Technology
    He H.
    [J]. International Journal of Advanced Computer Science and Applications, 2022, 13 (11): : 450 - 455
  • [6] Trainable Cantonese/English dual language speech synthesis system
    Li, HP
    Chen, FX
    Shen, LQ
    Ma, XJ
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 508 - 511
  • [7] An HMM-based speech synthesis system applied to English
    Tokuda, K
    Zen, H
    Black, AW
    [J]. PROCEEDINGS OF THE 2002 IEEE WORKSHOP ON SPEECH SYNTHESIS, 2002, : 227 - 230
  • [8] INTEGRATING MACHINE TRANSLATION AND SPEECH SYNTHESIS COMPONENT FOR ENGLISH TO DRAVIDIAN LANGUAGE SPEECH TO SPEECH TRANSLATION SYSTEM
    Sangeetha, J.
    Jothilakshmi, S.
    [J]. JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2015, 10 (02): : 196 - 211
  • [9] The IBM expressive text-to-speech synthesis system for American English
    Pitrelli, John F.
    Bakis, Raitno
    Eide, Ellen M.
    Fernandez, Raul
    Hamza, Wael
    Picheny, Michael A.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (04): : 1099 - 1108
  • [10] Intelligent speech sounds synthesis system design and application
    Wang, Hong
    Ning, Yu
    Weng, Wenhua
    [J]. Wuhan Gongye Daxue Xuebao/Journal of Wuhan University of Technology, 2001, 23 (09): : 61 - 64