A System Design of English Speech Synthesis

被引:0
|
作者
Li, Su-Mei [1 ,2 ]
Liu, Cong-Cong [3 ]
Yang, Yuan-Cheng [3 ]
Li, Xin-Guang [1 ]
Ma, Shan-Xian [3 ]
机构
[1] Guangdong Univ Foreign Studies, Lab Language Engn & Comp, Guangzhou, Peoples R China
[2] Guangdong Univ Foreign Studies, Expt Teaching Ctr, Guangzhou, Peoples R China
[3] Guangdong Univ Foreign Studies, Sch Informat Sci & Technol, Guangzhou, Peoples R China
关键词
WaveNet; WaveRNN; English Speech Synthesis; Aritificial Intelligence;
D O I
10.1109/CISP-BMEI53629.2021.9624390
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Speech synthesis, also known as text to speech (TTS), is a technology to convert text into sound, which is also an important technology to realize the communication between human and machine. English is an international language. It is necessary to study English speech synthesis technology. Aiming at English speech synthesis, an end-to-end speech synthesis method and system based on convolutional neural network WaveRNN and WaveNet is explained used in this paper. Experiments show that the Mean Opinion Score (MOS) of the synthesized speech is 3.32, and the speech quality is better than that of the general parametric speech synthesis system.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] On a Speech Multiple System Implementation for Speech Synthesis
    Jong Kuk Kim
    Hern Soo Hahn
    Myung Jin Bae
    [J]. Wireless Personal Communications, 2009, 49 : 533 - 543
  • [32] On a Speech Multiple System Implementation for Speech Synthesis
    Kim, Jong Kuk
    Hahn, Hern Soo
    Bae, Myung Jin
    [J]. WIRELESS PERSONAL COMMUNICATIONS, 2009, 49 (04) : 533 - 543
  • [33] Phoneme Set Design for Speech Recognition of English by Japanese
    Wang, Xiaoyun
    Zhang, Jinsong
    Nishida, Masafumi
    Yamamoto, Seiichi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2015, E98D (01): : 148 - 156
  • [34] Design and Implementation of Interactive Speech Recognizing English Dictionary
    Dev, Dipayan
    Banerjee, Pradipta
    [J]. PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING, NETWORKING AND INFORMATICS (ICACNI 2015), VOL 1, 2016, 43 : 421 - 433
  • [35] Design of Speech Recognition System
    Wang, Chao
    Zhu, Ruifei
    Jia, Hongguang
    Wei, Qun
    Jiang, Huhai
    Zhang, Tianyi
    Yu, Linyao
    [J]. 2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2013, : 1042 - 1044
  • [36] Performance Evaluation of Speech Synthesis Techniques for English Language
    Kayte, Sangramsing N.
    Mundada, Monica
    Gaikwad, Santosh
    Gawali, Bharti
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2015, VOL 2, 2016, 439 : 253 - 262
  • [37] Design and Implementation of Burmese Speech Synthesis System Based on HMM-DNN
    Liu, Mengyuan
    Yang, Jian
    [J]. PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2019, : 79 - 83
  • [38] Design of Human Face Detection and Recognition System Along With Speech Synthesis Subtitle
    Thalluril, Lakshmi Narayana
    Bosebabu, P.
    Kalavakolanu, S. R. Sastry
    Chandra, G. Roopa Krishna
    [J]. 2015 INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION, COMMUNICATION AND COMPUTATIONAL TECHNOLOGIES (ICCICCT), 2015, : 396 - 400
  • [39] SPEECH-SYNTHESIS SYSTEM
    不详
    [J]. INSTRUMENTATION TECHNOLOGY, 1975, 22 (09): : 60 - 60
  • [40] PERFORMANCE OF A SPEECH SYNTHESIS SYSTEM
    AINSWORTH, WA
    [J]. INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1974, 6 (05): : 493 - 511