A System Design of English Speech Synthesis

Cited by: 0

Authors:

Li, Su-Mei [1,2]

Liu, Cong-Cong [3]

Yang, Yuan-Cheng [3]

Li, Xin-Guang [1]

Ma, Shan-Xian [3]

Affiliations:

[1] Guangdong Univ Foreign Studies, Lab Language Engn & Comp, Guangzhou, Peoples R China

[2] Guangdong Univ Foreign Studies, Expt Teaching Ctr, Guangzhou, Peoples R China

[3] Guangdong Univ Foreign Studies, Sch Informat Sci & Technol, Guangzhou, Peoples R China

Source:

2021 14TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2021) | 2021

Keywords:

WaveNet; WaveRNN; English Speech Synthesis; Artificial Intelligence;

DOI:

10.1109/CISP-BMEI53629.2021.9624390

Chinese Library Classification (CLC) number:

R318 [Biomedical Engineering];

Discipline classification code:

0831 ;

Abstract:

Speech synthesis, also known as text-to-speech (TTS), is a technology that converts text into sound and is an important means of communication between humans and machines. Because English is an international language, it is necessary to study English speech synthesis technology. This paper presents an end-to-end English speech synthesis method and system based on the neural network models WaveRNN and WaveNet. Experiments show that the Mean Opinion Score (MOS) of the synthesized speech is 3.32, and that the speech quality is better than that of typical parametric speech synthesis systems.

Pages: 6
