E3TTS: End-to-End Text-Based Speech Editing TTS System and Its Applications

被引：0

作者：

Liang, Zheng ^{[1
]}

Ma, Ziyang ^{[1
]}

Du, Chenpeng ^{[1
]}

Yu, Kai ^{[1
]}

Chen, Xie ^{[1
]}

机构：

[1] Shanghai Jiao Tong University, X-LANCE Lab, Department of Computer Science and Engineering, MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai,200240, China

来源：

IEEE/ACM Transactions on Audio Speech and Language Processing | 2024年 / 32卷

基金：

中国国家自然科学基金;

关键词：

Speech enhancement;

D O I：

10.1109/TASLP.2024.3485466

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

引用

页码：4810 / 4821

共 50 条

[1] Emotion selectable end-to-end text-based speech editing
Wang, Tao
Yi, Jiangyan
Fu, Ruibo
Tao, Jianhua
Wen, Zhengqi
Zhang, Chu Yuan
ARTIFICIAL INTELLIGENCE, 2024, 329
[2] A Novel End-to-End Turkish Text-to-Speech (TTS) System via Deep Learning
Oyucu, Saadin
ELECTRONICS, 2023, 12 (08)
[3] SR-TTS: a rhyme-based end-to-end speech synthesis system
Yao, Yihao
Liang, Tao
Feng, Rui
Shi, Keke
Yu, Junxiao
Wang, Wei
Li, Jianqing
FRONTIERS IN NEUROROBOTICS, 2024, 18
[4] SANE-TTS: Stable And Natural End-to-End Multilingual Text-to-Speech
Cho, Hyunjae
Jung, Wonbin
Lee, Junhyeok
Woo, Sang Hoon
INTERSPEECH 2022, 2022, : 1 - 5
[5] Prosody-TTS: An End-to-End Speech Synthesis System with Prosody Control
Pamisetty, Giridhar
Murty, K. Sri Rama
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2023, 42 (01) : 361 - 384
[6] Prosody-TTS: An End-to-End Speech Synthesis System with Prosody Control
Giridhar Pamisetty
K. Sri Rama Murty
Circuits, Systems, and Signal Processing, 2023, 42 : 361 - 384
[7] CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing
Wang, Tao
Yi, Jiangyan
Fu, Ruibo
Tao, Jianhua
Wen, Zhengqi
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2241 - 2254
[8] CONTEXT-AWARE MASK PREDICTION NETWORK FOR END-TO-END TEXT-BASED SPEECH EDITING
Wang, Tao
Yi, Jiangyan
Deng, Liqun
Fu, Ruibo
Tao, Jianhua
Wen, Zhengqi
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6082 - 6086
[9] Boosting subjective quality of Arabic text-to-speech (TTS) using end-to-end deep architecture
Fahmy, Fady K.
Abbas, Hazem M.
Khalil, Mahmoud, I
INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2022, 25 (01) : 79 - 88
[10] Boosting subjective quality of Arabic text-to-speech (TTS) using end-to-end deep architecture
Fady K. Fahmy
Hazem M. Abbas
Mahmoud I. Khalil
International Journal of Speech Technology, 2022, 25 : 79 - 88

← 1 2 3 4 5 →