E3TTS: End-to-End Text-Based Speech Editing TTS System and Its Applications

被引：0

作者：

Liang, Zheng ^{[1
]}

Ma, Ziyang ^{[1
]}

Du, Chenpeng ^{[1
]}

Yu, Kai ^{[1
]}

Chen, Xie ^{[1
]}

机构：

[1] Shanghai Jiao Tong University, X-LANCE Lab, Department of Computer Science and Engineering, MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai,200240, China

来源：

IEEE/ACM Transactions on Audio Speech and Language Processing | 2024年 / 32卷

基金：

中国国家自然科学基金;

关键词：

Speech enhancement;

D O I：

10.1109/TASLP.2024.3485466

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

引用

页码：4810 / 4821

共 50 条

[31] Knowledge-based Linguistic Encoding for End-to-End Mandarin Text-to-Speech Synthesis
Li, Jingbei
Wu, Zhiyong
Li, Runnan
Zhi, Pengpeng
Yang, Song
Meng, Helen
INTERSPEECH 2019, 2019, : 4494 - 4498
[32] End-to-end text-to-speech synthesis with unaligned multiple language units based on attention
Aso, Masashi
Takamichi, Shinnosuke
Saruwatari, Hiroshi
INTERSPEECH 2020, 2020, : 4009 - 4013
[33] Deep-learning based end-to-end system for text reading in the wild
Harizi, Riadh
Walha, Rim
Drira, Fadoua
MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (17) : 24691 - 24719
[34] Deep-learning based end-to-end system for text reading in the wild
Riadh Harizi
Rim Walha
Fadoua Drira
Multimedia Tools and Applications, 2022, 81 : 24691 - 24719
[35] Development of CRF and CTC Based End-To-End Kazakh Speech Recognition System
Oralbekova, Dina
Mamyrbayev, Orken
Othman, Mohamed
Alimhan, Keylan
Zhumazhanov, Bagashar
Nuranbayeva, Bulbul
INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, PT I, 2022, 13757 : 519 - 531
[36] Hardware Accelerator for Transformer based End-to-End Automatic Speech Recognition System
Yamini, Shaarada D.
Mirishkar, Ganesh S.
Vuppala, Anil Kumar
Purini, Suresh
2023 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS, IPDPSW, 2023, : 93 - 100
[37] End-to-End Text-to-Speech Based on Latent Representation of Speaking Styles Using Spontaneous Dialogue
Mitsui, Kentaro
Zhao, Tianyu
Sawada, Kei
Hono, Yukiya
Nankaku, Yoshihiko
Tokuda, Keiichi
INTERSPEECH 2022, 2022, : 2328 - 2332
[38] Adaptive End-to-End Text-to-Speech Synthesis Based on Error Correction Feedback from Humans
Fujii, Kazuki
Saito, Yuki
Saruwatari, Hiroshi
PROCEEDINGS OF 2022 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2022, : 1702 - 1707
[39] Adaptive End-to-End Text-to-Speech Synthesis Based on Error Correction Feedback from Humans
Fujii, Kazuki
Saito, Yuki
Saruwatari, Hiroshi
Proceedings of 2022 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2022, 2022, : 1702 - 1707
[40] Speech Vision: An End-to-End Deep Learning-Based Dysarthric Automatic Speech Recognition System
Shahamiri, Seyed Reza
IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2021, 29 : 852 - 861

← 1 2 3 4 5 →