E3TTS: End-to-End Text-Based Speech Editing TTS System and Its Applications

Cited by: 0

Authors:

Liang, Zheng [1]

Ma, Ziyang [1]

Du, Chenpeng [1]

Yu, Kai [1]

Chen, Xie [1]

Affiliations:

[1] Shanghai Jiao Tong University, X-LANCE Lab, Department of Computer Science and Engineering, MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai 200240, China

Source:

IEEE/ACM Transactions on Audio Speech and Language Processing | 2024 / Vol. 32

Funding:

National Natural Science Foundation of China;

Keywords:

Speech enhancement;

DOI:

10.1109/TASLP.2024.3485466

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Pages: 4810 - 4821

Total: 50 items

[41] End-to-end Triplet Loss based Emotion Embedding System for Speech Emotion Recognition
Kumar, Puneet
Jain, Sidharth
Raman, Balasubramanian
Roy, Partha Pratim
Iwamura, Masakazu
2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8766 - 8773
[42] A study of transformer-based end-to-end speech recognition system for Kazakh language
Mamyrbayev, Orken
Oralbekova, Dina
Alimhan, Keylan
Turdalykyzy, Tolganay
Othman, Mohamed
SCIENTIFIC REPORTS, 2022, 12 (01)
[43] A study of transformer-based end-to-end speech recognition system for Kazakh language
Mamyrbayev Orken
Oralbekova Dina
Alimhan Keylan
Turdalykyzy Tolganay
Othman Mohamed
Scientific Reports, 12
[44] An End-to-End e-Election System Based on Multimodal Identification and Authentication
Ayo, Charles
Daramola, Justine
Gabriel, Obi
Sofoluwe, Adetokunbo
PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON E-GOVERNMENT, 2010, : 10 - 17
[45] Advance research in agricultural text-to-speech: the word segmentation of analytic language and the deep learning-based end-to-end system
Li, Xinxing
Ma, Diankun
Yin, Baoquan
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 180
[46] A Text Detection and Recognition System based on an End-to-End Trainable Framework from UAV Imagery
Wu, Qingtian
Zhou, Yimin
Liang, Guoyuan
2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 736 - 741
[47] E2E-DASR: End-to-end deep learning-based dysarthric automatic speech recognition
Almadhor, Ahmad
Irfan, Rizwana
Gao, Jiechao
Saleem, Nasir
Rauf, Hafiz Tayyab
Kadry, Seifedine
EXPERT SYSTEMS WITH APPLICATIONS, 2023, 222
[48] BLSTM-CRF Based End-to-End Prosodic Boundary Prediction with Context Sensitive Embeddings in A Text-to-Speech Front-End
Zheng, Yibin
Tao, Jianhua
Wen, Zhengqi
Li, Ya
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 47 - 51
[49] A New End-to-End Long-Time Speech Synthesis System Based on Tacotron2
Liu, Renyuan
Yang, Jian
Liu, Mengyuan
2019 INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING SYSTEMS (SPSS 2019), 2019, : 46 - 50
[50] Fast offline transformer-based end-to-end automatic speech recognition for real-world applications
Oh, Yoo Rhee
Park, Kiyoung
Park, Jeon Gue
ETRI JOURNAL, 2022, 44 (03) : 476 - 490
