E3TTS: End-to-End Text-Based Speech Editing TTS System and Its Applications

被引:0
|
作者
Liang, Zheng [1 ]
Ma, Ziyang [1 ]
Du, Chenpeng [1 ]
Yu, Kai [1 ]
Chen, Xie [1 ]
机构
[1] Shanghai Jiao Tong University, X-LANCE Lab, Department of Computer Science and Engineering, MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai,200240, China
基金
中国国家自然科学基金;
关键词
Speech enhancement;
D O I
10.1109/TASLP.2024.3485466
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
引用
收藏
页码:4810 / 4821
相关论文
共 50 条
  • [41] End-to-end Triplet Loss based Emotion Embedding System for Speech Emotion Recognition
    Kumar, Puneet
    Jain, Sidharth
    Raman, Balasubramanian
    Roy, Partha Pratim
    Iwamura, Masakazu
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 8766 - 8773
  • [42] A study of transformer-based end-to-end speech recognition system for Kazakh language
    Mamyrbayev, Orken
    Oralbekova, Dina
    Alimhan, Keylan
    Turdalykyzy, Tolganay
    Othman, Mohamed
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [43] A study of transformer-based end-to-end speech recognition system for Kazakh language
    Mamyrbayev Orken
    Oralbekova Dina
    Alimhan Keylan
    Turdalykyzy Tolganay
    Othman Mohamed
    Scientific Reports, 12
  • [44] An End-to-End e-Election System Based on Multimodal Identification and Authentication
    Ayo, Charles
    Daramola, Justine
    Gabriel, Obi
    Sofoluwe, Adetokunbo
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON E-GOVERNMENT, 2010, : 10 - 17
  • [45] Advance research in agricultural text-to-speech: the word segmentation of analytic language and the deep learning-based end-to-end system
    Li, Xinxing
    Ma, Diankun
    Yin, Baoquan
    COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2021, 180
  • [46] A Text Detection and Recognition System based on an End-to-End Trainable Framework from UAV Imagery
    Wu, Qingtian
    Zhou, Yimin
    Liang, Guoyuan
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 736 - 741
  • [47] E2E-DASR: End-to-end deep learning-based dysarthric automatic speech recognition
    Almadhor, Ahmad
    Irfan, Rizwana
    Gao, Jiechao
    Saleem, Nasir
    Rauf, Hafiz Tayyab
    Kadry, Seifedine
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 222
  • [48] BLSTM-CRF Based End-to-End Prosodic Boundary Prediction with Context Sensitive Embeddings in A Text-to-Speech Front-End
    Zheng, Yibin
    Tao, Jianhua
    Wen, Zhengqi
    Li, Ya
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 47 - 51
  • [49] A New End-to-End Long-Time Speech Synthesis System Based on Tacotron2
    Liu, Renyuan
    Yang, Jian
    Liu, Mengyuan
    2019 INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING SYSTEMS (SPSS 2019), 2019, : 46 - 50
  • [50] Fast offline transformer-based end-to-end automatic speech recognition for real-world applications
    Oh, Yoo Rhee
    Park, Kiyoung
    Park, Jeon Gue
    ETRI JOURNAL, 2022, 44 (03) : 476 - 490