E3TTS: End-to-End Text-Based Speech Editing TTS System and Its Applications

Authors
Liang, Zheng [1]
Ma, Ziyang [1]
Du, Chenpeng [1]
Yu, Kai [1]
Chen, Xie [1]
Affiliation
[1] Shanghai Jiao Tong University, X-LANCE Lab, Department of Computer Science and Engineering, MoE Key Lab of Artificial Intelligence, AI Institute, Shanghai 200240, China
Funding
National Natural Science Foundation of China
Keywords
Speech enhancement
DOI
10.1109/TASLP.2024.3485466
Chinese Library Classification
O42 [Acoustics]
Discipline classification codes
070206; 082403
Pages: 4810-4821
Related papers
50 in total
  • [21] FPETS : Fully Parallel End-to-End Text-to-Speech System
    Ma, Dabiao
    Su, Zhiba
    Wang, Wenxuan
    Lu, Yuhao
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 8457 - 8463
  • [22] Effective Emotion Transplantation in an End-to-End Text-to-Speech System
    Joo, Young-Sun
    Bae, Hanbin
    Kim, Young-Ik
    Cho, Hoon-Young
    Kang, Hong-Goo
    IEEE ACCESS, 2020, 8 : 161713 - 161719
  • [23] TTS-SA (A Text-to-Speech System based on Standard Arabic)
    Hanane, Tebbi
    Maamar, Hamadouche
    Hamid, Azzoune
    2014 FOURTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION AND COMMUNICATION TECHNOLOGY AND ITS APPLICATIONS (DICTAP), 2014, : 337 - 341
  • [24] E2EG: End-to-End Node Classification Using Graph Topology and Text-based Node Attributes
    Dinh, Tu Anh
    den Boef, Jeroen
    Cornelisse, Joran
    Groth, Paul
    2023 23RD IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS, ICDMW 2023, 2023, : 1084 - 1091
  • [25] ARM based implementation of Text-To-Speech (TTS) for real time Embedded System
    Rawoof, Abdul
    Kulesh
    Ray, Kailash Chandra
    2014 FIFTH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2014), 2014, : 192 - 196
  • [26] EXPLICIT ALIGNMENT OF TEXT AND SPEECH ENCODINGS FOR ATTENTION-BASED END-TO-END SPEECH RECOGNITION
    Drexler, Jennifer
    Glass, James
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 913 - 919
  • [27] End-to-end attack on text-based CAPTCHAs based on cycle-consistent generative adversarial network
    Li, Chunhui
    Chen, Xingshu
    Wang, Haizhou
    Wang, Peiming
    Zhang, Yu
    Wang, Wenxian
    NEUROCOMPUTING, 2021, 433 : 223 - 236
  • [28] End-to-end speech recognition system based on improved CLDNN structure
    Feng, Yujie
    Zhang, Yi
    Xu, Xuan
    PROCEEDINGS OF 2019 IEEE 8TH JOINT INTERNATIONAL INFORMATION TECHNOLOGY AND ARTIFICIAL INTELLIGENCE CONFERENCE (ITAIC 2019), 2019, : 538 - 542
  • [29] END-TO-END TEXT-TO-SPEECH USING LATENT DURATION BASED ON VQ-VAE
    Yasuda, Yusuke
    Wang, Xin
    Yamagishi, Junichi
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5694 - 5698
  • [30] Parrotron: An End-to-End Speech-to-Speech Conversion Model and its Applications to Hearing-Impaired Speech and Speech Separation
    Biadsy, Fadi
    Weiss, Ron J.
    Moreno, Pedro J.
    Kanevsky, Dimitri
    Jia, Ye
    INTERSPEECH 2019, 2019, : 4115 - 4119