High-quality text-to-speech synthesis: An overview

被引:0
|
作者
Dutoit, T. [1 ]
机构
[1] Faculte Polytechnique de Mons, Mons, Belgium
关键词
D O I
暂无
中图分类号
学科分类号
摘要
引用
收藏
页码:25 / 36
相关论文
共 50 条
  • [1] An Advanced NLP Framework for High-Quality Text-to-Speech Synthesis
    Ungurean, Catalin
    Burileanu, Dragos
    2011 6TH CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2011,
  • [2] EfficientTTS: An Efficient and High-Quality Text-to-Speech Architecture
    Miao, Chenfeng
    Liang, Shuang
    Liu, Zhencheng
    Chen, Minchuan
    Ma, Jun
    Wang, Shaojun
    Xiao, Jing
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [3] PortaSpeech: Portable and High-Quality Generative Text-to-Speech
    Ren, Yi
    Liu, Jinglin
    Zhao, Zhou
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [4] High-quality prosody generation in Mandarin text-to-speech system
    Guo, Qing
    Zhang, Jie
    Katae, Nobuyuki
    Yu, Hao
    Fujitsu Scientific and Technical Journal, 2010, 46 (01): : 40 - 46
  • [5] High-Quality Prosody Generation in Mandarin Text-to-Speech System
    Guo, Qing
    Zhang, Jie
    Katae, Nobuyuki
    Yu, Hao
    FUJITSU SCIENTIFIC & TECHNICAL JOURNAL, 2010, 46 (01): : 40 - 46
  • [6] ProDiff: Progressive Fast Diffusion Model for High-Quality Text-to-Speech
    Huang, Rongjie
    Zhao, Zhou
    Liu, Huadai
    Liu, Jinglin
    Cui, Chenye
    Ren, Yi
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 2595 - 2605
  • [7] Parameter Generation Methods With Rich Context Models for High-Quality and Flexible Text-To-Speech Synthesis
    Takamichi, Shinnosuke
    Toda, Tomoki
    Shiga, Yoshinori
    Sakti, Sakriani
    Neubig, Graham
    Nakamura, Satoshi
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2014, 8 (02) : 239 - 250
  • [8] CAMNet: A controllable acoustic model for efficient, expressive, high-quality text-to-speech
    Alvarez, Jesus Monge
    Francois, Holly
    Sung, Hosang
    Choi, Seungdo
    Jeong, Jonghoon
    Choo, Kihyun
    Min, Kyoungbo
    Park, Sangjun
    APPLIED ACOUSTICS, 2022, 186
  • [9] MULTI-BAND MELGAN: FASTERWAVEFORM GENERATION FOR HIGH-QUALITY TEXT-TO-SPEECH
    Yang, Geng
    Yang, Shan
    Liu, Kai
    Fang, Peng
    Chen, Wei
    Xie, Lei
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 492 - 498
  • [10] SpikeVoice: High-Quality Text-to-Speech Via Efficient Spiking Neural Network
    Wang, Kexin
    Zhang, Jiahong
    Ren, Yong
    Yao, Man
    Di Shang
    Xu, Bo
    Li, Guoqi
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 7927 - 7940