Advancements in Expressive Speech Synthesis: a Review

Cited by: 0
Authors
Alwaisi, Shaimaa [1]
Nemeth, Geza [1]
Affiliations
[1] Budapest Univ Technol & Econ, Fac Elect Engn & Informat, Dept Telecommun & Media Informat, Budapest, Hungary
Source
INFOCOMMUNICATIONS JOURNAL | 2024 / Vol. 16 / Issue 01
Keywords
Speech style; Expressivity; Emotional speech; Expressive TTS; Prosody modification; Multi-lingual and multi-speaker TTS; SPEAKER ADAPTATION; VOICE CONVERSION; TEXT; MODEL; TTS
DOI
10.36244/ICJ.2024.1.5
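Chinese Library Classification (CLC) number
TN [Electronic technology, communication technology];
Discipline classification code
0809;
Abstract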
In recent years, we have witnessed a fast and widespread acceptance of speech synthesis technology, leading to the transition toward a society characterized by a strong desire to incorporate these applications in daily life. We provide a comprehensive survey of recent advancements in the field of expressive Text-To-Speech (TTS) systems. Among the different methods of representing expressivity, this paper focuses on the development of expressive TTS systems, emphasizing the methodologies employed to enhance the quality and expressiveness of synthetic speech, such as style transfer and improving speaker variability. After that, we point out some of the subjective and objective metrics that are used to evaluate the quality of synthesized speech. Finally, we turn to child speech synthesis, a domain that has been neglected for some time. This underscores that the field of research in children's speech synthesis is still wide open for exploration and development. Overall, this paper presents a comprehensive overview of historical and contemporary trends and future directions in speech synthesis research.
Pages: 35-46
Page count: 12
Related papers
50 in total
  • [1] Expressive speech synthesis: A review
    Govind D.
    Prasanna S.R.M.
    [J]. International Journal of Speech Technology, 2013, 16 (2) : 237 - 260
  • [2] Towards Expressive Speech Synthesis: Analysis and Modeling of Expressive Speech
    Raptis, Spyros
    Karabetsos, Sotiris
    Chalamandaris, Aimilios
    Tsiakoulis, Pirros
    [J]. 2014 5th IEEE Conference on Cognitive Infocommunications (CogInfoCom), 2014, : 461 - 465
  • [3] Speech Variability Compensation for Expressive Speech Synthesis
    Chen, Yan-You
    Kuan, Ta-Wen
    Tsai, Chun-Yu
    Wang, Jhing-Fa
    Chang, Chia-Hao
    [J]. 1ST INTERNATIONAL CONFERENCE ON ORANGE TECHNOLOGIES (ICOT 2013), 2013, : 210 - 213
  • [4] EXPRESSIVE SPEECH SYNTHESIS FOR CRITICAL SITUATIONS
    Rusko, Milan
    Darjaa, Sakhia
    Trnka, Marian
    Sabo, Robert
    Ritomsk, Marian
    [J]. COMPUTING AND INFORMATICS, 2014, 33 (06) : 1312 - 1332
  • [5] ARTICULATORY FEATURES FOR EXPRESSIVE SPEECH SYNTHESIS
    Black, Alan W.
    Bunnell, H. Timothy
    Dou, Ying
    Muthukumar, Prasanna Kumar
    Metze, Florian
    Perry, Daniel
    Polzehl, Tim
    Prahallad, Kishore
    Steidl, Stefan
    Vaughn, Callie
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4005 - 4008
  • [6] Expressive speech: Production, perception and application to speech synthesis
    Erickson, Donna
    [J]. ACOUSTICAL SCIENCE AND TECHNOLOGY, 2005, 26 (04) : 317 - 325
  • [7] Controllable Emphatic Speech Synthesis based on Forward Attention for Expressive Speech Synthesis
    Liu, Liangqi
    Hu, Jiankun
    Wu, Zhiyong
    Yang, Song
    Yang, Songfan
    Jia, Jia
    Meng, Helen
    [J]. 2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 410 - 414
  • [8] Specifying affect and emotion for expressive speech synthesis
    Campbell, N
    [J]. COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, 2004, 2945 : 395 - 406
  • [9] Prosody modelling of Spanish for expressive speech synthesis
    Iriondo, Ignasi
    Socoro, Joan Claudi
    Alias, Francesc
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 821 - +
  • [10] Editorial -: Special section on expressive speech synthesis
    Campbell, Nick
    Hamza, Wael
    Hoege, Harald
    Tao, Jianhua
    Bailly, Gerard
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (04) : 1097 - 1098