Automatic generation of prosodic structure for high quality Mandarin speech synthesis

被引:0
|
作者
Chou, FC
Tseng, CY
Lee, LS
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A key problem for today's speech synthesis technology is to automatically generate an appropriate hierarchical prosodic structure for text input and incorporate it into synthesized speech[1][2]. This paper presents a method for such a problem in Mandarin Chinese. This method uses a speech database for the training of a statistical model to generate the prosodic structure and determine prosodic parameters such as syllable duration, pause, energy and intonation. The experimental results show that an accuracy of 83.1% in the prediction of prosodic structure can be achieved. Furthermore, a Chinese text-to speech system on be developed based on the proposed prosodic structure.
引用
收藏
页码:1624 / 1627
页数:4
相关论文
共 50 条
  • [41] F0 contour of prosodic word in happy speech of mandarin
    Wang, HB
    Li, AJ
    Fang, Q
    [J]. AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION, PROCEEDINGS, 2005, 3784 : 433 - 440
  • [42] Prosodic Feature Analysis for Automatic Speech Assessment and Individual Report Generation in People with Down Syndrome
    Corrales-Astorgano, Mario
    Gonzalez-Ferreras, Cesar
    Escudero-Mancebo, David
    Cardenoso-Payo, Valentin
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (01):
  • [43] An Automatic Prosody Labeling Method for Mandarin Speech
    Chiang, Chen-Yu
    Yu, Hsiu-Min
    Wang, Yih-Ru
    Chen, Sin-Horng
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 725 - +
  • [44] Automatic Personality Perception from Speech in Mandarin
    Zhu, Minxian
    Xie, Xiang
    Zhang, Liqiang
    Wang, Jing
    [J]. 2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 309 - 313
  • [45] Automatic Emotion Recognition of Speech Signal in Mandarin
    Zhang, Sheng
    Ching, P. C.
    Kong, Fanrang
    [J]. INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1810 - +
  • [46] Automatic detection of a prosodic hierarchy in a journalistic speech corpus
    Gendrot, Cedric
    Gerdes, Kim
    Adda-Decker, Martine
    [J]. LANGUE FRANCAISE, 2016, (191): : 123 - +
  • [47] Improving Prosodic Boundaries Prediction for Mandarin Speech Synthesis by Using Enhanced Embedding Feature and Model Fusion Approach
    Zheng, Yibin
    Li, Ya
    Wen, Zhengqi
    Ding, Xingguang
    Tao, Jianhua
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3201 - 3205
  • [48] PROSODIC MODELING IN SWEDISH SPEECH SYNTHESIS
    BRUCE, G
    GRANSTROM, B
    [J]. SPEECH COMMUNICATION, 1993, 13 (1-2) : 63 - 73
  • [49] UNSUPERVISED PROSODIC PHRASE BOUNDARY LABELING OF MANDARIN SPEECH SYNTHESIS DATABASE USING CONTEXT-DEPENDENT HMM
    Yang, Chen-Yu
    Ling, Zhen-Hua
    Dai, Li-Rong
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6875 - 6879
  • [50] Automatic feature template generation for prosodic phrasing
    Liu, Fangzhou
    Zhou, You
    [J]. Journal of Software, 2012, 7 (04) : 779 - 785