A New Model-based Prosody Coder for Mandarin Speech

Cited by: 0
Authors
Chiang, Chen-Yu [1]
Hung, Yu-Ping [1]
Chen, Sin-Horng [2]
Wang, Yih-Ru [2]
Affiliations
[1] Natl Taipei Univ, Dept Commun Engn, New Taipei City, Taiwan
[2] Natl Chiao Tung Univ, Dept Elect Engn, Hsinchu 30050, Taiwan
Keywords
Prosody coding; Prosodic model;
DOI
10.1109/IIH-MSP.2013.24
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
In this paper, a novel parametric prosody coding approach for Mandarin speech is proposed. In the encoder, a hierarchical prosodic model (HPM) serves as the prosody-generating model: it analyzes the prosody of the input utterance to obtain a parametric representation of four prosodic-acoustic features (syllable pitch contour, syllable duration, syllable energy level, and syllable-juncture pause duration) for encoding. In the decoder, the four prosodic-acoustic features are reconstructed by a synthesis operation using the decoded HPM parameters. The reconstructed prosodic features are then used in an HMM-based speech synthesizer to generate the reconstructed speech. Experimental results show that the reconstructed speech has good quality at a low data rate of 114.9 bits/s for a speaker-dependent task. An informal listening test confirmed that the decoded speech sounded very fluent.
Pages: 60-63
Number of pages: 4
Related papers
50 in total
  • [1] A New Model-based Mandarin-speech Coding System
    Chiang, Chen-Yu
    Yang, Jyh-Her
    Liu, Ming-Chieh
    Wang, Yih-Ru
    Liao, Yuan-Fu
    Chen, Sin-Horng
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2572 - 2575
  • [2] LATENT PROSODY MODEL OF CONTINUOUS MANDARIN SPEECH
    Chiang, Chen-Yu
    Wang, Xiao-Dong
    Liao, Yuan-Fu
    Wang, Yih-Ru
    Chen, Sin-Horng
    Hirose, Keikichi
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 625 - +
  • [3] Prosody model in a Mandarin Text-to-Speech System based on a hierarchical approach
    Pan, NH
    Jen, WT
    Yu, SS
    Yu, MS
    Huang, SY
    Wu, MJ
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 448 - 451
  • [4] A Novel Model-based Pitch Conversion Method for Mandarin Speech
    Hwang, Hsin-Te
    Chiang, Chen-Yu
    Sung, Po-Yi
    Chen, Sin-Horng
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2611 - 2614
  • [5] ENRICHING MANDARIN SPEECH RECOGNITION BY INCORPORATING A HIERARCHICAL PROSODY MODEL
    Yang, Jyh-Her
    Liu, Ming-Chieh
    Chang, Hao-Hsiang
    Chiang, Chen-Yu
    Wang, Yih-Ru
    Chen, Sin-Horng
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 5052 - 5055
  • [6] A New Approach of Speaking Rate Modeling for Mandarin Speech Prosody
    Hsieh, Chiao-Hua
    Chiang, Chen-Yu
    Wang, Yih-Ru
    Yu, Hsiu-Min
    Chen, Sin-Horng
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 654 - 657
  • [7] Prosody Dependent Mandarin Speech Recognition
    Ni, Chong-Jia
    Liu, Wen-Ju
    Xu, Bo
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 197 - 201
  • [8] PROSODY MODELING FOR MANDARIN EXCLAMATORY SPEECH
    Jia, Huibin
    Tao, Jianhua
    ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 890 - 893
  • [9] Evaluating Prosody of Mandarin Speech for Language Learning
    Dong, Minghui
    Li, Haizhou
    Nwe, Tin Lay
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 1986 - 1989
  • [10] An Automatic Prosody Labeling Method for Mandarin Speech
    Chiang, Chen-Yu
    Yu, Hsiu-Min
    Wang, Yih-Ru
    Chen, Sin-Horng
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 725 - +