A New Approach of Speaking Rate Modeling for Mandarin Speech Prosody

被引:0
|
作者
Hsieh, Chiao-Hua [1 ]
Chiang, Chen-Yu [1 ]
Wang, Yih-Ru [1 ]
Yu, Hsiu-Min
Chen, Sin-Horng [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Elect Engn, Hsinchu, Taiwan
关键词
speaking rate; prosody modeling;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A new approach of Mandarin-speech prosody modeling to consider the effects of speaking rate is proposed. The approach is a modification of our previous prosody labeling and modeling method to take speaking rate as a continuous independent variable and let prosodic-acoustic features and some parameters of prosodic models depend on it in order to count its influences. A speaking rate-dependent hierarchical prosodic model is hence constructed from four speech corpora of a single female speaker with fast, normal, medium and slow speaking rates. An analysis of the effects of speaking rate on the model parameters showed that they agreed well with our prior knowledge. So, the proposed approach provides a systematic and effective way to quantify the effects of speaking rate on Mandarin-speech prosody.
引用
收藏
页码:654 / 657
页数:4
相关论文
共 50 条
  • [1] Modeling of Speaking Rate Influences on Mandarin Speech Prosody and Its Application to Speaking Rate-controlled TTS
    Chen, Sin-Horng
    Hsieh, Chiao-Hua
    Chiang, Chen-Yu
    Hsiao, Hsi-Chun
    Wang, Yih-Ru
    Liao, Yuan-Fu
    Yu, Hsiu-Min
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (07) : 1158 - 1171
  • [2] An Investigation on the Mandarin Prosody of a Parallel Multi-Speaking Rate Speech Corpus
    Chiang, Chen-Yu
    Tang, Cheng-Chang
    Yu, Hsiu-Min
    Wang, Yih-Ru
    Chen, Sin-Horng
    [J]. ORIENTAL COCOSDA 2009 - INTERNATIONAL CONFERENCE ON SPEECH DATABASE AND ASSESSMENTS, 2009, : 148 - +
  • [3] PROSODY MODELING FOR MANDARIN EXCLAMATORY SPEECH
    Jia, Huibin
    Tao, Jianhua
    [J]. ICME: 2009 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-3, 2009, : 890 - 893
  • [4] Hierarchical prosody modeling for Mandarin spontaneous speech
    Lin, Cheng-Hsien
    You, Chung-Long
    Chiang, Chen-Yu
    Wang, Yih-Ru
    Chen, Sin-Horng
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2019, 145 (04): : 2576 - 2596
  • [5] A new duration modeling approach for Mandarin speech
    Chen, SH
    Lai, WH
    Wang, YR
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (04): : 308 - 320
  • [6] Unsupervised joint prosody labeling and modeling for Mandarin speech
    Chiang, Chen-Yu
    Chen, Sin-Horng
    Yu, Hsiu-Min
    Wang, Yih-Ru
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 125 (02): : 1164 - 1183
  • [7] Prosody-dependent Acoustic Modeling for Mandarin Speech Recognition
    Chiu, Tzu-Hsuan
    Chiang, Chen-Yu
    Liao, Yuan-Fu
    Yang, Jyh-Her
    Wang, Yih-Ru
    Chen, Sin-Horng
    [J]. PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SPEECH PROSODY, VOLS I AND II, 2012, : 139 - 142
  • [8] A New Model-based Prosody Coder for Mandarin Speech
    Chiang, Chen-Yu
    Hung, Yu-Ping
    Chen, Sin-Horng
    Wang, Yih-Ru
    [J]. 2013 NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2013), 2013, : 60 - 63
  • [9] Prosody Dependent Mandarin Speech Recognition
    Ni, Chong-Jia
    Liu, Wen-Ju
    Xu, Bo
    [J]. 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 197 - 201
  • [10] An Exploration of Local Speaking Rate Variations in Mandarin Read Speech
    Liou, Guan-Tin
    Chiang, Chen-Yu
    Wang, Yih-Ru
    Chen, Sin-Horng
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 42 - 46