Auditive learning based Chinese F0 prediction

Cited by: 0
Authors
Tao, JH [1 ]
Ni, X [1 ]
Affiliations
[1] Chinese Acad Sci, Natl Lab Pattern Recognit, Beijing, Peoples R China
Source
2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS | 2003
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper describes a new F0 model based on an auditive learning (AL) method. Focusing on the notion of prosody templates, we confirm that F0 patterns for a syllable can be extracted from the various deformations of F0 contours in spontaneous speech. An F0 template selection method with a prosody cost function (PCF) is well suited to Chinese F0 prediction. Furthermore, the AL method is used to adjust the weights of the PCF dynamically in application. Unlike other methods, the approach can give feedback on exactly which parameters are crucial in determining the successful choice of patterns. The paper also analyzes the error distribution of the F0 prediction results. Both smoothing tests and F0 range tests show that the synthesis results are very close to human speech.
Pages: 213-216
Page count: 4
Related papers
50 in total
  • [31] F0 declination of intonation groups in Spanish and in Mandarin Chinese
    Yao, Junming
    SPANISH IN CONTEXT, 2019, 16 (03) : 523 - 542
  • [32] Communicative F0 generation based on impressions
    Shao, Lu
    Greenberg, Yoko
    Sagisaka, Yoshinori
    2014 5TH IEEE CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2014, : 115 - 119
  • [33] Improving F0 Prediction Using Bidirectional Associative Memories and Syllable-Level F0 Features for HMM-based Mandarin Speech Synthesis
    Gao, Li
    Ling, Zhen-Hua
    Chen, Ling-Hui
    Dai, Li-Rong
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 275 - 279
  • [34] Development of an F0 control model based on F0 dynamic characteristics for singing-voice synthesis
    Saitou, T
    Unoki, M
    Akagi, M
    SPEECH COMMUNICATION, 2005, 46 (3-4) : 405 - 417
  • [35] Investigation of Prosodic F0 Layers in Hierarchical F0 Modeling for HMM-based Speech Synthesis
    Lei, Ming
    Wu, Yi-Jian
    Ling, Zhen-Hua
    Dai, Li-Rong
    2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 613 - +
  • [36] A Novel Model of F0 Contours Prediction for Continuous Speech
    胡文英
    祖漪清
    王志中
    Journal of Shanghai Jiaotong University, 2005, (03) : 231 - 235
  • [37] An F0 contour control model using an F0 contour codebook
    Kagoshima, Takehiko
    Morita, Masahiro
    Seto, Shigenobu
    Akamine, Masami
    Shiga, Yoshinori
    Systems and Computers in Japan, 2007, 38 (01): : 62 - 72
  • [38] Global F0 Control Parameter Prediction Based On Impressions For Communicative Prosody Generation
    Shao, Lu
    Greenberg, Yoko
    Sagisaka, Yoshinori
    2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
  • [39] Production of f0(1710), f0(1500), and f0(1370) in J/ψ hadronic decays -: art. no. 094022
    Close, FE
    Zhao, Q
    PHYSICAL REVIEW D, 2005, 71 (09): : 1 - 9
  • [40] Generating F0 contours by statistical manipulation of natural F0 shapes
    Saito, T
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (03): : 1100 - 1106