Auditive learning based Chinese F0 prediction

被引:0
|
作者
Tao, JH [1 ]
Ni, X [1 ]
机构
[1] Chinese Acad Sci, Natl Lab Pattern Recognit, Beijing, Peoples R China
来源
2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS | 2003年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper describes a new F0 model based on auditive learning (AL) method. Being focused on the notion of prosody templates, we confirmed that F0 patterns for a syllable can be extracted from various anamorphosis of F0 contours in spontaneous speech. It is much suitable to use F0 templates selection method for Chinese F0 prediction with prosody cost function (PCF). Furthermore, an AL method is used to adjust the weight of PCF dynamically in application. Unlike other methods, the approach may give feedback as to exactly what are the crucial parameters determining the successful choice of patterns. The paper also analyzes the error distribution of the F0 predicting results. Both smoothing testing and F0 range testing show that the synthesis results are much closed to human being.
引用
收藏
页码:213 / 216
页数:4
相关论文
共 50 条
  • [21] Determination of hadronic partial widths for scalar-isoscalar resonances f0(980), f0(1300), f0(1500), f0(1750) and the broad state f0(1530+90-250)
    Anisovich, VV
    Nikonov, VA
    Sarantsev, AV
    PHYSICS OF ATOMIC NUCLEI, 2002, 65 (08) : 1545 - 1552
  • [22] A mixing scheme for the structure of f0(600) and f0(1370)
    Xia, Zheng-Tong
    Zuo, Wei
    NUCLEAR PHYSICS A, 2010, 848 (3-4) : 317 - 329
  • [23] Proximity of f0(1500) and f0(1710) to the scalar glueball
    Fariborz, Amir H.
    Azizi, Azizollah
    Asrar, Abdorreza
    PHYSICAL REVIEW D, 2015, 92 (11):
  • [24] Dispersive analysis on the f0(600) and f0(980) resonances in γγ→π+π-, π0π0 processes
    Mao, Yu
    Wang, Xuan-Gong
    Zhang, Ou
    Zheng, H. Q.
    Zhou, Z. Y.
    PHYSICAL REVIEW D, 2009, 79 (11):
  • [25] Multiband statistical learning for F0 estimation in speech
    Sha, F
    Burgoyne, JA
    Saul, LK
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: DESIGN AND IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS INDUSTRY TECHNOLOGY TRACKS MACHINE LEARNING FOR SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING SIGNAL PROCESSING FOR EDUCATION, 2004, : 661 - 664
  • [26] Quark-gluonium content of the scalar-isoscalar states f0(980), f0(1300), f0(1500), f0(1750), and f0(1420−70+150) from hadronic decays
    V. V. Anisovich
    V. A. Nikonov
    A. V. Sarantsev
    Physics of Atomic Nuclei, 2003, 66 : 741 - 754
  • [27] Quark-gluonium content of the scalar-isoscalar states f0(980), f0(1300), f0(1500), f0(1750), and f0(1420+150-70) from hadronic decays
    Anisovich, VV
    Nikonov, VA
    Sarantsev, AV
    PHYSICS OF ATOMIC NUCLEI, 2003, 66 (04) : 741 - 754
  • [28] F0 downtrends
    HeidarZadeh, S
    Naylor, P
    ICSP '96 - 1996 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, PROCEEDINGS, VOLS I AND II, 1996, : 797 - 800
  • [29] On the Intersyllabic F0 Transition and Its Perception in Standard Chinese
    林茂灿
    SocialSciencesinChina, 1996, (04) : 152 - 172
  • [30] 标量介子f0(1370),f0(1500)和f0(1710)的混合与衰变
    陈建兴
    张立梅
    夏环宇
    辽宁师范大学学报(自然科学版), 2008, 31 (04) : 411 - 415