Auditive learning based Chinese F0 prediction

被引：0

作者：

Tao, JH ^{[1
]}

Ni, X ^{[1
]}

机构：

[1] Chinese Acad Sci, Natl Lab Pattern Recognit, Beijing, Peoples R China

来源：

2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL III, PROCEEDINGS | 2003年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The paper describes a new F0 model based on auditive learning (AL) method. Being focused on the notion of prosody templates, we confirmed that F0 patterns for a syllable can be extracted from various anamorphosis of F0 contours in spontaneous speech. It is much suitable to use F0 templates selection method for Chinese F0 prediction with prosody cost function (PCF). Furthermore, an AL method is used to adjust the weight of PCF dynamically in application. Unlike other methods, the approach may give feedback as to exactly what are the crucial parameters determining the successful choice of patterns. The paper also analyzes the error distribution of the F0 predicting results. Both smoothing testing and F0 range testing show that the synthesis results are much closed to human being.

引用

页码：213 / 216

页数：4

共 50 条

[31] F0 declination of intonation groups in Spanish and in Mandarin Chinese
Yao, Junming
SPANISH IN CONTEXT, 2019, 16 (03) : 523 - 542
[32] Communicative F0 generation based on impressions
Shao, Lu
Greenberg, Yoko
Sagisaka, Yoshinori
2014 5TH IEEE CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2014, : 115 - 119
[33] Improving F0 Prediction Using Bidirectional Associative Memories and Syllable-Level F0 Features for HMM-based Mandarin Speech Synthesis
Gao, Li
Ling, Zhen-Hua
Chen, Ling-Hui
Dai, Li-Rong
2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 275 - 279
[34] Development of an F0 control model based on F0 dynamic characteristics for singing-voice synthesis
Saitou, T
Unoki, M
Akagi, M
SPEECH COMMUNICATION, 2005, 46 (3-4) : 405 - 417
[35] Investigation of Prosodic F0 Layers in Hierarchical F0 Modeling for HMM-based Speech Synthesis
Lei, Ming
Wu, Yi-Jian
Ling, Zhen-Hua
Dai, Li-Rong
2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 613 - +
[36] A Novel Model of F0 Contours Prediction for Continuous Speech
胡文英
祖漪清
王志中
JournalofShanghaiJiaotongUniversity, 2005, (03) : 231 - 235
[37] An F0 contour control model using an F0 contour codebook
Kagoshima, Takehiko
Morita, Masahiro
Seto, Shigenobu
Akamine, Masami
Shiga, Yoshinori
Systems and Computers in Japan, 2007, 38 (01): : 62 - 72
[38] Global F0 Control Parameter Prediction Based On Impressions For Communicative Prosody Generation
Shao, Lu
Greenberg, Yoko
Sagisaka, Yoshinori
2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
[39] Production of f0(1710), f0(1500), and f0(1370) in J/ψ hadronic decays -: art. no. 094022
Close, FE
Zhao, Q
PHYSICAL REVIEW D, 2005, 71 (09): : 1 - 9
[40] Generating F0 contours by statistical manipulation of natural F0 shapes
Saito, T
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2006, E89D (03): : 1100 - 1106

← 1 2 3 4 5 →