Probabilistic speech F0 contour model incorporating statistical vocabulary model of phrase-accent command sequence

Authors:

Ishihara, Tatsuma [1]

Kameoka, Hirokazu [1,2]

Yoshizato, Kota [1]

Saito, Daisuke [1]

Sagayama, Shigeki [1]

Affiliations:

[1] University of Tokyo, Graduate School of Information Science and Technology, Tokyo, Japan

[2] NTT Corporation, NTT Communication Science Laboratories, Tokyo, Japan

Source:

14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), Vols 1-5 | 2013

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We have previously proposed a generative model of speech F0 contours based on a discrete-time version of the Fujisaki model (a model of the mechanism that controls F0 through the laryngeal muscles). One advantage of this model is that it allows statistical methods to be applied to estimating the Fujisaki-model parameters from speech F0 contours. This paper proposes a new generative model of speech F0 contours that incorporates a vocabulary model of intonation patterns. A parameter inference algorithm for the present model is derived, and its performance is evaluated quantitatively.
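For readers unfamiliar with the Fujisaki model mentioned in the abstract, the following is a minimal sketch of its classical deterministic form, sampled on a frame grid: log F0 is a baseline plus phrase components (impulse responses) and accent components (clipped step responses). It is not the probabilistic model or the inference algorithm proposed in the paper. The function names, the command values, the frame period, and the constants `alpha`, `beta`, and `gamma` are illustrative assumptions, not values taken from the paper.

```python
import math


def phrase_response(t, alpha=3.0):
    """Phrase control impulse response: alpha^2 * t * exp(-alpha * t) for t >= 0."""
    if t < 0:
        return 0.0
    return alpha * alpha * t * math.exp(-alpha * t)


def accent_response(t, beta=20.0, gamma=0.9):
    """Accent control step response, clipped at the ceiling gamma, for t >= 0."""
    if t < 0:
        return 0.0
    return min(1.0 - (1.0 + beta * t) * math.exp(-beta * t), gamma)


def fujisaki_log_f0(times, base_f0, phrase_cmds, accent_cmds,
                    alpha=3.0, beta=20.0, gamma=0.9):
    """Return log F0 at each time in `times` (seconds).

    phrase_cmds: list of (onset, magnitude) impulses.
    accent_cmds: list of (onset, offset, amplitude) rectangular pulses.
    """
    contour = []
    for t in times:
        y = math.log(base_f0)
        for onset, magnitude in phrase_cmds:
            y += magnitude * phrase_response(t - onset, alpha)
        for onset, offset, amplitude in accent_cmds:
            y += amplitude * (accent_response(t - onset, beta, gamma)
                              - accent_response(t - offset, beta, gamma))
        contour.append(y)
    return contour


# Hypothetical example: one phrase command and one accent command over 2 seconds.
frame_period = 0.008  # assumed frame spacing in seconds
times = [i * frame_period for i in range(250)]
log_f0 = fujisaki_log_f0(
    times,
    base_f0=100.0,                  # assumed baseline F0 in Hz
    phrase_cmds=[(0.0, 0.5)],       # (onset, magnitude)
    accent_cmds=[(0.3, 0.7, 0.4)],  # (onset, offset, amplitude)
)
f0_hz = [math.exp(v) for v in log_f0]
```

In this sketch the command timings and magnitudes are fixed by hand; the work summarized in the abstract instead treats the command sequence as a quantity to be inferred statistically from an observed F0 contour.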

Pages: 1016-1020

Number of pages: 5

