Probabilistic speech F0 contour model incorporating statistical vocabulary model of phrase-accent command sequence

被引:0
|
作者
Ishihara, Tatsuma [1 ]
Kameoka, Hirokazu [1 ,2 ]
Yoshizato, Kota [1 ]
Saito, Daisuke [1 ]
Sagayama, Shigeki [1 ]
机构
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Tokyo, Japan
[2] NTT Corp, NTT Commun Sci Labs, Tokyo, Japan
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We have previously proposed a generative model of speech F-0 contours, based on the discrete-time version of the Fujisaki model (a model of the mechanisim for controlling F(0)s through laryngeal muscles). One advantage of this model is that it allows us to apply statistical methods to estimate the Fujisaki-model parameters from speech F-0 contours. This paper proposes a new generative model of speech F-0 contours incorporating a vocabulary model of intonation patterns. A parameter inference algorithm for the present model is derived. We quantitatively evaluated the performance of our parameter inference algorithm.
引用
收藏
页码:1016 / 1020
页数:5
相关论文
共 36 条
  • [1] DNN-SPACE: DNN-HMM-based Generative Model of Voice F0 Contours for Statistical Phrase/Accent Command Estimation
    Hojo, Nobukatsu
    Ohsugi, Yasuhito
    Ijima, Yusuke
    Kameoka, Hirokazu
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1074 - 1078
  • [2] An F0 contour control model using an F0 contour codebook
    Kagoshima, Takehiko
    Morita, Masahiro
    Seto, Shigenobu
    Akamine, Masami
    Shiga, Yoshinori
    Systems and Computers in Japan, 2007, 38 (01): : 62 - 72
  • [3] A stochastic F0 contour model based on clustering and a probabilistic measure
    Yamashita, Y
    Ishida, T
    Shimadera, K
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (03) : 543 - 549
  • [4] FAST ALGORITHM FOR STATISTICAL PHRASE/ACCENT COMMAND ESTIMATION BASED ON GENERATIVE MODEL INCORPORATING SPECTRAL FEATURES
    Sato, Ryotaro
    Kameoka, Hirokazu
    Kashino, Kunio
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5595 - 5599
  • [5] Foreign accent in intonation patterns - A contrastive study applying a quantitative model of the F0 contour
    Mixdorff, H
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1469 - 1472
  • [6] Autoregressive Neural F0 Model for Statistical Parametric Speech Synthesis
    Wang, Xin
    Takaki, Shinji
    Yamagishi, Junichi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (08) : 1406 - 1419
  • [7] Automatic extraction of tone command parameters for the model of f0 contour generation for standard Chinese
    Gu, WT
    Hirose, K
    Fujisaki, H
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2004, E87D (05): : 1079 - 1085
  • [8] An F0 Contour Fitting Model for Singing Synthesis
    Lai, Wen-Hsing
    2009 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (ISSPIT 2009), 2009, : 113 - 117
  • [9] F0 prediction model of speech synthesis based on template and statistical method
    Tao, JH
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 497 - 504
  • [10] Identification and synthesis of Cantonese tones based on the command-response model for F0 contour generation
    Gu, WT
    Hirose, K
    Fujisaki, H
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 289 - 292