Automatic parameter extraction of fundamental frequency contours of speech based on a generative model

被引:0
|
作者
Fujisaki, H
Ohno, S
Tomita, O
机构
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The process of generating an F-0 contour from a small number of linguistically meaningful parameters, has been modeled quite accurately, and the model has been used extensively in speech synthesis. The present paper deals with the inverse problem, i.e., that of extracting the model parameters from a given contour, which can only be solved by successive approximation. This paper presents a method for deriving a first-order approximation to a given F-0 contour from the linguistic information of the utterance, and refining the approximation by Analysis-by-Synthesis. The validity of the method has been confirmed experimentally.
引用
收藏
页码:729 / 732
页数:4
相关论文
共 50 条
  • [1] Pre-processing of fundamental frequency contours of speech for automatic parameter extraction
    Fujisaki, H
    Narusawa, S
    Maruno, M
    [J]. 2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 722 - 725
  • [2] A method for automatic extraction of model parameters from fundamental frequency contours of speech
    Narusawa, S
    Minematsu, N
    Hirose, K
    Fujisaki, H
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 509 - 512
  • [3] Modeling of Fundamental Frequency Contours for HMM-based Speech Synthesis Representation of fundamental frequency contours for statistical speech synthesis
    Hirose, Keikichi
    [J]. PROCEEDINGS OF 2016 IEEE 13TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP 2016), 2016, : 171 - 176
  • [4] Improved Automatic Extraction of Generation Process Model Commands and Its use for Generating Fundamental Frequency Contours for Training HMM-based Speech Synthesis
    Hashimoto, Hiroya
    Hirose, Keikichi
    Minematsu, Nobuaki
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 458 - 461
  • [5] Generative Modeling of Voice Fundamental Frequency Contours
    Kameoka, Hirokazu
    Yoshizato, Kota
    Ishihara, Tatsuma
    Kadowaki, Kento
    Ohishi, Yasunori
    Kashino, Kunio
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2015, 23 (06) : 1042 - 1053
  • [6] ANALYSIS OF FUNDAMENTAL FREQUENCY CONTOURS IN SPEECH
    LEVITT, H
    RABINER, LR
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1971, 49 (02): : 569 - &
  • [7] A generative model of fundamental frequency contours for polysyllabic words of Thai tones
    Seresangtakul, P
    Takara, T
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 452 - 455
  • [8] VAE-SPACE: DEEP GENERATIVE MODEL OF VOICE FUNDAMENTAL FREQUENCY CONTOURS
    Tanaka, Kou
    Kameoka, Hirokazu
    Morikawa, Kazuho
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5779 - 5783
  • [9] CHARACTERIZATION OF FUNDAMENTAL-FREQUENCY CONTOURS OF SPEECH
    MAEDA, S
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1974, 56 : S33 - S33
  • [10] A Targets-based Superpositional Model of Fundamental Frequency Contours Applied to HMM-based Speech Synthesis
    Ni, Jinfu
    Shiga, Yoshinori
    Hori, Chiori
    Kidawara, Yutaka
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1051 - 1055