F0 estimation of speech based on IRAPT using WLP-based TV-CAR analysis

被引:0
|
作者
Shan, Wei [1 ]
Funaki, Keiichi [2 ]
机构
[1] Univ Ryukyus, Grad Sch Engn & Sci, Nishihara, Okinawa, Japan
[2] Univ Ryukyus, C&N Ctr, Nishihara, Okinawa, Japan
关键词
F-0; estimation; IRAPT; WLP; complex analysis; analytic signal; LINEAR PREDICTION;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Fundamental frequency (F-0) estimation plays an important role in speech processing such as speech coding, synthesis, recognition and so on. Although a present F-0 estimation method performs well under clean condition, the performance deteriorates significantly in noisy environment. For this reason robust F-0 estimation against additive noise is demanded. We have previously proposed F-0 estimation methods based on Time-Varying Complex AR (TV-CAR) analysis whose criterion is the weighted correlation of the complex residual obtained by the TV-CAR analysis, sum of the harmonics for the complex residual spectrum, or so on. On the other hand, E. Azarov et al. have proposed an improved method of RAPT (Robust Algorithm for Pitch Tracking) using an instantaneous harmonics that is called IRAPT (Instantaneous RAPT). The IRAPT can perform better estimation than RAPT. Since IRAPT uses band-limited analytic signal to obtain harmonic frequencies, the complex residual signal obtained by the TV-CAR analysis can also be applied to the IRAPT. In this paper, novel F-0 estimation method using the instantaneous frequency based on the robust WLP (Weighted Linear Prediction) TV-CAR residual is proposed and evaluated.
引用
收藏
页数:4
相关论文
共 50 条
  • [31] Continuous F0 Modeling for HMM Based Statistical Parametric Speech Synthesis
    Yu, Kai
    Young, Steve
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (05): : 1071 - 1079
  • [32] PROBABLISTIC MODELLING OF F0 IN UNVOICED REGIONS IN HMM BASED SPEECH SYNTHESIS
    Yu, K.
    Toda, T.
    Gasic, M.
    Keizer, S.
    Mairesse, F.
    Thomson, B.
    Young, S.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3773 - +
  • [33] F0 prediction model of speech synthesis based on template and statistical method
    Tao, JH
    TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2004, 3206 : 497 - 504
  • [34] A Hierarchical F0 Modeling Method for HMM-based Speech Synthesis
    Lei, Ming
    Wu, Yi-Jian
    Soong, Frank K.
    Ling, Zhen-Hua
    Dai, Li-Rong
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2170 - +
  • [35] AN F0 MODELING TECHNIQUE BASED ON PROSODIC EVENTS FOR SPONTANEOUS SPEECH SYNTHESIS
    Koriyama, Tomoki
    Nose, Takashi
    Kobayashi, Takao
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4589 - 4592
  • [36] Perception-based Automatic Approximation of F0 Contours in Cantonese Speech
    Li, Yujia
    Lee, Tan
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 1425 - 1428
  • [37] Using Noisy Speech to Study the Robustness of a Continuous F0 Modelling Method in HMM-based Speech Synthesis
    Ogbureke, Kalu U.
    Cabral, Joao P.
    Carson-Berndsen, Julie
    PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON SPEECH PROSODY, VOLS I AND II, 2012, : 67 - 70
  • [38] Prediction of F0 Based on Articulatory Features Using DNN
    Zhao, Cenxi
    Wang, Longbiao
    Dang, Jianwu
    Yu, Ruiguo
    STUDIES ON SPEECH PRODUCTION, 2018, 10733 : 58 - 67
  • [39] F0 Modeling in HMM-Based Speech Synthesis System using Deep Belief Network
    Mukherjee, Sankar
    Mandal, Shyamal Kumar Das
    2014 17TH ORIENTAL CHAPTER OF THE INTERNATIONAL COMMITTEE FOR THE CO-ORDINATION AND STANDARDIZATION OF SPEECH DATABASES AND ASSESSMENT TECHNIQUES (COCOSDA), 2014,
  • [40] Whisper to Normal Speech Based on Deep Neural Networks with MCC and F0 Features
    Lian, Hailun
    Hu, Yuting
    Zhou, Jian
    Wang, Huabin
    Tao, Liang
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,