On a Robust F0 Estimation of Speech based on IRAPT using Robust TV-CAR Analysis

Cited by: 0
Authors
Hotta, Kazushi [1]
Funaki, Keiichi [2]
Affiliations
[1] Univ Ryukyus, Grad Sch Engn & Sci, Nishihara, Okinawa 90301, Japan
[2] Univ Ryukyus, Comp & Networking Ctr, Nishihara, Okinawa 90301, Japan
Source
2014 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA) | 2014
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Fundamental frequency (F0) estimation is important in speech processing tasks such as speech coding, synthesis, and recognition. Current F0 estimation methods perform well under clean conditions; however, their performance deteriorates significantly in noisy environments. As a result, F0 estimation that is robust against additive noise is in demand. We have previously proposed F0 estimation methods based on Time-Varying Complex AR (TV-CAR) analysis whose criteria include the weighted correlation of the complex residual obtained by the TV-CAR analysis and the sum of harmonics of the complex residual spectrum. Meanwhile, E. Azarov et al. have proposed IRAPT (Instantaneous RAPT), an improved version of RAPT (Robust Algorithm for Pitch Tracking) that uses instantaneous harmonics. IRAPT can achieve better estimation performance than RAPT. Since IRAPT uses a band-limited analytic signal to obtain harmonic frequencies, the complex residual signal obtained by the TV-CAR analysis can also be applied to IRAPT. In this paper, a novel F0 estimation method using the instantaneous frequency based on the robust ELS (Extended Least Squares) TV-CAR residual is proposed and evaluated.
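The abstract relies on reading an instantaneous frequency off a complex-valued (analytic) signal. The following is a minimal sketch of that single idea only, not the authors' method: it implements neither IRAPT nor TV-CAR analysis. It assumes a synthetic complex exponential as a stand-in for a band-limited analytic signal or complex residual, and the names `instantaneous_frequency` and `estimate_f0` are hypothetical.

```python
import cmath
import math
import statistics


def instantaneous_frequency(z, fs):
    """Return per-sample instantaneous frequency (Hz) of a complex signal z.

    The frequency is taken from the phase advance between consecutive
    samples; multiplying by the conjugate keeps the phase difference
    wrapped to (-pi, pi], so no explicit unwrapping is needed.
    """
    freqs = []
    for prev, cur in zip(z, z[1:]):
        dphi = cmath.phase(cur * prev.conjugate())
        freqs.append(dphi * fs / (2.0 * math.pi))
    return freqs


def estimate_f0(z, fs):
    """Crude F0 estimate: median of the instantaneous frequency track.

    Assumption for illustration only; a real tracker would add voicing
    decisions, candidate generation, and smoothing.
    """
    return statistics.median(instantaneous_frequency(z, fs))


# Synthetic demo (assumed values): a 120 Hz complex exponential at 8 kHz.
fs = 8000.0
f0_true = 120.0
z = [cmath.exp(2j * math.pi * f0_true * n / fs) for n in range(400)]
f0_est = estimate_f0(z, fs)
```

On this noise-free input the estimate should match `f0_true` up to floating-point error. With a real-valued speech signal, an analytic signal would first have to be formed (for example with a Hilbert transform) and band-limited around the harmonic of interest before this step applies.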
Pages: 4
Related papers
50 in total
  • [41] SAFE: a Statistical Algorithm for F0 Estimation for Both Clean and Noisy Speech
    Chu, Wei
    Alwan, Abeer
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2598 - 2601
  • [42] Robust method for estimating F0 of complex tone based on pitch perception of amplitude modulated signal
    Miwa, Kenichiro
    Unoki, Masashi
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2311 - 2315
  • [43] An Analogy of F0 Estimation Algorithms Using Sustained Vowel
    Karunaimathi, Prarthana, V
    Gladis, Dennis
    Balakrishnan, D.
    PROCEEDING OF THE THIRD INTERNATIONAL SYMPOSIUM ON WOMEN IN COMPUTING AND INFORMATICS (WCI-2015), 2015, : 217 - 221
  • [44] NOISE-ROBUST F0 ESTIMATION USING SNR-WEIGHTED SUMMARY CORRELOGRAMS FROM MULTI-BAND COMB FILTERS
    Tan, Lee Ngee
    Alwan, Abeer
    2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4464 - 4467
  • [45] Review of F0 modelling and generation in HMM based speech synthesis
    Yu, Kai
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 599 - 604
  • [46] Extraction of important sentences for speech summarization based on an F0 model
    Inoue, Akira
    Yamashita, Yoichi
    Acoustical Science and Technology, 2003, 24 (01) : 35 - 37
  • [47] Improving F0 Prediction Using Bidirectional Associative Memories and Syllable-Level F0 Features for HMM-based Mandarin Speech Synthesis
    Gao, Li
    Ling, Zhen-Hua
    Chen, Ling-Hui
    Dai, Li-Rong
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 275 - 279
  • [48] Robust F0 and jitter estimation in pathological voices
    Vieira, MN
    McInnes, FR
    Jack, MA
    ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 745 - 748
  • [49] Evaluation of a noise-robust multi-stream speaker verification method using F0 information
    Asami, Taichi
    Iwano, Koji
    Furui, Sadaoki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (03) : 549 - 557
  • [50] A Method for Automatically Estimating F0 Model Parameters and A Speech Re-Synthesis Tool Using F0 Model and STRAIGHT
    Sato, Shota
    Kimura, Taro
    Horiuchi, Yasuo
    Nishida, Masafumi
    Kuroiwa, Shingo
    Ichikawa, Akira
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 545 - +