Robust F0 estimation of speech signal using harmonicity measure based on instantaneous frequency

被引:0
|
作者
Arifianto, D [1 ]
Tanaka, T [1 ]
Masuko, T [1 ]
Kobayashi, T [1 ]
机构
[1] Tokyo Inst Technol, Interdisciplinary Grad Sch Sci & Engn, Yokohama, Kanagawa 2268502, Japan
关键词
instantaneous frequency amplitude spectrum; harmonicity measure; fundamental frequency estimation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Borrowing the notion of instantaneous frequency that was developed in the context of time-frequency signal analysis, an instantaneous frequency amplitude spectrum (IFAS) is introduced for estimating fundamental frequency of speech signal in both noiseless and adverse environments. We define harmonicity measure as a quantity that indicates degree of periodical regularity in the IFAS and that shows substantial difference between periodic signal and noise-like waveform. The harmonicity measure is applied to estimate the existence of fundamental frequency. We provide experimental examples to demonstrate the general applicability of the harmonicity measure and apply the proposed procedure to Japanese continuous speech signals. The results show that the proposed method outperforms the conventional methods with or without the presence of noise.
引用
收藏
页码:2812 / 2820
页数:9
相关论文
共 50 条
  • [31] F0 ESTIMATION FOR NOISY SPEECH BY EXPLORING TEMPORAL HARMONIC STRUCTURES IN LOCAL TIME FREQUENCY SPECTRUM SEGMENT
    Wang, Dongmei
    Hansen, John H. L.
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 6510 - 6514
  • [32] SAFE: a Statistical Algorithm for F0 Estimation for Both Clean and Noisy Speech
    Chu, Wei
    Alwan, Abeer
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 2598 - 2601
  • [33] Robust method for estimating F0 of complex tone based on pitch perception of amplitude modulated signal
    Miwa, Kenichiro
    Unoki, Masashi
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 2311 - 2315
  • [34] An Analogy of F0 Estimation Algorithms Using Sustained Vowel
    Karunaimathi, Prarthana, V
    Gladis, Dennis
    Balakrishnan, D.
    PROCEEDING OF THE THIRD INTERNATIONAL SYMPOSIUM ON WOMEN IN COMPUTING AND INFORMATICS (WCI-2015), 2015, : 217 - 221
  • [35] Speech formant frequency and pitch estimation using instantaneous complex frequency
    Kaniewska, Magdalena
    ICSES 2008 INTERNATIONAL CONFERENCE ON SIGNALS AND ELECTRONIC SYSTEMS, CONFERENCE PROCEEDINGS, 2008, : 493 - 496
  • [36] Instantaneous frequency estimation based signal extraction method
    Test Center, Mechanical Engineering College, Chongqing University, Chongqing 400044, China
    J Vib Shock, 2008, 7 (141-145):
  • [37] Statistical Regression Models for Noise Robust F0 Estimation Using Recurrent Deep Neural Networks
    Kato, Akihiro
    Kinnunen, Tomi H.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (12) : 2336 - 2349
  • [38] F0, LPC, and MFCC Analysis for Emotion Recognition Based on Speech
    Teixeira, Felipe L.
    Teixeira, Joao Paulo
    Soares, Salviano F. P.
    Pio Abreu, J. L.
    OPTIMIZATION, LEARNING ALGORITHMS AND APPLICATIONS, OL2A 2022, 2022, 1754 : 389 - 404
  • [39] A stochastic F0 contour model based on clustering and a probabilistic measure
    Yamashita, Y
    Ishida, T
    Shimadera, K
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2003, E86D (03) : 543 - 549
  • [40] Review of F0 modelling and generation in HMM based speech synthesis
    Yu, Kai
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 599 - 604