iPEEH: Improving pitch estimation by enhancing harmonics

Cited by: 4
Authors
Wu, Kebin [1]
Zhang, David [2, 3]
Lu, Guangming [4]
Affiliations
[1] Tsinghua Univ, Grad Sch Shenzhen, Dept Elect Engn, Beijing 100084, Peoples R China
[2] Hong Kong Polytech Univ, Biometr Res Ctr, Kowloon, Hong Kong, Peoples R China
[3] Hong Kong Polytech Univ, Dept Comp, Kowloon, Hong Kong, Peoples R China
[4] Harbin Inst Technol, Shenzhen Grad Sch, Biocomp Res Ctr, Shenzhen 518055, Peoples R China
Keywords
Enhancement; Fundamental frequency detection; Harmonics; Improvement; Pitch; TO-NOISE RATIO; PATHOLOGICAL VOICES; WAVELET TRANSFORM; SPEECH; RECOGNITION; ROBUST; MUSIC; AUTOCORRELATION; ALGORITHM; SPECTRUM;
DOI
10.1016/j.eswa.2016.08.018
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Pitch estimation is quite crucial to many applications. Although a number of estimation methods working in different domains have been put forward, there are still demands for improvement, especially for noisy speech. In this paper, we present iPEEH, a general technique to raise performance of pitch estimators by enhancing harmonics. By analysis and experiments, it is found that missing and submerged harmonics are the root causes for failures of many pitch detectors. Hence, we propose to enhance the harmonics in spectrum before implementing the pitch detection. One enhancement algorithm that mainly applies the square operation to regenerate harmonics is presented in detail, including the theoretical analysis and implementation. Four speech databases with 11 types of additive noise and 5 noise levels are utilized in assessment. We compare the performance of algorithms before and after using iPEEH. Experimental results indicate that the proposed iPEEH can effectively reduce the detection errors. In some cases, the error rate reductions are higher than 20%. In addition, the advantage of iPEEH is manifold since it is demonstrated in experiments that the iPEEH is effective for various noise types, noise levels, multiple basic frequency-based estimators, and two audio types. Through this work, we investigated the underlying reasons for pitch detection failures and presented a novel direction for pitch detection. Besides, this approach, a preprocessing step in essence, indicates the significance of preprocessing for any intelligent systems. © 2016 Elsevier Ltd. All rights reserved.
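The abstract's remark that a square operation can regenerate harmonics can be illustrated with a small sketch. This is not the paper's implementation: the test signal, the sample rate, and the helper names (`tone_magnitude`, `missing_fundamental_frame`, `square_enhance`) are all assumptions made for illustration. The sketch builds a frame that contains only the 2nd, 3rd, and 4th harmonics of a 200 Hz pitch, squares it, and measures the spectral magnitude at the fundamental before and after.

```python
import math


def tone_magnitude(signal, freq_hz, sample_rate):
    """Normalized DFT magnitude of `signal` at one frequency (amplitude A gives A/2)."""
    re = sum(x * math.cos(2 * math.pi * freq_hz * n / sample_rate)
             for n, x in enumerate(signal))
    im = sum(x * math.sin(2 * math.pi * freq_hz * n / sample_rate)
             for n, x in enumerate(signal))
    return math.hypot(re, im) / len(signal)


def missing_fundamental_frame(f0_hz, sample_rate, num_samples, harmonics=(2, 3, 4)):
    """Hypothetical test frame: unit-amplitude harmonics of f0, without f0 itself."""
    return [
        sum(math.cos(2 * math.pi * k * f0_hz * n / sample_rate) for k in harmonics)
        for n in range(num_samples)
    ]


def square_enhance(signal):
    """Square each sample, then remove the mean (the DC term created by squaring)."""
    squared = [x * x for x in signal]
    mean = sum(squared) / len(squared)
    return [x - mean for x in squared]


# Assumed parameters: 8 kHz sampling, 200 Hz pitch, 400 samples (10 full periods),
# so every component falls exactly on a DFT bin and there is no leakage.
SAMPLE_RATE = 8000
F0 = 200.0
frame = missing_fundamental_frame(F0, SAMPLE_RATE, 400)

before = tone_magnitude(frame, F0, SAMPLE_RATE)
after = tone_magnitude(square_enhance(frame), F0, SAMPLE_RATE)
print(f"magnitude at f0 before squaring: {before:.6f}")
print(f"magnitude at f0 after squaring:  {after:.6f}")
```

Squaring multiplies each pair of harmonics together, and the product of two cosines contains a term at their difference frequency. Adjacent harmonics differ by exactly f0, so the squared frame gains a component at the fundamental that the original frame lacked. In this noise-free toy case the magnitude at f0 rises from essentially zero to about 1.0; how well the effect survives in noisy speech is what the paper evaluates.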
Pages: 317-329
Page count: 13
Related papers
50 in total
  • [1] Improving Pitch Detection through Emphasized Harmonics in Time-Domain
    Park, Hyung-Woo
    Kim, Myung-Sook
    Bae, Myung-Jin
    [J]. COMPUTER APPLICATIONS FOR DATABASE, EDUCATION, AND UBIQUITOUS COMPUTING, 2012, 352 : 184 - +
  • [2] Joint Robust Voicing Detection and Pitch Estimation Based on Residual Harmonics
    Drugman, Thomas
    Alwan, Abeer
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1984 - +
  • [3] Robust pitch estimation with harmonics enhancement in noisy environments based on instantaneous frequency
    Abe, T
    Kobayashi, T
    Imai, S
    [J]. ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, 1996, : 1277 - 1280
  • [4] HARMONICS ESTIMATION BASED ON INSTANTANEOUS FREQUENCY AND ITS APPLICATION TO PITCH DETERMINATION OF SPEECH
    ABE, T
    KOBAYASHI, T
    IMAI, S
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 1995, E78D (09) : 1188 - 1194
  • [5] Improving the Accuracy and the Robustness of Harmonic Model for Pitch Estimation
    Asgari, Meysam
    Shafran, Izhak
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1935 - 1939
  • [6] PITCH DETERMINATION BY MEASUREMENT OF HARMONICS
    MILLER, RL
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1968, 44 (01) : 390 - &
  • [7] Pitch Estimation Algorithm for Narrowband Speech Signal using Phase Differences between Harmonics
    Hosoda, Yuya
    Kawamura, Arata
    Iiguni, Youji
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 920 - 925
  • [8] Improving pitch estimation for efficient multiband excitation coding of speech
    Chan, CF
    Yu, EWM
    [J]. ELECTRONICS LETTERS, 1996, 32 (10) : 870 - 872
  • [9] Sensor fusion for improving the estimation of roll and pitch for an agricultural sprayer
    Khot, L. R.
    Tang, L.
    Steward, B. L.
    Han, S.
    [J]. BIOSYSTEMS ENGINEERING, 2008, 101 (01) : 13 - 20
  • [10] Infants' perception of pitch: Number of harmonics
    Clarkson, MG
    Martin, RL
    Miciek, SG
    [J]. INFANT BEHAVIOR & DEVELOPMENT, 1996, 19 (02) : 191 - 197