USE OF PITCH CONTINUITY FOR ROBUST SPEECH ACTIVITY DETECTION

被引:0
|
作者
Shao, Yiwen [1 ,2 ]
Lin, Qiguang [1 ]
机构
[1] Baihu Technol Co Ltd, Guangzhou, Guangdong, Peoples R China
[2] Johns Hopkins Univ, Ctr Language & Speech Proc, Baltimore, MD 21218 USA
关键词
autocorrelation function; speech activity detection; pitch continuity; pitch detection;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Speech activity detection (SAD) is an important component for various speech processing applications and has been researched extensively recently. The pitch continuity, a significant characteristic of speech, however, has not successfully played a role in existing SAD methods. In this work, we propose a novel way to integrate the pitch continuity with pitch-related features. Practice is carried out through the Combo-SAD approach: We examine three consecutive frames and assume that they all have the same pitch as the center frame due to pitch continuity. Corresponding feature values are recomputed at the adjusted pitch location and then used in the final expression. The new combo feature is evaluated with various types of additive noise at different signal-to-noise ratios (SNR). The results show that the new feature leads to better SAD performance (with an up to 39.3% relative improvement on miss rate compared to Combo-SAD). We also introduce a novel variant of the underlying autocorrelation function and illustrate how it can improve the accuracy of pitch detection.
引用
收藏
页码:5534 / 5538
页数:5
相关论文
共 50 条
  • [31] Use of pitch and formant analysis in speech biometry
    Solov'eva E.S.
    Konyshev V.A.
    Selishchev S.V.
    [J]. Biomedical Engineering, 2007, 41 (1) : 34 - 38
  • [32] On the Use of Pitch Features for Disordered Speech Recognition
    Liu, Shansong
    Hu, Shoukang
    Liu, Xunying
    Meng, Helen
    [J]. INTERSPEECH 2019, 2019, : 4130 - 4134
  • [33] Robust Speech/Non-Speech Discrimination Based on Pitch Estimation for Mobile Robots
    Grondin, Francois
    Michaud, Francois
    [J]. 2016 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2016, : 1650 - 1655
  • [34] Robust noise detection for speech detection and enhancement
    Garner, NR
    Barrett, PA
    Howard, DM
    Tyrrell, AM
    [J]. ELECTRONICS LETTERS, 1997, 33 (04) : 270 - 271
  • [35] Pitch detection algorithm of overlapping speech based on the energy of pitch and its harmonic
    Zhao Jun
    Pan Yong-xiang
    [J]. Proceedings of 2005 Chinese Control and Decision Conference, Vols 1 and 2, 2005, : 1439 - 1442
  • [36] ROBUST SPEECH ACTIVITY DETECTION IN MOVIE AUDIO: DATA RESOURCES AND EXPERIMENTAL EVALUATION
    Hebbar, Rajat
    Somandepalli, Krishna
    Narayanan, Shrikanth
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 4105 - 4109
  • [37] Feedback-Driven Sensory Mapping Adaptation for Robust Speech Activity Detection
    Bellur, Ashwin
    Elhilali, Mounya
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (03) : 481 - 492
  • [38] A robust low complexity voice activity detection algorithm for speech communication systems
    Benyassine, A
    Shlomot, E
    Su, HY
    Yuen, E
    [J]. 1997 IEEE WORKSHOP ON SPEECH CODING FOR TELECOMMUNICATIONS, PROCEEDINGS: BACK TO BASICS: ATTACKING FUNDAMENTAL PROBLEMS IN SPEECH CODING, 1997, : 97 - 98
  • [39] Speaker-Dependent Voice Activity Detection Robust to Background Speech Noise
    Matsuda, Shigeki
    Ito, Naoya
    Tsujino, Kosuke
    Kashioka, Hideki
    Sagayama, Shigeki
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 2625 - 2628
  • [40] Pitch detection and formant analysis of Arabic speech processing
    Cherif, A
    Bouafif, L
    Dabbabi, T
    [J]. APPLIED ACOUSTICS, 2001, 62 (10) : 1129 - 1140