Robust and High-resolution Voiced/Unvoiced Classification in Noisy Speech Using A Signal Smoothness Criterion

被引:0
|
作者
Murthy, A. Sreenivasa [1 ]
Sekhar, S. Chandra [1 ]
Sreenivas, T. V. [1 ]
机构
[1] Indian Inst Sci, Dept Elect Commun Engn, Bangalore 560012, Karnataka, India
关键词
voiced; unvoiced; local polynomial model; regression; signal-to-noise ratio;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel technique for robust voiced/unvoiced segment detection in noisy speech, based on local polynomial regression. The local polynomial model is well-suited for voiced segments in speech. The unvoiced segments are noise-like and do not exhibit any smooth structure. This property of smoothness is used for devising a new metric called the variance ratio metric, which, after thresholding, indicates the voiced/unvoiced boundaries with 75% accuracy for 0dB global signal-to-noise ratio (SNR). A novelty of our algorithm is that it processes the signal continuously, sample-by-sample rather than frame-by-frame. Simulation results on TIMIT speech database (downsampled to 8kHz) for various SNRs are presented to illustrate the performance of the new algorithm. Results indicate that the algorithm is robust even in high noise levels.
引用
收藏
页码:2260 / 2263
页数:4
相关论文
共 50 条
  • [21] Neural classification in high-resolution ECG signal processing
    Kestler, HA
    Schwenker, F
    Palm, G
    Wöhrle, J
    Höher, M
    ADVANCES IN NONINVASIVE ELECTROCARDIOGRAPHIC MONITORING TECHNIQUES, 2000, 229 : 441 - 452
  • [22] High-resolution speech signal reconstruction in Wireless Sensor Networks
    Pazarloglou, Andria
    Stoleru, Radu
    Gutierrez-Osuna, Ricardo
    2009 6TH IEEE CONSUMER COMMUNICATIONS AND NETWORKING CONFERENCE, VOLS 1 AND 2, 2009, : 1123 - 1127
  • [23] Maximum Position alignment method for noisy high-resolution radar target classification
    Gil-Pita, Roberto
    Rosa-Zurera, Manuel
    Vicen-Bueno, Raul
    Ferreras, Francisco Lopez
    IEEE SIGNAL PROCESSING LETTERS, 2008, 15 : 120 - 123
  • [24] Robust Recognition of English Speech in Noisy Environments Using Frequency Warped Signal Processing
    Navneet Upadhyay
    Hamurabi Gamboa Rosales
    National Academy Science Letters, 2018, 41 : 15 - 22
  • [25] Robust Recognition of English Speech in Noisy Environments Using Frequency Warped Signal Processing
    Upadhyay, Navneet
    Gamboa Rosales, Hamurabi
    NATIONAL ACADEMY SCIENCE LETTERS-INDIA, 2018, 41 (01): : 15 - 22
  • [26] Robust Autodual Morphological Profiles for the Classification of High-Resolution Satellite Images
    Luo, Bin
    Zhang, Liangpei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2014, 52 (02): : 1451 - 1462
  • [27] High-resolution superlet transform based techniques for Parkinson's disease detection using speech signal
    Bhatt, Kavita
    Jayanthi, N.
    Kumar, Manjeet
    APPLIED ACOUSTICS, 2023, 214
  • [28] High-resolution WENO schemes using local variation-based smoothness indicator
    Pandey, Prashant Kumar
    Ismail, Farzad
    Dubey, Ritesh Kumar
    COMPUTATIONAL & APPLIED MATHEMATICS, 2022, 41 (05):
  • [29] High-resolution WENO schemes using local variation-based smoothness indicator
    Prashant Kumar Pandey
    Farzad Ismail
    Ritesh Kumar Dubey
    Computational and Applied Mathematics, 2022, 41
  • [30] High-Resolution Ugtrasonic Spectrometer Using Digital Signal Processing
    David R. Daughton
    Norbert Mulders
    Journal of Low Temperature Physics, 2004, 134 : 413 - 418