Robust and High-resolution Voiced/Unvoiced Classification in Noisy Speech Using A Signal Smoothness Criterion

被引:0
|
作者
Murthy, A. Sreenivasa [1 ]
Sekhar, S. Chandra [1 ]
Sreenivas, T. V. [1 ]
机构
[1] Indian Inst Sci, Dept Elect Commun Engn, Bangalore 560012, Karnataka, India
关键词
voiced; unvoiced; local polynomial model; regression; signal-to-noise ratio;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel technique for robust voiced/unvoiced segment detection in noisy speech, based on local polynomial regression. The local polynomial model is well-suited for voiced segments in speech. The unvoiced segments are noise-like and do not exhibit any smooth structure. This property of smoothness is used for devising a new metric called the variance ratio metric, which, after thresholding, indicates the voiced/unvoiced boundaries with 75% accuracy for 0dB global signal-to-noise ratio (SNR). A novelty of our algorithm is that it processes the signal continuously, sample-by-sample rather than frame-by-frame. Simulation results on TIMIT speech database (downsampled to 8kHz) for various SNRs are presented to illustrate the performance of the new algorithm. Results indicate that the algorithm is robust even in high noise levels.
引用
收藏
页码:2260 / 2263
页数:4
相关论文
共 50 条
  • [1] Robust voiced/unvoiced speech classification using fuzzy rules
    Beritelli, F
    Casale, S
    1997 IEEE WORKSHOP ON SPEECH CODING FOR TELECOMMUNICATIONS, PROCEEDINGS: BACK TO BASICS: ATTACKING FUNDAMENTAL PROBLEMS IN SPEECH CODING, 1997, : 5 - 6
  • [2] Speech enhancement using voiced/unvoiced classification
    Lachiri, Z
    Ellouze, N
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS, AND INFORMATICS, VOL XVI, PROCEEDINGS, 2004, : 345 - 349
  • [3] IFAS-based voiced/unvoiced classification of speech signal
    Arifianto, D
    Kobayashi, T
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 812 - 815
  • [4] On a Classification of Voiced/Unvoiced by using SNR for Speech Recognition
    Kim, Jongkuk
    Hahn, Hernsoo
    PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND ELECTRONICS INFORMATION (ICACSEI 2013), 2013, 41 : 472 - 476
  • [5] A pattern recognition approach to robust voiced/unvoiced speech classification using fuzzy logic
    Beritelli, F
    Casale, S
    Russo, M
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 1999, 13 (01) : 109 - 132
  • [6] Robust Voiced/Unvoiced Speech Classification using Empirical Mode Decomposition and Periodic Correlation Model
    Molla, Md. Khademul Islam
    Hirose, Keikichi
    Minematsu, Nobuaki
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2530 - +
  • [7] VOICED-UNVOICED CLASSIFICATION OF SPEECH USING AUTOCORRELATION MATRIX
    Senturk, Zekeriya
    Yetgin, Omer Emre
    Salor, Ozgul
    2014 22ND SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2014, : 1802 - 1805
  • [8] HIGH-RESOLUTION SINUSOIDAL MODELING OF UNVOICED SPEECH
    Kafentzis, George P.
    Stylianou, Yannis
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 4985 - 4989
  • [9] A Fast Method for High-Resolution Voiced/Unvoiced Detection and Glottal Closure/Opening Instant Estimation of Speech
    Koutrouvelis, Andreas I.
    Kafentzis, George P.
    Gaubitch, Nikolay D.
    Heusdens, Richard
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (02) : 316 - 328
  • [10] A robust Voiced/Unvoiced phoneme classification from whispered speech using the 'color' of whispered phonemes and Deep Neural Network
    Meenakshi, G. Nisha
    Ghosh, Prasanta Kumar
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 503 - 507