Robust and High-resolution Voiced/Unvoiced Classification in Noisy Speech Using A Signal Smoothness Criterion

Cited by: 0

Authors:

Murthy, A. Sreenivasa [1]

Sekhar, S. Chandra [1]

Sreenivas, T. V. [1]

Affiliations:

[1] Indian Inst Sci, Dept Elect Commun Engn, Bangalore 560012, Karnataka, India

Source:

INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4 | 2007

Keywords:

voiced; unvoiced; local polynomial model; regression; signal-to-noise ratio;

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We propose a novel technique for robust voiced/unvoiced segment detection in noisy speech, based on local polynomial regression. The local polynomial model is well suited to voiced segments of speech, whereas unvoiced segments are noise-like and do not exhibit any smooth structure. This smoothness property is used to devise a new metric, the variance ratio metric, which, after thresholding, indicates the voiced/unvoiced boundaries with 75% accuracy at 0 dB global signal-to-noise ratio (SNR). A novelty of our algorithm is that it processes the signal continuously, sample by sample, rather than frame by frame. Simulation results on the TIMIT speech database (downsampled to 8 kHz) are presented for various SNRs to illustrate the performance of the new algorithm. Results indicate that the algorithm is robust even at high noise levels.
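
The abstract does not define the variance ratio metric, so the following Python sketch is only an illustration of the general idea, not the authors' algorithm. It assumes the metric can be approximated as the variance of the local polynomial fitting residual divided by the variance of the signal in a window centred on each sample. The function names (`smoothness_ratio`, `classify_voiced`), the window half-width, the polynomial order, and the threshold are all hypothetical choices made for this example.

```python
import numpy as np

def smoothness_ratio(x, half_width=20, order=3):
    """Per-sample ratio of local polynomial residual variance to window variance.

    Hypothetical stand-in for the variance ratio metric named in the abstract:
    values near 0 mean the window is well described by a smooth polynomial,
    values near 1 mean it is noise-like.
    """
    x = np.asarray(x, dtype=float)
    n = 2 * half_width + 1
    t = np.linspace(-1.0, 1.0, n)
    A = np.vander(t, order + 1)          # polynomial design matrix, shape (n, order + 1)
    P = A @ np.linalg.pinv(A)            # least-squares projection onto polynomials
    padded = np.pad(x, half_width, mode="reflect")
    # One window per input sample, centred on that sample (sample-by-sample processing).
    idx = np.arange(len(x))[:, None] + np.arange(n)[None, :]
    windows = padded[idx]                # shape (len(x), n)
    residual = windows - windows @ P.T   # part of each window the polynomial cannot explain
    return residual.var(axis=1) / (windows.var(axis=1) + 1e-12)

def classify_voiced(x, threshold=0.5, **kwargs):
    """Label a sample as voiced (True) when its window is smooth enough.

    The threshold is an assumed value, not one taken from the paper.
    """
    return smoothness_ratio(x, **kwargs) < threshold

# Synthetic check at the 8 kHz rate mentioned in the abstract:
# a 100 Hz sinusoid stands in for a smooth segment, white noise for a noise-like one.
fs = 8000
time = np.arange(fs // 4) / fs
smooth_segment = np.sin(2 * np.pi * 100 * time)
noisy_segment = np.random.default_rng(0).standard_normal(len(time))

smooth_labels = classify_voiced(smooth_segment)
noisy_labels = classify_voiced(noisy_segment)
```

On these synthetic signals the smooth segment is labelled almost entirely as voiced and the noise segment almost entirely as unvoiced. Real noisy speech would need a window length and threshold tuned to the data, and this sketch makes no claim about the accuracy figures reported in the abstract.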


Pages: 2260-2263

Number of pages: 4

50 entries in total

[1] Robust voiced/unvoiced speech classification using fuzzy rules
Beritelli, F
Casale, S
1997 IEEE WORKSHOP ON SPEECH CODING FOR TELECOMMUNICATIONS, PROCEEDINGS: BACK TO BASICS: ATTACKING FUNDAMENTAL PROBLEMS IN SPEECH CODING, 1997, : 5 - 6
[2] Speech enhancement using voiced/unvoiced classification
Lachiri, Z
Ellouze, N
8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS, AND INFORMATICS, VOL XVI, PROCEEDINGS, 2004, : 345 - 349
[3] IFAS-based voiced/unvoiced classification of speech signal
Arifianto, D
Kobayashi, T
2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING I, 2003, : 812 - 815
[4] On a Classification of Voiced/Unvoiced by using SNR for Speech Recognition
Kim, Jongkuk
Hahn, Hernsoo
PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON ADVANCED COMPUTER SCIENCE AND ELECTRONICS INFORMATION (ICACSEI 2013), 2013, 41 : 472 - 476
[5] A pattern recognition approach to robust voiced/unvoiced speech classification using fuzzy logic
Beritelli, F
Casale, S
Russo, M
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 1999, 13 (01) : 109 - 132
[6] Robust Voiced/Unvoiced Speech Classification using Empirical Mode Decomposition and Periodic Correlation Model
Molla, Md. Khademul Islam
Hirose, Keikichi
Minematsu, Nobuaki
INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2530 - +
[7] VOICED-UNVOICED CLASSIFICATION OF SPEECH USING AUTOCORRELATION MATRIX
Senturk, Zekeriya
Yetgin, Omer Emre
Salor, Ozgul
2014 22ND SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2014, : 1802 - 1805
[8] HIGH-RESOLUTION SINUSOIDAL MODELING OF UNVOICED SPEECH
Kafentzis, George P.
Stylianou, Yannis
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 4985 - 4989
[9] A Fast Method for High-Resolution Voiced/Unvoiced Detection and Glottal Closure/Opening Instant Estimation of Speech
Koutrouvelis, Andreas I.
Kafentzis, George P.
Gaubitch, Nikolay D.
Heusdens, Richard
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (02) : 316 - 328
[10] A robust Voiced/Unvoiced phoneme classification from whispered speech using the 'color' of whispered phonemes and Deep Neural Network
Meenakshi, G. Nisha
Ghosh, Prasanta Kumar
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 503 - 507
