DUAL-CHANNEL ITERATIVE SPEECH ENHANCEMENT WITH CONSTRAINTS ON AN AUDITORY-BASED SPECTRUM

被引：34

作者：

NANDKUMAR, S

HANSEN, JHL

机构：

[1] Robust Speech Processing Laboratory, Department of Electrical Engineering, Duke University, Durham

来源：

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1995年 / 3卷 / 01期

基金：

美国国家科学基金会;

关键词：

D O I：

10.1109/89.365384

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

A new frequency-domain, constrained iterative algorithm is proposed for dual-channel speech enhancement, The dual-channel enhancement scheme is shown to follow the iterative expectation-maximization (EM) algorithm, resulting in a two-step dual-channel Wiener filtering scheme, A new technique for applying constraints during the EM iterations is developed so as to take advantage of the auditory properties of speech perception, An overriding goal is to enhance quality and at least maintain intelligibility of the estimated speech signal, Constraints are applied over time and iteration on mel-cepstral parameters which parametrize an auditory based spectrum, These constraints also adapt to changing speech characteristics over time with the aid of an adaptive boundary detector, Performance is demonstrated in three areas for speech degraded by additive white Gaussian noise, aircraft cockpit noise, and computer cooling-fan noise, First, global objective speech quality measures show improved quality when compared to unconstrained dual-channel Wiener filtering and a traditional LMS-based adaptive noise cancellation technique, over a range of signal-to-noise ratios and cross-talk levels, Second, time waveforms and frame-to-frame quality measures show good improvement, especially in unvoiced and transitional regions of speech, Informal listening tests confirm improvement in quality as measured by objective measures, Finally, objective measures classified over individual phonemes for a subset of sentences from the TIMIT speech database show a consistent and superior improvement in quality.

引用

页码：22 / 34

页数：13

共 50 条

[1] Speech Enhancement Using Auditory-Based Transform
Tank, Vanita Raj
Mahajan, S. P.
Khaparde, Arti
Deshpande, Rahul
[J]. 2015 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING (ICICS), 2015,
[2] Dual-channel auditory spectrum modeling
Billa, J
[J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 995 - 998
[3] Dual-channel speech intelligibility enhancement based on the psychoacoustics
Lee, Sang-Hoon
Jeong, Hong
[J]. LECTURE NOTES IN SIGNAL SCIENCE, INTERNET AND EDUCATION (SSIP'07/MIV'07/DIWEB'07), 2007, : 83 - +
[4] Auditory-Based Spectral Amplitude Estimators for Speech Enhancement
Plourde, Eric
Champagne, Benoit
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (08): : 1614 - 1623
[5] Dual-channel DNN-based Speech Enhancement for Smartphones
Martin-Donas, Juan M.
Gomez, Angel M.
Lopez-Espejo, Ivan
Peinado, Antonio M.
[J]. 2017 IEEE 19TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2017,
[6] Dual-channel speech enhancement by superdirective beamforming
Lotter, Thomas
Vary, Peter
[J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2006, 2006 (1) : 1 - 14
[7] Dual-Channel Speech Enhancement by Superdirective Beamforming
Thomas Lotter
Peter Vary
[J]. EURASIP Journal on Advances in Signal Processing, 2006
[8] A Dual-channel Speech Enhancement Method for Cellular Communication
Nabi, Wahbi
Aloui, Noureddine
Cherif, Adnane
[J]. 2018 4TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2018,
[9] Real-time spectrum estimation–based dual-channel speech-enhancement algorithm for cochlear implant
Yousheng Chen
Qin Gong
[J]. BioMedical Engineering OnLine, 11
[10] Noise variance estimation based on dual-channel phase difference for speech enhancement
Kim, Seon Man
Kim, Hong Kook
[J]. DIGITAL SIGNAL PROCESSING, 2014, 26 : 169 - 182

← 1 2 3 4 5 →