DUAL-CHANNEL ITERATIVE SPEECH ENHANCEMENT WITH CONSTRAINTS ON AN AUDITORY-BASED SPECTRUM

被引:34
|
作者
NANDKUMAR, S
HANSEN, JHL
机构
[1] Robust Speech Processing Laboratory, Department of Electrical Engineering, Duke University, Durham
来源
基金
美国国家科学基金会;
关键词
D O I
10.1109/89.365384
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A new frequency-domain, constrained iterative algorithm is proposed for dual-channel speech enhancement, The dual-channel enhancement scheme is shown to follow the iterative expectation-maximization (EM) algorithm, resulting in a two-step dual-channel Wiener filtering scheme, A new technique for applying constraints during the EM iterations is developed so as to take advantage of the auditory properties of speech perception, An overriding goal is to enhance quality and at least maintain intelligibility of the estimated speech signal, Constraints are applied over time and iteration on mel-cepstral parameters which parametrize an auditory based spectrum, These constraints also adapt to changing speech characteristics over time with the aid of an adaptive boundary detector, Performance is demonstrated in three areas for speech degraded by additive white Gaussian noise, aircraft cockpit noise, and computer cooling-fan noise, First, global objective speech quality measures show improved quality when compared to unconstrained dual-channel Wiener filtering and a traditional LMS-based adaptive noise cancellation technique, over a range of signal-to-noise ratios and cross-talk levels, Second, time waveforms and frame-to-frame quality measures show good improvement, especially in unvoiced and transitional regions of speech, Informal listening tests confirm improvement in quality as measured by objective measures, Finally, objective measures classified over individual phonemes for a subset of sentences from the TIMIT speech database show a consistent and superior improvement in quality.
引用
收藏
页码:22 / 34
页数:13
相关论文
共 50 条
  • [1] Speech Enhancement Using Auditory-Based Transform
    Tank, Vanita Raj
    Mahajan, S. P.
    Khaparde, Arti
    Deshpande, Rahul
    [J]. 2015 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING (ICICS), 2015,
  • [2] Dual-channel auditory spectrum modeling
    Billa, J
    [J]. 1997 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I - V: VOL I: PLENARY, EXPERT SUMMARIES, SPECIAL, AUDIO, UNDERWATER ACOUSTICS, VLSI; VOL II: SPEECH PROCESSING; VOL III: SPEECH PROCESSING, DIGITAL SIGNAL PROCESSING; VOL IV: MULTIDIMENSIONAL SIGNAL PROCESSING, NEURAL NETWORKS - VOL V: STATISTICAL SIGNAL AND ARRAY PROCESSING, APPLICATIONS, 1997, : 995 - 998
  • [3] Dual-channel speech intelligibility enhancement based on the psychoacoustics
    Lee, Sang-Hoon
    Jeong, Hong
    [J]. LECTURE NOTES IN SIGNAL SCIENCE, INTERNET AND EDUCATION (SSIP'07/MIV'07/DIWEB'07), 2007, : 83 - +
  • [4] Auditory-Based Spectral Amplitude Estimators for Speech Enhancement
    Plourde, Eric
    Champagne, Benoit
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (08): : 1614 - 1623
  • [5] Dual-channel DNN-based Speech Enhancement for Smartphones
    Martin-Donas, Juan M.
    Gomez, Angel M.
    Lopez-Espejo, Ivan
    Peinado, Antonio M.
    [J]. 2017 IEEE 19TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2017,
  • [6] Dual-channel speech enhancement by superdirective beamforming
    Lotter, Thomas
    Vary, Peter
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2006, 2006 (1) : 1 - 14
  • [7] Dual-Channel Speech Enhancement by Superdirective Beamforming
    Thomas Lotter
    Peter Vary
    [J]. EURASIP Journal on Advances in Signal Processing, 2006
  • [8] A Dual-channel Speech Enhancement Method for Cellular Communication
    Nabi, Wahbi
    Aloui, Noureddine
    Cherif, Adnane
    [J]. 2018 4TH INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2018,
  • [9] Real-time spectrum estimation–based dual-channel speech-enhancement algorithm for cochlear implant
    Yousheng Chen
    Qin Gong
    [J]. BioMedical Engineering OnLine, 11
  • [10] Noise variance estimation based on dual-channel phase difference for speech enhancement
    Kim, Seon Man
    Kim, Hong Kook
    [J]. DIGITAL SIGNAL PROCESSING, 2014, 26 : 169 - 182