Improved single channel phase-aware speech enhancement technique for low signal-to-noise ratio signal

被引:20
|
作者
Samui, Suman [1 ]
Chakrabarti, Indrajit [2 ]
Ghosh, Soumya Kanti [3 ]
机构
[1] Indian Inst Technol, Adv Technol Dev Ctr, Kharagpur, W Bengal, India
[2] Indian Inst Technol, Dept Elect & Elect Commun Engn, Kharagpur, W Bengal, India
[3] Indian Inst Technol, Sch Informat Technol, Kharagpur, W Bengal, India
关键词
speech enhancement; signal denoising; spectral analysis; signal reconstruction; speech intelligibility; amplitude estimation; improved single channel phase-aware speech enhancement technique; low-signal-to-noise ratio signal; short-time spectral amplitude; phase corruption; additive noise contamination; phase-aware multiband spectral subtraction technique; spectral amplitude estimates; noise signal components; clean speech signal; composite quality measures; intelligibility assessment metrics; objective measure quality evaluation technique; SPECTRAL SUBTRACTION; SUPPRESSION; ALGORITHMS; INTELLIGIBILITY; DELAY;
D O I
10.1049/iet-spr.2015.0182
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the state-of-the-art single channel speech enhancement techniques, the short-time spectral amplitude is modified while the effect of the phase corruption due to the contamination of additive noise is neglected. This study introduces an improved speech enhancement algorithm based on a phase-aware multi-band spectral subtraction technique which estimates the spectral amplitude of the clean speech signal by considering the phase of the speech and noise signal components, and uses the estimated phase of the clean speech signal for signal reconstruction in the time domain. Experimental results show that the proposed algorithm yields better performance in terms of various objective and composite quality measures and other intelligibility assessment metrics while compared with other existing spectral subtraction methods. Using the composite objective measure quality evaluation technique, it is observed that the overall signal quality of the enhanced speech signal is improved on an average by 70% at 0 dB global input signal-to-noise ratio by using the proposed approach.
引用
收藏
页码:641 / 650
页数:10
相关论文
共 50 条
  • [31] MEASUREMENT OF SIGNAL-TO-NOISE RATIO USING A SINGLE CHANNEL POLARITY CONVERTER
    POPLAVSK.SM
    RADIO ENGINEERING AND ELECTRONIC PHYSICS-USSR, 1969, 14 (03): : 472 - &
  • [32] Optimal Signal Discrimination in a Low Signal-to-Noise Ratio Environment
    Ciodaro, Thiago
    2011 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2011, : 1085 - 1088
  • [33] UNetGAN: A Robust Speech Enhancement Approach in Time Domain for Extremely Low Signal-to-noise Ratio Condition
    Hao, Xiang
    Su, Xiangdong
    Wang, Zhiyu
    Zhang, Hui
    Batushiren
    INTERSPEECH 2019, 2019, : 1786 - 1790
  • [34] Weak Seismic Signal Enhancement for Low Signal-to-Noise Ratio Data Using Adaptive Nonstationary Signal Decomposition
    Qian, Quan
    Hu, Tianyue
    Zeng, Tongsheng
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21 : 1 - 5
  • [35] THE MEASUREMENT OF THE SIGNAL-TO-NOISE RATIO (SNR) IN CONTINUOUS SPEECH
    KLINGHOLZ, F
    SPEECH COMMUNICATION, 1987, 6 (01) : 15 - 26
  • [36] SIGNAL-TO-NOISE RATIO AS A PREDICTOR OF SPEECH TRANSMISSION QUALITY
    SEN, TK
    CARROLL, JD
    IEEE TRANSACTIONS ON AUDIO AND ELECTROACOUSTICS, 1973, AU21 (04): : 384 - 387
  • [37] A SUPERVISED SIGNAL-TO-NOISE RATIO ESTIMATION OF SPEECH SIGNALS
    Papadopoulos, Pavlos
    Tsiartas, Andreas
    Gibson, James
    Narayanan, Shrikanth
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [38] Use of a wavelet analysis technique for the enhancement of signal-to-noise ratio in ultrasonic NDE
    Chen, YJ
    Shi, YW
    Lei, YP
    INSIGHT, 1996, 38 (11) : 800 - 803
  • [39] A Study on the Benefits of Phase-Aware Speech Enhancement in Challenging Noise Scenarios
    Krawczyk-Becker, Martin
    Gerkmann, Timo
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION (LVA/ICA 2018), 2018, 10891 : 407 - 416
  • [40] Phase-aware subspace decomposition for single channel speech separation
    Wiem, Belhedi
    Mohamed Anouar, Ben Messaoud
    Aicha, Bouzid
    IET SIGNAL PROCESSING, 2020, 14 (04) : 214 - 222