Enhancement of Single-Channel Periodic Signals in the Time-Domain

被引:31
|
作者
Jensen, Jesper Rindom [1 ]
Benesty, Jacob [2 ]
Christensen, Mads Graesboll [3 ]
Jensen, Soren Holdt [1 ]
机构
[1] Aalborg Univ, Dept Elect Syst, DK-9220 Aalborg, Denmark
[2] Univ Quebec, INRS EMT, Montreal, PQ H5A 1K6, Canada
[3] Aalborg Univ, Dept Architecture Design & Media Technol, DK-9220 Aalborg, Denmark
关键词
Harmonic decomposition; linearly constrained minimum variance (LCMV) filter; minimum variance distortionless response (MVDR) filter; nonstationary noise; orthogonal decomposition; performance measures; pitch; single-channel speech enhancement; time-domain filtering; SPEECH ENHANCEMENT; FUNDAMENTAL-FREQUENCY; NOISE; SUPPRESSION; REDUCTION;
D O I
10.1109/TASL.2012.2191957
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Most state-of-the-art filtering methods for speech enhancement require an estimate of the noise statistics, but the noise statistics are difficult to estimate in practice when speech is present. Thus, nonstationary noise will have a detrimental impact on the performance of most speech enhancement filters. The impact of such noise can be reduced by using the signal statistics rather than the noise statistics in the filter design. For example, this is possible by assuming a harmonic model for the desired signal; while this model fits well for voiced speech, it will not be appropriate for unvoiced speech. That is, signal-dependent methods based on the signal statistics will introduce undesired distortion for some parts of speech compared to signal-independent methods based on the noise statistics. Since both the signal-independent and signal-dependent approaches to speech enhancement have advantages, it is relevant to combine them to reduce the impact of their individual disadvantages. In this paper, we give theoretical insights into the relationship between these different approaches, and these reveal a close relationship between the two approaches. This justifies joint use of such filtering methods which can be beneficial from a practical point of view. Our experimental results confirm that both signal-independent and signal-dependent approaches have advantages and that they are closely-related. Moreover, as a part of our experiments, we illustrate the practical usefulness of combining signal-independent and signal-dependent enhancement methods by applying such methods jointly on real-life speech.
引用
收藏
页码:1948 / 1963
页数:16
相关论文
共 50 条
  • [1] Single-channel deep time-domain speech enhancement networks for cabin environments
    Zhang, Lin
    Wang, Haitao
    Yang, Shuang
    Zeng, Xiangyang
    Chen, Ke'an
    [J]. Shengxue Xuebao/Acta Acustica, 2023, 48 (04): : 890 - 900
  • [2] IMPROVING NOISE ROBUST AUTOMATIC SPEECH RECOGNITION WITH SINGLE-CHANNEL TIME-DOMAIN ENHANCEMENT NETWORK
    Kinoshita, Keisuke
    Ochiai, Tsubasa
    Delcroix, Marc
    Nakatani, Tomohiro
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7009 - 7013
  • [3] Time-domain adaptive attention network for single-channel speech separation
    Wang, Kunpeng
    Zhou, Hao
    Cai, Jingxiang
    Li, Wenna
    Yao, Juan
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)
  • [4] Single-channel signal separation using time-domain basis functions
    Jang, GJ
    Lee, TW
    Oh, YH
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2003, 10 (06) : 168 - 171
  • [5] Time-domain adaptive attention network for single-channel speech separation
    Kunpeng Wang
    Hao Zhou
    Jingxiang Cai
    Wenna Li
    Juan Yao
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [6] Biophysically-inspired single-channel speech enhancement in the time domain
    Wen, Chuan
    Verhulst, Sarah
    [J]. INTERSPEECH 2023, 2023, : 775 - 779
  • [7] Non-Causal Time-Domain Filters for Single-Channel Noise Reduction
    Jensen, Jesper Rindom
    Benesty, Jacob
    Christensen, Mads Graesboll
    Jensen, Soren Holdt
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (05): : 1526 - 1541
  • [8] Real-time Single-channel Dereverberation and Separation with Time-domain Audio Separation Network
    Luo, Yi
    Mesgarani, Nima
    [J]. 19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 342 - 346
  • [9] ON SINGLE-CHANNEL NOISE REDUCTION IN THE TIME DOMAIN
    Chen, Jingdong
    Benesty, Jacob
    Huang, Yiteng
    Gaensler, Tomas
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 277 - 280
  • [10] TASNET: TIME-DOMAIN AUDIO SEPARATION NETWORK FOR REAL-TIME, SINGLE-CHANNEL SPEECH SEPARATION
    Luo, Yi
    Mesgarani, Nima
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 696 - 700