Discrete cosine transform particle filter speech enhancement

被引:8
|
作者
Laska, Brady [1 ]
Bolic, Miodrag [3 ]
Goubran, Rafik [2 ]
机构
[1] Res Mot Ltd, Ottawa, ON K2K 3K1, Canada
[2] Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON K1S 5B6, Canada
[3] Univ Ottawa, Sch Informat Technol & Engn, Ottawa, ON K1N 6N5, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Speech enhancement; Noise reduction; Particle filtering; Discrete cosine transform (DCT); SUBSPACE APPROACH; KALMAN FILTER; NOISE;
D O I
10.1016/j.specom.2010.05.005
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A discrete cosine transform (DCT) domain speech enhancement algorithm is proposed that models the evolution of speech DCT coefficients as a time-varying autoregressive process. Rao-Blackwellized particle filter (RBPF) techniques are used to estimate the model parameters and recover the clean signal coefficients. Using very low-order models for each coefficient and operating at a decimated frame rate, the proposed approach provides a significant complexity reduction compared to the standard full-band RBPF speech enhancement algorithm. In addition to the complexity gains, performance is also improved. Modeling the speech signal in the DCT-domain is shown to provide a better fit in spectral troughs, leading to more noise reduction and less speech distortion. To illustrate possible frequency-dependent processing strategies, a hybrid structure is proposed that offers a complexity/performance trade-off by substituting a simple DCT Wiener filter for the DCT-RBPF in some bands. In comparisons with high performing speech enhancement algorithms using wide-band speech and noise, the proposed DCT-RBPF algorithm achieves higher scores on objective quality and intelligibility measures. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:762 / 775
页数:14
相关论文
共 50 条
  • [1] Noisy speech enhancement using discrete cosine transform
    Soon, IY
    Koh, SN
    Yeo, CK
    [J]. SPEECH COMMUNICATION, 1998, 24 (03) : 249 - 257
  • [2] Speech enhancement using warped discrete cosine transform
    Chang, JH
    Kim, NS
    [J]. 2002 IEEE SPEECH CODING WORKSHOP PROCEEDINGS: A PARADIGM SHIFT TOWARD NEW CODING FUNCTIONS FOR THE BROADBAND AGE, 2002, : 175 - 177
  • [3] Enhancement of noisy speech using sliding discrete cosine transform
    Kober, V
    [J]. PROGRESS IN PATTERN RECOGNITION, SPEECH AND IMAGE ANALYSIS, 2003, 2905 : 229 - 235
  • [4] On The Use of Discrete Cosine Transform Polarity Spectrum in Speech Enhancement
    Shi, Sisi
    Busch, Andrew
    Paliwal, Kuldip
    Fickenscher, Thomas
    [J]. 28TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2020), 2021, : 421 - 425
  • [5] Warped discrete cosine transform-based noisy speech enhancement
    Chang, JH
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2005, 52 (09) : 535 - 539
  • [6] Enhancement of speech using deep neural network with discrete cosine transform
    Ram, Rashmirekha
    Mohanty, Mihir Narayan
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 35 (01) : 141 - 148
  • [7] Discrete cosine transform for filter pruning
    Chen, Yaosen
    Zhou, Renshuang
    Guo, Bing
    Shen, Yan
    Wang, Wei
    Wen, Xuming
    Suo, Xinhua
    [J]. APPLIED INTELLIGENCE, 2023, 53 (03) : 3398 - 3414
  • [8] Discrete cosine transform for filter pruning
    Yaosen Chen
    Renshuang Zhou
    Bing Guo
    Yan Shen
    Wei Wang
    Xuming Wen
    Xinhua Suo
    [J]. Applied Intelligence, 2023, 53 : 3398 - 3414
  • [9] A comparison of estimation methods in the discrete cosine transform modulation domain for speech enhancement
    George, Aidan E. W.
    Pickersgill, Christine
    Schwerin, Belinda
    So, Stephen
    [J]. 2016 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2016,
  • [10] Fingerprint enhancement based on discrete cosine transform
    Jirachaweng, Suksan
    Areekul, Vutipong
    [J]. ADVANCES IN BIOMETRICS, PROCEEDINGS, 2007, 4642 : 96 - +