An adaptive a priori SNR estimator for perceptual speech enhancement

被引:6
|
作者
Nahma, Lara [1 ]
Yong, Pei Chee [2 ]
Dam, Hai Huyen [1 ]
Nordholm, Sven [1 ]
机构
[1] Curtin Univ, Dept Elect Engn Comp & Math Sci, Perth, Australia
[2] Nuheara Ltd, Perth, Australia
关键词
Single-channel speech enhancement; A priori SNR estimation; Decision-directed approach; Adaptive smoothing factor; Auditory system; QUALITY ASSESSMENT; NOISE; SUPPRESSION; PESQ;
D O I
10.1186/s13636-019-0150-3
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, an adaptive averaging a priori SNR estimation employing critical band processing is proposed. The proposed method modifies the current decision-directed a priori SNR estimation to achieve faster tracking when SNR changes. The decision-directed estimator (DD) employs a fixed weighting with the value close to one, which makes it slow in following the onsets of speech utterances. The proposed SNR estimator provides a means to solve this issue by employing an adaptive weighting factor. This allows an improved tracking of onset changes in the speech signal. As a consequence, it results in better preservation of speech components. This adaptive technique ensures that the weighting between the modified decision-directed a priori estimate and the maximum likelihood a priori estimate is a function of the speech absence probability. The estimate of the speech absence probability is modeled by a sigmoid function. Furthermore, a critical band mapping for the short-time Fourier transform analysis-synthesis system is utilized in the speech enhancement to achieve less musical noise. In addition, to evaluate the ability of the a priori SNR estimation method in preserving speech components, we proposed a modified objective measurement known as modified hamming distance. Evaluations are performed by utilizing both objective and subjective measurements. The experimental results show that the proposed method improves the speech quality under different noise conditions. Moreover, it maintains the advantage of the DD approach in eliminating the musical noise under different SNR conditions. The objective results are supported by subjective listening tests using 10 subjects (5 males and 5 females).
引用
收藏
页数:20
相关论文
共 50 条
  • [1] An adaptive a priori SNR estimator for perceptual speech enhancement
    Lara Nahma
    Pei Chee Yong
    Hai Huyen Dam
    Sven Nordholm
    EURASIP Journal on Audio, Speech, and Music Processing, 2019
  • [2] Speech enhancement using a noncausal a priori SNR estimator
    Cohen, I
    IEEE SIGNAL PROCESSING LETTERS, 2004, 11 (09) : 725 - 728
  • [3] A novel approach to a robust a Priori SNR estimator in speech enhancement
    Park, Yun-Sik
    Chang, Joon-Hyuk
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2007, E90B (08) : 2182 - 2185
  • [4] A Novel Approach to a Robust A Priori SNR Estimator in Speech Enhancement
    Park, Yun-Sik
    Chang, Joon-Hyuk
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2006, 25 (08): : 383 - 388
  • [5] A new a priori SNR estimator based on multiple linear regression technique for speech enhancement
    Lee, Soojeong
    Lim, Chungsoo
    Chang, Joon-Hyuk
    DIGITAL SIGNAL PROCESSING, 2014, 30 : 154 - 164
  • [6] A New Improved Algorithm of Speech Enhancement Based on MCRA and Noncausal a Priori SNR Estimator
    Ying, Na
    Wang, Xuzhen
    Liu, Jianwei
    Wang, Qinfang
    Hua, Jianzhi
    Yang, Qingbiao
    PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON MECHATRONICS, ELECTRONIC, INDUSTRIAL AND CONTROL ENGINEERING, 2014, 5 : 1281 - 1284
  • [7] A Priori SNR Estimator Based on a Convex Combination of Two DD Approaches for Speech Enhancement
    Shen, Suojin
    Ou, Shifeng
    Wei, Jing
    Gao, Ying
    2016 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2016, : 750 - 754
  • [8] An improved SNR estimator for speech enhancement
    Ren, Yao
    Johnson, Michael T.
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4901 - 4904
  • [9] IMPROVED A PRIORI SNR ESTIMATION IN SPEECH ENHANCEMENT
    Nahma, Lara
    Yong, Pei Chee
    Dam, Hai Huyen
    Nordholm, Sven
    2017 23RD ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS (APCC): BRIDGING THE METROPOLITAN AND THE REMOTE, 2017, : 253 - 257
  • [10] An Iterative Speech Model-Based A Priori SNR Estimator
    Elshamy, Samy
    Madhu, Nilesh
    Tirry, Wouter
    Fingscheidt, Tim
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1740 - 1744