An adaptive a priori SNR estimator for perceptual speech enhancement

被引：6

作者：

Nahma, Lara ^{[1
]}

Yong, Pei Chee ^{[2
]}

Dam, Hai Huyen ^{[1
]}

Nordholm, Sven ^{[1
]}

机构：

[1] Curtin Univ, Dept Elect Engn Comp & Math Sci, Perth, Australia

[2] Nuheara Ltd, Perth, Australia

来源：

EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING | 2019年 / 2019卷 / 1期

关键词：

Single-channel speech enhancement; A priori SNR estimation; Decision-directed approach; Adaptive smoothing factor; Auditory system; QUALITY ASSESSMENT; NOISE; SUPPRESSION; PESQ;

D O I：

10.1186/s13636-019-0150-3

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, an adaptive averaging a priori SNR estimation employing critical band processing is proposed. The proposed method modifies the current decision-directed a priori SNR estimation to achieve faster tracking when SNR changes. The decision-directed estimator (DD) employs a fixed weighting with the value close to one, which makes it slow in following the onsets of speech utterances. The proposed SNR estimator provides a means to solve this issue by employing an adaptive weighting factor. This allows an improved tracking of onset changes in the speech signal. As a consequence, it results in better preservation of speech components. This adaptive technique ensures that the weighting between the modified decision-directed a priori estimate and the maximum likelihood a priori estimate is a function of the speech absence probability. The estimate of the speech absence probability is modeled by a sigmoid function. Furthermore, a critical band mapping for the short-time Fourier transform analysis-synthesis system is utilized in the speech enhancement to achieve less musical noise. In addition, to evaluate the ability of the a priori SNR estimation method in preserving speech components, we proposed a modified objective measurement known as modified hamming distance. Evaluations are performed by utilizing both objective and subjective measurements. The experimental results show that the proposed method improves the speech quality under different noise conditions. Moreover, it maintains the advantage of the DD approach in eliminating the musical noise under different SNR conditions. The objective results are supported by subjective listening tests using 10 subjects (5 males and 5 females).

引用

页数：20

共 50 条

[1] An adaptive a priori SNR estimator for perceptual speech enhancement
Lara Nahma
Pei Chee Yong
Hai Huyen Dam
Sven Nordholm
EURASIP Journal on Audio, Speech, and Music Processing, 2019
[2] Speech enhancement using a noncausal a priori SNR estimator
Cohen, I
IEEE SIGNAL PROCESSING LETTERS, 2004, 11 (09) : 725 - 728
[3] A novel approach to a robust a Priori SNR estimator in speech enhancement
Park, Yun-Sik
Chang, Joon-Hyuk
IEICE TRANSACTIONS ON COMMUNICATIONS, 2007, E90B (08) : 2182 - 2185
[4] A Novel Approach to a Robust A Priori SNR Estimator in Speech Enhancement
Park, Yun-Sik
Chang, Joon-Hyuk
JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2006, 25 (08): : 383 - 388
[5] A new a priori SNR estimator based on multiple linear regression technique for speech enhancement
Lee, Soojeong
Lim, Chungsoo
Chang, Joon-Hyuk
DIGITAL SIGNAL PROCESSING, 2014, 30 : 154 - 164
[6] A New Improved Algorithm of Speech Enhancement Based on MCRA and Noncausal a Priori SNR Estimator
Ying, Na
Wang, Xuzhen
Liu, Jianwei
Wang, Qinfang
Hua, Jianzhi
Yang, Qingbiao
PROCEEDINGS OF THE 2014 INTERNATIONAL CONFERENCE ON MECHATRONICS, ELECTRONIC, INDUSTRIAL AND CONTROL ENGINEERING, 2014, 5 : 1281 - 1284
[7] A Priori SNR Estimator Based on a Convex Combination of Two DD Approaches for Speech Enhancement
Shen, Suojin
Ou, Shifeng
Wei, Jing
Gao, Ying
2016 IEEE INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP), 2016, : 750 - 754
[8] An improved SNR estimator for speech enhancement
Ren, Yao
Johnson, Michael T.
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 4901 - 4904
[9] IMPROVED A PRIORI SNR ESTIMATION IN SPEECH ENHANCEMENT
Nahma, Lara
Yong, Pei Chee
Dam, Hai Huyen
Nordholm, Sven
2017 23RD ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS (APCC): BRIDGING THE METROPOLITAN AND THE REMOTE, 2017, : 253 - 257
[10] An Iterative Speech Model-Based A Priori SNR Estimator
Elshamy, Samy
Madhu, Nilesh
Tirry, Wouter
Fingscheidt, Tim
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1740 - 1744

← 1 2 3 4 5 →