Enhancing Target Speech Based on Nonlinear Soft Masking Using a Single Acoustic Vector Sensor

被引：7

作者：

Zou, Yuexian ^{[1
]}

Liu, Zhaoyi ^{[1
]}

Ritz, Christian H. ^{[2
]}

机构：

[1] Peking Univ, Shenzhen Grad Sch, ADSPLAB, Sch Elect Comp Engn, Shenzhen 518055, Peoples R China

[2] Univ Wollongong, Sch Elect Comp & Telecommun Engn, Wollongong, NSW 2500, Australia

来源：

APPLIED SCIENCES-BASEL | 2018年 / 8卷 / 09期

基金：

中国国家自然科学基金;

关键词：

Direction of Arrival (DOA); time-frequency (TF) mask; speech sparsity; speech enhancement (SE); acoustic vector sensor (AVS); intelligent service robot; PERFORMANCE;

D O I：

10.3390/app8091436

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

Enhancing speech captured by distant microphones is a challenging task. In this study, we investigate the multichannel signal properties of the single acoustic vector sensor (AVS) to obtain the inter-sensor data ratio (ISDR) model in the time-frequency (TF) domain. Then, the monotone functions describing the relationship between the ISDRs and the direction of arrival (DOA) of the target speaker are derived. For the target speech enhancement (SE) task, the DOA of the target speaker is given, and the ISDRs are calculated. Hence, the TF components dominated by the target speech are extracted with high probability using the established monotone functions, and then, a nonlinear soft mask of the target speech is generated. As a result, a masking-based speech enhancement method is developed, which is termed the AVS-SMASK method. Extensive experiments with simulated data and recorded data have been carried out to validate the effectiveness of our proposed AVS-SMASK method in terms of suppressing spatial speech interferences and reducing the adverse impact of the additive background noise while maintaining less speech distortion. Moreover, our AVS-SMASK method is computationally inexpensive, and the AVS is of a small physical size. These merits are favorable to many applications, such as robot auditory systems.

引用

页数：17

共 50 条

[21] Experimental demonstration of single carrier underwater acoustic communication using a vector sensor
Han, Xiao
Yin, Jing-wei
Yu, Ge
Du, Peng-yu
[J]. APPLIED ACOUSTICS, 2015, 98 : 1 - 5
[22] Multisource DOA Estimation in a Reverberant Environment Using a Single Acoustic Vector Sensor
Wu, Kai
Reju, Vaninirappuputhenpurayil Gopalan
Khong, Andy W. H.
[J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (10) : 1848 - 1859
[23] Single channel speech enhancement using temporal masking
Gunawan, TS
Ambikairajah, E
[J]. 2004 9TH IEEE SINGAPORE INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS (ICCS), 2004, : 250 - 254
[24] Research on Acoustic Three-user Communication Based on Single Vector Sensor
Wang Dayu
Zhao Anbang
Hui Junying
Li Xu
Hao Xiuqiang
Chen Yang
[J]. 2009 5TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-8, 2009, : 1163 - 1166
[25] A method for target depth estimation in the deep ocean using a single vector sensor
Zhang, Liyuan
Shi, Jie
Cheng, Yuezhu
Lu, Zhenghua
Yang, Jiyuan
[J]. Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 2024, 45 (11): : 2168 - 2175
[26] Cepstral Smoothing of Spectral Masks for Acoustic Vector-Sensor based Convolutive Speech Separation
Chen, Xiaoyi
Wang, Yingmin
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2014, : 855 - 858
[27] Acoustic Vector Sensor based Speech Source Separation with Mixed Gaussian-Laplacian Distributions
Chen, Xiaoyi
Alinaghi, Atiyeh
Zhong, Xionghu
Wang, Wenwu
[J]. 2013 18TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2013,
[28] Comparative Study of Beamformers for a Single Acoustic Vector Sensor
Lu, Da
Li, Hui
Yang, Kunde
Xu, Zhezhen
[J]. GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST, 2020,
[29] An Improved DOA Method for Single Acoustic Vector Sensor
Shi, Junjie
Sun, Dajun
[J]. 2009 5TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-8, 2009, : 1937 - 1941
[30] A Soft Decision-based Speech Enhancement using Acoustic Noise Classification
Choi, Jae-Hun
Kim, Sang-Kyun
Chang, Joon-Hyuk
[J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1200 - 1203

← 1 2 3 4 5 →