Enhancing Target Speech Based on Nonlinear Soft Masking Using a Single Acoustic Vector Sensor

被引:7
|
作者
Zou, Yuexian [1 ]
Liu, Zhaoyi [1 ]
Ritz, Christian H. [2 ]
机构
[1] Peking Univ, Shenzhen Grad Sch, ADSPLAB, Sch Elect Comp Engn, Shenzhen 518055, Peoples R China
[2] Univ Wollongong, Sch Elect Comp & Telecommun Engn, Wollongong, NSW 2500, Australia
来源
APPLIED SCIENCES-BASEL | 2018年 / 8卷 / 09期
基金
中国国家自然科学基金;
关键词
Direction of Arrival (DOA); time-frequency (TF) mask; speech sparsity; speech enhancement (SE); acoustic vector sensor (AVS); intelligent service robot; PERFORMANCE;
D O I
10.3390/app8091436
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Enhancing speech captured by distant microphones is a challenging task. In this study, we investigate the multichannel signal properties of the single acoustic vector sensor (AVS) to obtain the inter-sensor data ratio (ISDR) model in the time-frequency (TF) domain. Then, the monotone functions describing the relationship between the ISDRs and the direction of arrival (DOA) of the target speaker are derived. For the target speech enhancement (SE) task, the DOA of the target speaker is given, and the ISDRs are calculated. Hence, the TF components dominated by the target speech are extracted with high probability using the established monotone functions, and then, a nonlinear soft mask of the target speech is generated. As a result, a masking-based speech enhancement method is developed, which is termed the AVS-SMASK method. Extensive experiments with simulated data and recorded data have been carried out to validate the effectiveness of our proposed AVS-SMASK method in terms of suppressing spatial speech interferences and reducing the adverse impact of the additive background noise while maintaining less speech distortion. Moreover, our AVS-SMASK method is computationally inexpensive, and the AVS is of a small physical size. These merits are favorable to many applications, such as robot auditory systems.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Experimental demonstration of single carrier underwater acoustic communication using a vector sensor
    Han, Xiao
    Yin, Jing-wei
    Yu, Ge
    Du, Peng-yu
    [J]. APPLIED ACOUSTICS, 2015, 98 : 1 - 5
  • [22] Multisource DOA Estimation in a Reverberant Environment Using a Single Acoustic Vector Sensor
    Wu, Kai
    Reju, Vaninirappuputhenpurayil Gopalan
    Khong, Andy W. H.
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (10) : 1848 - 1859
  • [23] Single channel speech enhancement using temporal masking
    Gunawan, TS
    Ambikairajah, E
    [J]. 2004 9TH IEEE SINGAPORE INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS (ICCS), 2004, : 250 - 254
  • [24] Research on Acoustic Three-user Communication Based on Single Vector Sensor
    Wang Dayu
    Zhao Anbang
    Hui Junying
    Li Xu
    Hao Xiuqiang
    Chen Yang
    [J]. 2009 5TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-8, 2009, : 1163 - 1166
  • [25] A method for target depth estimation in the deep ocean using a single vector sensor
    Zhang, Liyuan
    Shi, Jie
    Cheng, Yuezhu
    Lu, Zhenghua
    Yang, Jiyuan
    [J]. Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 2024, 45 (11): : 2168 - 2175
  • [26] Cepstral Smoothing of Spectral Masks for Acoustic Vector-Sensor based Convolutive Speech Separation
    Chen, Xiaoyi
    Wang, Yingmin
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2014, : 855 - 858
  • [27] Acoustic Vector Sensor based Speech Source Separation with Mixed Gaussian-Laplacian Distributions
    Chen, Xiaoyi
    Alinaghi, Atiyeh
    Zhong, Xionghu
    Wang, Wenwu
    [J]. 2013 18TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2013,
  • [28] Comparative Study of Beamformers for a Single Acoustic Vector Sensor
    Lu, Da
    Li, Hui
    Yang, Kunde
    Xu, Zhezhen
    [J]. GLOBAL OCEANS 2020: SINGAPORE - U.S. GULF COAST, 2020,
  • [29] An Improved DOA Method for Single Acoustic Vector Sensor
    Shi, Junjie
    Sun, Dajun
    [J]. 2009 5TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-8, 2009, : 1937 - 1941
  • [30] A Soft Decision-based Speech Enhancement using Acoustic Noise Classification
    Choi, Jae-Hun
    Kim, Sang-Kyun
    Chang, Joon-Hyuk
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1200 - 1203