Enhancing Target Speech Based on Nonlinear Soft Masking Using a Single Acoustic Vector Sensor

被引:7
|
作者
Zou, Yuexian [1 ]
Liu, Zhaoyi [1 ]
Ritz, Christian H. [2 ]
机构
[1] Peking Univ, Shenzhen Grad Sch, ADSPLAB, Sch Elect Comp Engn, Shenzhen 518055, Peoples R China
[2] Univ Wollongong, Sch Elect Comp & Telecommun Engn, Wollongong, NSW 2500, Australia
来源
APPLIED SCIENCES-BASEL | 2018年 / 8卷 / 09期
基金
中国国家自然科学基金;
关键词
Direction of Arrival (DOA); time-frequency (TF) mask; speech sparsity; speech enhancement (SE); acoustic vector sensor (AVS); intelligent service robot; PERFORMANCE;
D O I
10.3390/app8091436
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Enhancing speech captured by distant microphones is a challenging task. In this study, we investigate the multichannel signal properties of the single acoustic vector sensor (AVS) to obtain the inter-sensor data ratio (ISDR) model in the time-frequency (TF) domain. Then, the monotone functions describing the relationship between the ISDRs and the direction of arrival (DOA) of the target speaker are derived. For the target speech enhancement (SE) task, the DOA of the target speaker is given, and the ISDRs are calculated. Hence, the TF components dominated by the target speech are extracted with high probability using the established monotone functions, and then, a nonlinear soft mask of the target speech is generated. As a result, a masking-based speech enhancement method is developed, which is termed the AVS-SMASK method. Extensive experiments with simulated data and recorded data have been carried out to validate the effectiveness of our proposed AVS-SMASK method in terms of suppressing spatial speech interferences and reducing the adverse impact of the additive background noise while maintaining less speech distortion. Moreover, our AVS-SMASK method is computationally inexpensive, and the AVS is of a small physical size. These merits are favorable to many applications, such as robot auditory systems.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] An Effective Target Speech Enhancement with Single Acoustic Vector Sensor Based on the Speech Time-Frequency Sparsity
    Zou, Y. X.
    Wang, Y. Q.
    Wang, Peng
    Ritz, C. H.
    Xi, Jiangtao
    [J]. 2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2014, : 547 - 551
  • [2] A speech enhancement method for spatial target using an acoustic vector sensor
    Zou, Yuexian
    Wang, Peng
    Wang, Wenmin
    [J]. Qinghua Daxue Xuebao/Journal of Tsinghua University, 2013, 53 (06): : 883 - 887
  • [3] ACOUSTIC VECTOR SENSOR BASED REVERBERANT SPEECH SEPARATION WITH PROBABILISTIC TIME-FREQUENCY MASKING
    Zhong, Xionghu
    Chen, Xiaoyi
    Wang, Wenwu
    Alinaghi, Atiyeh
    Premkumar, A. B.
    [J]. 2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [4] SEPARATION OF SPEECH SOURCES USING AN ACOUSTIC VECTOR SENSOR
    Shujau, M.
    Ritz, C. H.
    Burnett, I. S.
    [J]. 2011 IEEE 13TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2011,
  • [5] Nonlinear Soft Sensor Development Based on Relevance Vector Machine
    Ge, Zhiqiang
    Song, Zhihuan
    [J]. INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2010, 49 (18) : 8685 - 8693
  • [6] SPEECH DEREVERBERATION BASED ON LINEAR PREDICTION: AN ACOUSTIC VECTOR SENSOR APPROACH
    Shujau, M.
    Ritz, C. H.
    Burnett, I. S.
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 639 - 643
  • [7] Acoustic Masking Based on Time-reversed Speech
    Jiang Jingsai
    Li Ye
    Zhang Peng
    Fan Yanhong
    Ma Xiaofeng
    Hao Qiuyun
    [J]. 2016 IEEE INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2016, : 905 - 909
  • [8] Informational masking of monaural target speech by a single contralateral formant
    Roberts, Brian
    Summers, Robert J.
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2015, 137 (05): : 2726 - 2736
  • [9] Multi-target bearing tracking with a single acoustic vector sensor based on multi-Bernoulli filter
    Gunes, Ahmet
    Guldogan, Mehmet B.
    [J]. OCEANS 2015 - GENOVA, 2015,
  • [10] Acoustic direction finding using single acoustic vector sensor under high reverberation
    Aktas, Metin
    Ozkan, Huseyin
    [J]. DIGITAL SIGNAL PROCESSING, 2018, 75 : 56 - 70