Dual-Channel Cosine Function Based ITD Estimation for Robust Speech Separation

被引:1
|
作者
Li, Xuliang [1 ]
Ding, Zhaogui [1 ]
Li, Weifeng [1 ]
Liao, Qingmin [1 ]
机构
[1] Tsinghua Univ, Grad Sch Shenzhen, Dept Elect Engn, Beijing 100084, Peoples R China
来源
SENSORS | 2017年 / 17卷 / 06期
关键词
delay-and-sum beamforming; binary time-frequency mask; cosine function;
D O I
10.3390/s17061447
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In speech separation tasks, many separation methods have the limitation that the microphones are closely spaced, which means that these methods are unprevailing for phase wrap-around. In this paper, we present a novel speech separation scheme by using two microphones that does not have this restriction. The technique utilizes the estimation of interaural time difference (ITD) statistics and binary time-frequency mask for the separation of mixed speech sources. The novelties of the paper consist in: (1) the extended application of delay-and-sum beamforming (DSB) and cosine function for ITD calculation; and (2) the clarification of the connection between ideal binary mask and DSB amplitude ratio. Our objective quality evaluation experiments demonstrate the effectiveness of the proposed method.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] BEAMFORMED FEATURE FOR LEARNING-BASED DUAL-CHANNEL SPEECH SEPARATION
    Li, Hao
    Zhang, Xueliang
    Gao, Guanglai
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4722 - 4726
  • [2] Dual-Channel Speech Enhancement Based on Extended Kalman Filter Relative Transfer Function Estimation
    Martin-Donas, Juan M.
    Peinado, Antonio M.
    Lopez-Espejo, Ivan
    Gomez, Angel
    [J]. APPLIED SCIENCES-BASEL, 2019, 9 (12):
  • [3] SPEAKER AND DIRECTION INFERRED DUAL-CHANNEL SPEECH SEPARATION
    Li, Chenxing
    Xu, Jiaming
    Mesgarani, Nima
    Xu, Bo
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 5779 - 5783
  • [4] Noise variance estimation based on dual-channel phase difference for speech enhancement
    Kim, Seon Man
    Kim, Hong Kook
    [J]. DIGITAL SIGNAL PROCESSING, 2014, 26 : 169 - 182
  • [5] ROBUST DUAL-CHANNEL NOISE POWER SPECTRAL DENSITY ESTIMATION
    Jeub, Marco
    Nelke, Christoph
    Krueger, Hauke
    Beaugeant, Christophe
    Vary, Peter
    [J]. 19TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO-2011), 2011, : 2304 - 2308
  • [6] Dual-channel spectral weighting for robust speech recognition in mobile devices
    Lopez-Espejo, Ivan
    Peinado, Antonio M.
    Gomez, Angel M.
    Gonzalez, Jose A.
    [J]. DIGITAL SIGNAL PROCESSING, 2018, 75 : 13 - 24
  • [7] Dual-channel speech separation by sub-segmental directional statistics
    Ding, Zhaogui
    Li, Weifeng
    Liao, Qingmin
    [J]. PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2016, : 2287 - 2291
  • [8] Dual-channel speech intelligibility enhancement based on the psychoacoustics
    Lee, Sang-Hoon
    Jeong, Hong
    [J]. LECTURE NOTES IN SIGNAL SCIENCE, INTERNET AND EDUCATION (SSIP'07/MIV'07/DIWEB'07), 2007, : 83 - +
  • [9] Unscented Transform-Based Dual-Channel Noise Estimation: Application to Speech Enhancement on Smartphones
    Lopez-Espejo, Ivan
    Martin-Donas, Juan M.
    Gomez, Angel M.
    Peinado, Antonio M.
    [J]. 2018 41ST INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2018, : 88 - 91
  • [10] Dual-channel DNN-based Speech Enhancement for Smartphones
    Martin-Donas, Juan M.
    Gomez, Angel M.
    Lopez-Espejo, Ivan
    Peinado, Antonio M.
    [J]. 2017 IEEE 19TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2017,