PHASE TIME-FREQUENCY MASKING BASED SPEECH ENHANCEMENT ALGORITHM USING CIRCULAR MICROPHONE ARRAY

被引:3
|
作者
He, Li [1 ]
Zhou, Yi [1 ]
Liu, Hongqing [1 ]
机构
[1] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing, Peoples R China
关键词
Time-frequency masking; phase; microphone array; postfilter;
D O I
10.1109/ICME.2019.00144
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
A novel time-frequency masking approach for circular microphone array speech enhancement in the presence of competing interference and background noise is proposed in this paper. Multichannel speech enhancement systems can often be constructed by a concatenation of a beamformer and a single-channel postfilter, which rely on accurate estimation of steering vector and the residual interference plus noise power spectrum density (PSD), respectively. However, the performance of existing multiple microphone speech enhancement algorithm will degrade in the presence of competing interference. The proposed phase-based time-frequency masking approach can improve the estimation of the steering vector and residual interference plus noise PSD in the presence of competing interference and background noise. The experimental analysis verifies the advantages achieved by the proposed method, in comparison with the state-of-the-art multiple microphone speech enhancement methods.
引用
收藏
页码:808 / 813
页数:6
相关论文
共 50 条
  • [1] Time-frequency masking for BSS problem using equilateral triangular microphone array
    Takenouchi, Y
    Hamada, N
    [J]. ISPACS 2005: PROCEEDINGS OF THE 2005 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, 2005, : 185 - 188
  • [2] TIME-FREQUENCY MASKING-BASED SPEECH ENHANCEMENT USING GENERATIVE ADVERSARIAL NETWORK
    Soni, Meet H.
    Shah, Neil
    Patil, Hemant A.
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5039 - 5043
  • [3] A Phase-Based Time-Frequency masking for multi-channel speech enhancement in domestic environments
    Brutti, Alessio
    Tsiami, Antigoni
    Katsamanis, Athanasios
    Maragos, Petros
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2875 - 2879
  • [4] Time-frequency masking based supervised speech enhancement framework using fuzzy deep belief network
    Samui, Suman
    Chakrabarti, Indrajit
    Ghosh, Soumya K.
    [J]. APPLIED SOFT COMPUTING, 2019, 74 : 583 - 602
  • [5] Robust speech separation using time-frequency masking
    Aarabi, P
    Shi, GJ
    Jahromi, O
    [J]. 2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 741 - 744
  • [6] Speech enhancement and recognition using circular microphone array for service robots
    Choi, C
    Kong, D
    Kim, J
    Bang, S
    [J]. IROS 2003: PROCEEDINGS OF THE 2003 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2003, : 3516 - 3521
  • [7] On time-frequency masking in voiced speech
    Skoglund, J
    Kleijn, WB
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (04): : 361 - 369
  • [8] Segmented Time-Frequency Masking Algorithm for Speech Separation Based on Deep Neural Networks
    Guo, Xinyu
    Ou, Shifeng
    Gao, Meng
    Gao, Ying
    [J]. 2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 445 - 450
  • [9] MULTICHANNEL SPEECH ENHANCEMENT BASED ON TIME-FREQUENCY MASKING USING SUBBAND LONG SHORT-TERM MEMORY
    Li, Xiaofei
    Horaud, Radu
    [J]. 2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 298 - 302
  • [10] A subband adaptive learning algorithm for microphone array based speech enhancement
    Wang, DX
    Yin, FL
    [J]. ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 2, PROCEEDINGS, 2005, 3497 : 592 - 597