PHASE TIME-FREQUENCY MASKING BASED SPEECH ENHANCEMENT ALGORITHM USING CIRCULAR MICROPHONE ARRAY

Cited by: 3

Authors:

He, Li [1]

Zhou, Yi [1]

Liu, Hongqing [1]

Affiliations:

[1] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing, Peoples R China

Source:

2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME) | 2019

Keywords:

Time-frequency masking; phase; microphone array; postfilter

DOI:

10.1109/ICME.2019.00144

Chinese Library Classification (CLC) number:

TP31 [Computer software]

Subject classification codes:

081202; 0835

Abstract:

This paper proposes a novel time-frequency masking approach for speech enhancement with a circular microphone array in the presence of competing interference and background noise. Multichannel speech enhancement systems are often built by concatenating a beamformer and a single-channel postfilter, which rely on accurate estimates of the steering vector and of the residual interference-plus-noise power spectral density (PSD), respectively. However, the performance of existing multi-microphone speech enhancement algorithms degrades in the presence of competing interference. The proposed phase-based time-frequency masking approach improves the estimation of both the steering vector and the residual interference-plus-noise PSD under these conditions. Experimental analysis verifies the advantages of the proposed method over state-of-the-art multi-microphone speech enhancement methods.
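The abstract does not give the mask formula, so the following is only a generic sketch of the idea behind phase-based time-frequency masking, not the authors' algorithm. It assumes two microphones, a known target delay `tau`, and a hard threshold on the phase error; the function name `phase_mask`, the threshold value, and the synthetic spectra are all hypothetical choices made for illustration.

```python
import numpy as np

def phase_mask(X1, X2, freqs, tau, threshold=0.5):
    """Binary time-frequency mask from the inter-microphone phase difference.

    X1, X2    : complex spectra of two microphones, shape (n_freqs, n_frames)
    freqs     : bin centre frequencies in Hz, shape (n_freqs,)
    tau       : assumed delay in seconds of the target at mic 2 relative to mic 1
    threshold : largest accepted phase error in radians (illustrative value)
    """
    # Observed phase difference between the two channels in every bin.
    observed = np.angle(X1 * np.conj(X2))
    # Phase difference expected if the bin is dominated by the target.
    expected = 2.0 * np.pi * freqs[:, None] * tau
    # Wrap the mismatch to (-pi, pi] before thresholding.
    error = np.angle(np.exp(1j * (observed - expected)))
    return (np.abs(error) < threshold).astype(float)

# Synthetic spectra: one source seen with the target delay, one with another delay.
rng = np.random.default_rng(0)
freqs = np.linspace(100.0, 4000.0, 40)
X1 = rng.standard_normal((40, 10)) + 1j * rng.standard_normal((40, 10))
tau_target = 1e-4
X2_target = X1 * np.exp(-2j * np.pi * freqs[:, None] * tau_target)
X2_interf = X1 * np.exp(-2j * np.pi * freqs[:, None] * (-1e-4))

mask_target = phase_mask(X1, X2_target, freqs, tau_target)
mask_interf = phase_mask(X1, X2_interf, freqs, tau_target)
print(mask_target.mean(), mask_interf.mean())
```

In this toy setup, bins whose phase difference matches the target delay are kept and most bins from the other direction are rejected. A mask of this kind could then be used to select the bins from which a steering vector or a residual interference-plus-noise PSD is estimated, which is the role the abstract assigns to the mask.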

Pages: 808-813

Number of pages: 6

50 records in total

[1] Time-frequency masking for BSS problem using equilateral triangular microphone array
Takenouchi, Y
Hamada, N
[J]. ISPACS 2005: PROCEEDINGS OF THE 2005 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, 2005, : 185 - 188
[2] TIME-FREQUENCY MASKING-BASED SPEECH ENHANCEMENT USING GENERATIVE ADVERSARIAL NETWORK
Soni, Meet H.
Shah, Neil
Patil, Hemant A.
[J]. 2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5039 - 5043
[3] A Phase-Based Time-Frequency masking for multi-channel speech enhancement in domestic environments
Brutti, Alessio
Tsiami, Antigoni
Katsamanis, Athanasios
Maragos, Petros
[J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2875 - 2879
[4] Time-frequency masking based supervised speech enhancement framework using fuzzy deep belief network
Samui, Suman
Chakrabarti, Indrajit
Ghosh, Soumya K.
[J]. APPLIED SOFT COMPUTING, 2019, 74 : 583 - 602
[5] Robust speech separation using time-frequency masking
Aarabi, P
Shi, GJ
Jahromi, O
[J]. 2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I, PROCEEDINGS, 2003, : 741 - 744
[6] Speech enhancement and recognition using circular microphone array for service robots
Choi, C
Kong, D
Kim, J
Bang, S
[J]. IROS 2003: PROCEEDINGS OF THE 2003 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4, 2003, : 3516 - 3521
[7] On time-frequency masking in voiced speech
Skoglund, J
Kleijn, WB
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2000, 8 (04): : 361 - 369
[8] Segmented Time-Frequency Masking Algorithm for Speech Separation Based on Deep Neural Networks
Guo, Xinyu
Ou, Shifeng
Gao, Meng
Gao, Ying
[J]. 2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 445 - 450
[9] MULTICHANNEL SPEECH ENHANCEMENT BASED ON TIME-FREQUENCY MASKING USING SUBBAND LONG SHORT-TERM MEMORY
Li, Xiaofei
Horaud, Radu
[J]. 2019 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2019, : 298 - 302
[10] A subband adaptive learning algorithm for microphone array based speech enhancement
Wang, DX
Yin, FL
[J]. ADVANCES IN NEURAL NETWORKS - ISNN 2005, PT 2, PROCEEDINGS, 2005, 3497 : 592 - 597
