CNN-Based Feature Integration Network for Speech Enhancement in Microphone Arrays

被引：0

作者：

Xi, Ji ^{[1
]}

Jiang, Pengxu ^{[2
]}

Xie, Yue ^{[3
]}

Jiang, Wei ^{[1
]}

Ding, Hao ^{[1
]}

机构：

[1] Changzhou Inst Technol, Sch Comp Informat Engn, Changzhou 213022, Peoples R China

[2] Southeast Univ, Sch Informat Sci & Engn, Nanjing 210096, Peoples R China

[3] Nanjing Inst Technol, Sch Informat & Commun Engn, Nanjing 211167, Peoples R China

来源：

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2024年 / E107卷 / 12期

关键词：

key speech enhancement; convolutional neural network; microphone arrays; deep learning;

D O I：

10.1587/transinf.2024EDL8014

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The relevant model based on convolutional neural networks (CNNs) has been proven to be an effective solution in speech enhancement algorithms. However, there needs to be more research on CNNs based on microphone arrays, especially in exploring the correlation between networks associated with different microphones. In this paper, we proposed a CNN-based feature integration network for speech enhancement in microphone arrays. The input of CNN is composed of short-time Fourier transform (STFT) from different microphones. CNN includes the encoding layer, decoding layer, and skip structure. In addition, the designed feature integration layer enables information exchange between different microphones, and the designed feature fusion layer integrates additional information. The experiment proved the superiority of the designed structure.

引用

页码：1546 / 1549

页数：4

共 50 条

[21] DNN-BASED DISTRIBUTED MULTICHANNEL MASK ESTIMATION FOR SPEECH ENHANCEMENT IN MICROPHONE ARRAYS
Furnon, Nicolas
Serizel, Romain
Illina, Irina
Essid, Slim
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4672 - 4676
[22] Microphone Array Speech Enhancement Via Beamforming Based Deep Learning Network
Pathrose, Jeyasingh
Ismail, M. Mohamed
Mohan, P. Madhan
INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (07) : 781 - 790
[23] CNN-Based Chinese Character Recognition with Skeleton Feature
Tang, Wei
Su, Yijun
Li, Xiang
Zha, Daren
Jiang, Weiyu
Gao, Neng
Xiang, Ji
NEURAL INFORMATION PROCESSING (ICONIP 2018), PT V, 2018, 11305 : 461 - 472
[24] A CNN-based approach to identification of degradations in speech signals
Yuki Saishu
Amir Hossein Poorjam
Mads Græsbøll Christensen
EURASIP Journal on Audio, Speech, and Music Processing, 2021
[25] A CNN-based approach to identification of degradations in speech signals
Saishu, Yuki
Poorjam, Amir Hossein
Christensen, Mads Graesboll
EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)
[26] CNN-Based Feature Fusion Motor Fault Diagnosis
Qian, Long
Li, Binbin
Chen, Lijuan
ELECTRONICS, 2022, 11 (17)
[27] FPGA based dual microphone speech enhancement
Tanmay Biswas
Sudhindu Bikash Mandal
Debasri Saha
Amlan Chakrabarti
Microsystem Technologies, 2019, 25 : 765 - 775
[28] FPGA based dual microphone speech enhancement
Biswas, Tanmay
Mandal, Sudhindu Bikash
Saha, Debasri
Chakrabarti, Amlan
MICROSYSTEM TECHNOLOGIES-MICRO-AND NANOSYSTEMS-INFORMATION STORAGE AND PROCESSING SYSTEMS, 2019, 25 (03): : 765 - 775
[29] An iterative spatio-temporal speech enhancement algorithm for microphone arrays
Gupta, Malay
Douglas, Scott C.
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 81 - 84
[30] Speech Enhancement in Distributed Microphone Arrays Using Polynomial Eigenvalue Decomposition
d'Olne, Emilie
Neo, Vincent W.
Naylor, Patrick A.
2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 55 - 59

← 1 2 3 4 5 →