CNN-Based Feature Integration Network for Speech Enhancement in Microphone Arrays

被引：0

作者：

Xi, Ji ^{[1
]}

Jiang, Pengxu ^{[2
]}

Xie, Yue ^{[3
]}

Jiang, Wei ^{[1
]}

Ding, Hao ^{[1
]}

机构：

[1] Changzhou Inst Technol, Sch Comp Informat Engn, Changzhou 213022, Peoples R China

[2] Southeast Univ, Sch Informat Sci & Engn, Nanjing 210096, Peoples R China

[3] Nanjing Inst Technol, Sch Informat & Commun Engn, Nanjing 211167, Peoples R China

来源：

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2024年 / E107卷 / 12期

关键词：

key speech enhancement; convolutional neural network; microphone arrays; deep learning;

D O I：

10.1587/transinf.2024EDL8014

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The relevant model based on convolutional neural networks (CNNs) has been proven to be an effective solution in speech enhancement algorithms. However, there needs to be more research on CNNs based on microphone arrays, especially in exploring the correlation between networks associated with different microphones. In this paper, we proposed a CNN-based feature integration network for speech enhancement in microphone arrays. The input of CNN is composed of short-time Fourier transform (STFT) from different microphones. CNN includes the encoding layer, decoding layer, and skip structure. In addition, the designed feature integration layer enables information exchange between different microphones, and the designed feature fusion layer integrates additional information. The experiment proved the superiority of the designed structure.

引用

页码：1546 / 1549

页数：4

共 50 条

[1] Location Feature Integration for Clustering-Based Speech Separation in Distributed Microphone Arrays
Souden, Mehrez
Kinoshita, Keisuke
Delcroix, Marc
Nakatani, Tomohiro
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 354 - 367
[2] A New Framework for CNN-Based Speech Enhancement in the Time Domain
Pandey, Ashutosh
Wang, DeLiang
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (07) : 1179 - 1188
[3] Effects of Skip Connections in CNN-Based Architectures for Speech Enhancement
Nengheng Zheng
Yupeng Shi
Weicong Rong
Yuyong Kang
Journal of Signal Processing Systems, 2020, 92 : 875 - 884
[4] Effects of Skip Connections in CNN-Based Architectures for Speech Enhancement
Zheng, Nengheng
Shi, Yupeng
Rong, Weicong
Kang, Yuyong
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2020, 92 (08): : 875 - 884
[5] Hybrid Acceleration of CNN-based Speech Enhancement on Embedded Platforms
Li, Kaixu
Pan, Ruixiang
Wei, Lei
Yan, Bo
Lin, Jiazhen
Zhang, Xiaoyan
2021 6TH INTERNATIONAL CONFERENCE ON UK-CHINA EMERGING TECHNOLOGIES (UCET 2021), 2021, : 53 - 58
[6] Sequential CNN-based Enhancement of Ultrafast Ultrasound Imaging for Sparse Arrays
Vinals, Roser
Thiran, Jean-Philippe
32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 755 - 759
[7] Speech enhancement by LSTM-based noise suppression followed by CNN-based speech restoration
Strake, Maximilian
Defraene, Bruno
Fluyt, Kristoff
Tirry, Wouter
Fingscheidt, Tim
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2020, 2020 (01)
[8] Speech enhancement by LSTM-based noise suppression followed by CNN-based speech restoration
Maximilian Strake
Bruno Defraene
Kristoff Fluyt
Wouter Tirry
Tim Fingscheidt
EURASIP Journal on Advances in Signal Processing, 2020
[9] Signals hierarchical feature enhancement method for CNN-based fault diagnosis
Zhang, Huang
Zhang, Shuyou
Wang, Zili
Qiu, Lemiao
Zhang, Yiming
ADVANCES IN MECHANICAL ENGINEERING, 2022, 14 (09)
[10] Speech Enhancement Algorithm Based on Microphone Array and Multi-Channel Parallel GRU-CNN Network
Xi, Ji
Xu, Zhe
Zhang, Weiqi
Xie, Yue
Zhao, Li
ELECTRONICS, 2025, 14 (04):

← 1 2 3 4 5 →