CNN-Based Feature Integration Network for Speech Enhancement in Microphone Arrays

被引:0
|
作者
Xi, Ji [1 ]
Jiang, Pengxu [2 ]
Xie, Yue [3 ]
Jiang, Wei [1 ]
Ding, Hao [1 ]
机构
[1] Changzhou Inst Technol, Sch Comp Informat Engn, Changzhou 213022, Peoples R China
[2] Southeast Univ, Sch Informat Sci & Engn, Nanjing 210096, Peoples R China
[3] Nanjing Inst Technol, Sch Informat & Commun Engn, Nanjing 211167, Peoples R China
关键词
key speech enhancement; convolutional neural network; microphone arrays; deep learning;
D O I
10.1587/transinf.2024EDL8014
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The relevant model based on convolutional neural networks (CNNs) has been proven to be an effective solution in speech enhancement algorithms. However, there needs to be more research on CNNs based on microphone arrays, especially in exploring the correlation between networks associated with different microphones. In this paper, we proposed a CNN-based feature integration network for speech enhancement in microphone arrays. The input of CNN is composed of short-time Fourier transform (STFT) from different microphones. CNN includes the encoding layer, decoding layer, and skip structure. In addition, the designed feature integration layer enables information exchange between different microphones, and the designed feature fusion layer integrates additional information. The experiment proved the superiority of the designed structure.
引用
收藏
页码:1546 / 1549
页数:4
相关论文
共 50 条
  • [1] Location Feature Integration for Clustering-Based Speech Separation in Distributed Microphone Arrays
    Souden, Mehrez
    Kinoshita, Keisuke
    Delcroix, Marc
    Nakatani, Tomohiro
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2014, 22 (02) : 354 - 367
  • [2] A New Framework for CNN-Based Speech Enhancement in the Time Domain
    Pandey, Ashutosh
    Wang, DeLiang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2019, 27 (07) : 1179 - 1188
  • [3] Effects of Skip Connections in CNN-Based Architectures for Speech Enhancement
    Nengheng Zheng
    Yupeng Shi
    Weicong Rong
    Yuyong Kang
    Journal of Signal Processing Systems, 2020, 92 : 875 - 884
  • [4] Effects of Skip Connections in CNN-Based Architectures for Speech Enhancement
    Zheng, Nengheng
    Shi, Yupeng
    Rong, Weicong
    Kang, Yuyong
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2020, 92 (08): : 875 - 884
  • [5] Hybrid Acceleration of CNN-based Speech Enhancement on Embedded Platforms
    Li, Kaixu
    Pan, Ruixiang
    Wei, Lei
    Yan, Bo
    Lin, Jiazhen
    Zhang, Xiaoyan
    2021 6TH INTERNATIONAL CONFERENCE ON UK-CHINA EMERGING TECHNOLOGIES (UCET 2021), 2021, : 53 - 58
  • [6] Sequential CNN-based Enhancement of Ultrafast Ultrasound Imaging for Sparse Arrays
    Vinals, Roser
    Thiran, Jean-Philippe
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 755 - 759
  • [7] Speech enhancement by LSTM-based noise suppression followed by CNN-based speech restoration
    Strake, Maximilian
    Defraene, Bruno
    Fluyt, Kristoff
    Tirry, Wouter
    Fingscheidt, Tim
    EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING, 2020, 2020 (01)
  • [8] Speech enhancement by LSTM-based noise suppression followed by CNN-based speech restoration
    Maximilian Strake
    Bruno Defraene
    Kristoff Fluyt
    Wouter Tirry
    Tim Fingscheidt
    EURASIP Journal on Advances in Signal Processing, 2020
  • [9] Signals hierarchical feature enhancement method for CNN-based fault diagnosis
    Zhang, Huang
    Zhang, Shuyou
    Wang, Zili
    Qiu, Lemiao
    Zhang, Yiming
    ADVANCES IN MECHANICAL ENGINEERING, 2022, 14 (09)
  • [10] Speech Enhancement Algorithm Based on Microphone Array and Multi-Channel Parallel GRU-CNN Network
    Xi, Ji
    Xu, Zhe
    Zhang, Weiqi
    Xie, Yue
    Zhao, Li
    ELECTRONICS, 2025, 14 (04):