CNN-Based Feature Integration Network for Speech Enhancement in Microphone Arrays

被引:0
|
作者
Xi, Ji [1 ]
Jiang, Pengxu [2 ]
Xie, Yue [3 ]
Jiang, Wei [1 ]
Ding, Hao [1 ]
机构
[1] Changzhou Inst Technol, Sch Comp Informat Engn, Changzhou 213022, Peoples R China
[2] Southeast Univ, Sch Informat Sci & Engn, Nanjing 210096, Peoples R China
[3] Nanjing Inst Technol, Sch Informat & Commun Engn, Nanjing 211167, Peoples R China
关键词
key speech enhancement; convolutional neural network; microphone arrays; deep learning;
D O I
10.1587/transinf.2024EDL8014
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The relevant model based on convolutional neural networks (CNNs) has been proven to be an effective solution in speech enhancement algorithms. However, there needs to be more research on CNNs based on microphone arrays, especially in exploring the correlation between networks associated with different microphones. In this paper, we proposed a CNN-based feature integration network for speech enhancement in microphone arrays. The input of CNN is composed of short-time Fourier transform (STFT) from different microphones. CNN includes the encoding layer, decoding layer, and skip structure. In addition, the designed feature integration layer enables information exchange between different microphones, and the designed feature fusion layer integrates additional information. The experiment proved the superiority of the designed structure.
引用
收藏
页码:1546 / 1549
页数:4
相关论文
共 50 条
  • [21] DNN-BASED DISTRIBUTED MULTICHANNEL MASK ESTIMATION FOR SPEECH ENHANCEMENT IN MICROPHONE ARRAYS
    Furnon, Nicolas
    Serizel, Romain
    Illina, Irina
    Essid, Slim
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4672 - 4676
  • [22] Microphone Array Speech Enhancement Via Beamforming Based Deep Learning Network
    Pathrose, Jeyasingh
    Ismail, M. Mohamed
    Mohan, P. Madhan
    INTERNATIONAL JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING SYSTEMS, 2023, 14 (07) : 781 - 790
  • [23] CNN-Based Chinese Character Recognition with Skeleton Feature
    Tang, Wei
    Su, Yijun
    Li, Xiang
    Zha, Daren
    Jiang, Weiyu
    Gao, Neng
    Xiang, Ji
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT V, 2018, 11305 : 461 - 472
  • [24] A CNN-based approach to identification of degradations in speech signals
    Yuki Saishu
    Amir Hossein Poorjam
    Mads Græsbøll Christensen
    EURASIP Journal on Audio, Speech, and Music Processing, 2021
  • [25] A CNN-based approach to identification of degradations in speech signals
    Saishu, Yuki
    Poorjam, Amir Hossein
    Christensen, Mads Graesboll
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2021, 2021 (01)
  • [26] CNN-Based Feature Fusion Motor Fault Diagnosis
    Qian, Long
    Li, Binbin
    Chen, Lijuan
    ELECTRONICS, 2022, 11 (17)
  • [27] FPGA based dual microphone speech enhancement
    Tanmay Biswas
    Sudhindu Bikash Mandal
    Debasri Saha
    Amlan Chakrabarti
    Microsystem Technologies, 2019, 25 : 765 - 775
  • [28] FPGA based dual microphone speech enhancement
    Biswas, Tanmay
    Mandal, Sudhindu Bikash
    Saha, Debasri
    Chakrabarti, Amlan
    MICROSYSTEM TECHNOLOGIES-MICRO-AND NANOSYSTEMS-INFORMATION STORAGE AND PROCESSING SYSTEMS, 2019, 25 (03): : 765 - 775
  • [29] An iterative spatio-temporal speech enhancement algorithm for microphone arrays
    Gupta, Malay
    Douglas, Scott C.
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 81 - 84
  • [30] Speech Enhancement in Distributed Microphone Arrays Using Polynomial Eigenvalue Decomposition
    d'Olne, Emilie
    Neo, Vincent W.
    Naylor, Patrick A.
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 55 - 59