FULLSUBNET: A FULL-BAND AND SUB-BAND FUSION MODEL FOR REAL-TIME SINGLE-CHANNEL SPEECH ENHANCEMENT

被引:77
|
作者
Hao, Xiang [1 ,2 ,3 ]
Su, Xiangdong [3 ]
Horaud, Radu [4 ]
Li, Xiaofei [1 ,2 ]
机构
[1] Westlake Univ, Hangzhou, Peoples R China
[2] Westlake Inst Adv Study, Hangzhou, Peoples R China
[3] Inner Mongolia Univ, Coll Comp Sci, Hohhot, Peoples R China
[4] Inria Grenoble Rhone Alpes, Montbonnot St Martin, France
关键词
FullSubNet; Full-band and Sub-band Fusion; Sub-band; Speech Enhancement;
D O I
10.1109/ICASSP39728.2021.9414177
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes a full-band and sub-band fusion model, named as FullSubNet, for single-channel real-time speech enhancement. Full-band and sub-band refer to the models that input full-band and sub-band noisy spectral feature, output full-band and sub-band speech target, respectively. The sub-band model processes each frequency independently. Its input consists of one frequency and several context frequencies. The output is the prediction of the clean speech target for the corresponding frequency. These two types of models have distinct characteristics. The full-band model can capture the global spectral context and the long-distance cross-band dependencies. However, it lacks the ability to modeling signal stationarity and attending the local spectral pattern. The sub-band model is just the opposite. In our proposed FullSubNet, we connect a pure full-band model and a pure sub-band model sequentially and use practical joint training to integrate these two types of models' advantages. We conducted experiments on the DNS challenge (INTERSPEECH 2020) dataset to evaluate the proposed method. Experimental results show that full-band and sub-band information are complementary, and the FullSubNet can effectively integrate them. Besides, the performance of the FullSubNet also exceeds that of the top-ranked methods in the DNS Challenge (INTERSPEECH 2020).
引用
收藏
页码:6633 / 6637
页数:5
相关论文
共 50 条
  • [1] Lightweight Full-band and Sub-band Fusion Network for Real Time Speech Enhancement
    Chen, Zhuangqi
    Zhang, Pingjian
    [J]. INTERSPEECH 2022, 2022, : 921 - 925
  • [2] Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Full-Band Speech Enhancement
    Yu, Guochen
    Li, Andong
    Liu, Wenzhe
    Zheng, Chengshi
    Wang, Yutian
    Wang, Hui
    [J]. 2022 13TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2022, : 483 - 487
  • [3] Single-Channel Speech Enhancement Based on Sub-Band Spectral Entropy
    Wei, Yi
    Zeng, Yumin
    Li, Chen
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2018, 66 (03): : 100 - 113
  • [4] DETECTING ADHD FROM SPEECH USING FULL-BAND AND SUB-BAND CONVOLUTION FUSION NETWORK
    Li, Shuanglin
    Nair, Rajesh
    Naqvi, Syed Mohsen
    [J]. 2023 IEEE SENSORS, 2023,
  • [5] DPT-FSNET: DUAL-PATH TRANSFORMER BASED FULL-BAND AND SUB-BAND FUSION NETWORK FOR SPEECH ENHANCEMENT
    Dang, Feng
    Chen, Hangting
    Zhangt, Pengyuan
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6857 - 6861
  • [6] TS-CGANet: A Two-Stage Complex and Real Dual-Path Sub-Band Fusion Network for Full-Band Speech Enhancement
    Chen, Haozhe
    Zhang, Xiaojuan
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (07):
  • [7] ADAPTIVE-FSN: INTEGRATING FULL-BAND EXTRACTION AND ADAPTIVE SUB-BAND ENCODING FOR MONAURAL SPEECH ENHANCEMENT
    Tsao, Yu-Sheng
    Ho, Kuan-Hsun
    Hung, Jeih-Weih
    Chen, Berlin
    [J]. 2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 458 - 464
  • [8] A Hybrid DSP/Deep Learning Approach to Real-Time Full-Band Speech Enhancement
    Valin, Jean-Marc
    [J]. 2018 IEEE 20TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2018,
  • [9] FSI-Net: A dual-stage full- and sub-band integration network for full-band speech enhancement
    Yu, Guochen
    Wang, Hui
    Li, Andong
    Liu, Wenzhe
    Zhang, Yuan
    Wang, Yutian
    Zheng, Chengshi
    [J]. APPLIED ACOUSTICS, 2023, 211
  • [10] Real-Time Full-Band Voice Conversion with Sub-Band Modeling and Data-Driven Phase Estimation of Spectral Differentials
    Saeki, Takaaki
    Saito, Yuki
    Takamichi, Shinnosuke
    Saruwatari, Hiroshi
    [J]. IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2021, E104D (07) : 1002 - 1016