SPATIAL-TEMPORAL GRAPH CONVOLUTION NETWORK FOR MULTICHANNEL SPEECH ENHANCEMENT

被引:4
|
作者
Hao, Minghui [1 ]
Yu, Jingjing [1 ]
Zhang, Luyao [1 ]
机构
[1] Beijing Jiaotong Univ, Elect & Informat Engn, Beijing, Peoples R China
关键词
Graph convolution network; spatial dependency extraction; spatial-temporal convolution module; SII-weighted loss function; speech enhancement;
D O I
10.1109/ICASSP43922.2022.9746054
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Spatial dependency related to distributed microphone positions is essential for multichannel speech enhancement task. It is still challenging due to lack of accurate array positions and complex spatial-temporal relations of multichannel noisy signals This paper proposes a spatial-temporal graph convolutional network composed of cascaded spatial-temporal (ST) modules with channel fusion. Without any prior information of array and acoustic scene, a graph convolution block is designed with learnable adjacency matrix to capture the spatial dependency of pairwise channels. Then, it is embedded with time-frequency convolution block as the ST module to fuse the multi-dimensional correlation features for target speech estimation. Furthermore, a novel weighted loss function based on speech intelligibility index (SII) is proposed to assign more attention for the important bands of human understanding during network training. Our framework is demonstrated to achieve over 11% performance improvement on PESQ and intelligibility against prior state-of-the-art approaches in multi-scene speech enhancement experiments.
引用
收藏
页码:6512 / 6516
页数:5
相关论文
共 50 条
  • [1] Multichannel spatial-temporal graph convolution network based on spectrum decomposition for traffic prediction
    Lei, Tianyang
    Yang, Kewei
    Li, Jichao
    Chen, Gang
    Jiang, Jiuyao
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [2] STAGCN: Spatial-Temporal Attention Graph Convolution Network for Traffic Forecasting
    Gu, Yafeng
    Deng, Li
    MATHEMATICS, 2022, 10 (09)
  • [3] Spatial-Temporal Complex Graph Convolution Network for Traffic Flow Prediction
    Bao, Yinxin
    Huang, Jiashuang
    Shen, Qinqin
    Cao, Yang
    Ding, Weiping
    Shi, Zhenquan
    Shi, Quan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 121
  • [4] Dual Dynamic Spatial-Temporal Graph Convolution Network for Traffic Prediction
    Sun, Yanfeng
    Jiang, Xiangheng
    Hu, Yongli
    Duan, Fuqing
    Guo, Kan
    Wang, Boyue
    Gao, Junbin
    Yin, Baocai
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (12) : 23680 - 23693
  • [5] Spatial-temporal Graph Transformer Network for Spatial-temporal Forecasting
    Dao, Minh-Son
    Zetsu, Koji
    Hoang, Duy-Tang
    Proceedings - 2024 IEEE International Conference on Big Data, BigData 2024, 2024, : 1276 - 1281
  • [6] Spatial-Temporal Dynamic Graph Convolution Neural Network for Air Quality Prediction
    Xiaocao, Ouyang
    Yang, Yan
    Zhang, Yiling
    Zhou, Wei
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [7] Power load forecasting based on spatial-temporal fusion graph convolution network
    Jiang, He
    Dong, Yawei
    Dong, Yao
    Wang, Jianzhou
    TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2024, 204
  • [8] Spatial-Temporal Residual Multi-Graph Convolution Network for Traffic Forecasting
    Xi'an Jiaotong University, School of Computer Science and Technology, Xi'an, China
    不详
    IEEE Int. Conf. Data Sci. Adv. Anal., DSAA - Proc., 2023,
  • [9] Attention Mechanism Based Spatial-Temporal Graph Convolution Network for Traffic Prediction
    Xiao, Wenjuan
    Wang, Xiaoming
    Journal of Computers (Taiwan), 2024, 35 (04) : 93 - 108
  • [10] Multi-dimensional spatial-temporal graph convolution for urban sensors imputation and enhancement
    Huang, Longji
    Huang, Jianbin
    Li, He
    Cui, Jiangtao
    KNOWLEDGE-BASED SYSTEMS, 2023, 278