SPATIAL-TEMPORAL GRAPH CONVOLUTION NETWORK FOR MULTICHANNEL SPEECH ENHANCEMENT

被引:4
|
作者
Hao, Minghui [1 ]
Yu, Jingjing [1 ]
Zhang, Luyao [1 ]
机构
[1] Beijing Jiaotong Univ, Elect & Informat Engn, Beijing, Peoples R China
关键词
Graph convolution network; spatial dependency extraction; spatial-temporal convolution module; SII-weighted loss function; speech enhancement;
D O I
10.1109/ICASSP43922.2022.9746054
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Spatial dependency related to distributed microphone positions is essential for multichannel speech enhancement task. It is still challenging due to lack of accurate array positions and complex spatial-temporal relations of multichannel noisy signals This paper proposes a spatial-temporal graph convolutional network composed of cascaded spatial-temporal (ST) modules with channel fusion. Without any prior information of array and acoustic scene, a graph convolution block is designed with learnable adjacency matrix to capture the spatial dependency of pairwise channels. Then, it is embedded with time-frequency convolution block as the ST module to fuse the multi-dimensional correlation features for target speech estimation. Furthermore, a novel weighted loss function based on speech intelligibility index (SII) is proposed to assign more attention for the important bands of human understanding during network training. Our framework is demonstrated to achieve over 11% performance improvement on PESQ and intelligibility against prior state-of-the-art approaches in multi-scene speech enhancement experiments.
引用
收藏
页码:6512 / 6516
页数:5
相关论文
共 50 条
  • [41] Graph Spatial-Temporal Transformer Network for Traffic Prediction
    Zhao, Zhenzhen
    Shen, Guojiang
    Wang, Lei
    Kong, Xiangjie
    BIG DATA RESEARCH, 2024, 36
  • [42] Spatial-temporal knowledge graph network for event prediction
    Huai, Zepeng
    Zhang, Dawei
    Yang, Guohua
    Tao, Jianhua
    NEUROCOMPUTING, 2023, 553
  • [43] Spatial-temporal graph convolution network model with traffic fundamental diagram information informed for network traffic flow prediction
    Liu, Zhao
    Ding, Fan
    Dai, Yunqi
    Li, Linchao
    Chen, Tianyi
    Tan, Huachun
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 249
  • [44] Spatial-Temporal Synchronous Graph Convolutional Networks: A New Framework for Spatial-Temporal Network Data Forecasting
    Song, Chao
    Lin, Youfang
    Guo, Shengnan
    Wan, Huaiyu
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 914 - 921
  • [45] Graph Convolution Based Spatial-Temporal Attention LSTM Model for Flood Forecasting
    Feng, Jun
    Sha, Haichao
    Ding, Yukai
    Yan, Le
    Yu, Zhangheng
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [46] Sign Language Recognition Based on Spatial-Temporal Graph Convolution-Transformer
    Takayama, Natsuki
    Benitez-Garcia, Gibran
    Takahashi, Hiroki
    Seimitsu Kogaku Kaishi/Journal of the Japan Society for Precision Engineering, 2021, 87 (12): : 1028 - 1035
  • [47] Predicting Traffic Flow Using Dynamic Spatial-Temporal Graph Convolution Networks
    Liu, Yunchang
    Wan, Fei
    Liang, Chengwu
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 78 (03): : 4343 - 4361
  • [48] Multi-Stream and Enhanced Spatial-Temporal Graph Convolution Network for Skeleton-Based Action Recognition
    Li, Fanjia
    Zhu, Aichun
    Xu, Yonggang
    Cui, Ran
    Hua, Gang
    IEEE ACCESS, 2020, 8 : 97757 - 97770
  • [49] Multi-Branch Spatial-Temporal Attention Graph Convolution Network for Skeleton-based Action Recognition
    Wang, Daoshuai
    Li, Dewei
    Guan, Yaonan
    Wang, Gang
    Shao, Haibin
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 6487 - 6492
  • [50] Network-wide Traffic Flow Prediction Research Based on DTW Algorithm Spatial-temporal Graph Convolution
    Liu Y.-C.
    Li Z.-P.
    Lv C.-P.
    Zhang T.
    Liu Y.
    Jiaotong Yunshu Xitong Gongcheng Yu Xinxi/Journal of Transportation Systems Engineering and Information Technology, 2022, 22 (03): : 147 - 157and178