Lightweight target speaker separation network based on joint training

被引:0
|
作者
Jing Wang
Hanyue Liu
Liang Xu
Wenjing Yang
Weiming Yi
Fang Liu
机构
[1] Beijing Institute of Technology,School of Information and Electronics
[2] Beijing Institute of Technology,Key Laboratory of Language, Cognition and Computation Ministry of Industry and Information Technology, School of Foreign Languages
关键词
Target speaker separation; Lightweight network; Loss function; Joint training;
D O I
暂无
中图分类号
学科分类号
摘要
Target speaker separation aims to separate the speech components of the target speaker from mixed speech and remove extraneous components such as noise. In recent years, deep learning-based speech separation methods have made significant breakthroughs and have gradually become mainstream. However, these existing methods generally face problems with system latency and performance upper limits due to the large model size. To solve these problems, this paper proposes improvements in the network structure and training methods to enhance the model’s performance. A lightweight target speaker separation network based on long-short-term memory (LSTM) is proposed, which can reduce the model size and computational delay while maintaining the separation performance. Based on this, a target speaker separation method based on joint training is proposed to achieve the overall training and optimization of the target speaker separation system. Joint loss functions based on speaker registration and speaker separation are proposed for joint training of the network to further improve the system’s performance. The experimental results show that the lightweight target speaker separation network proposed in this paper has better performance while being lightweight, and joint training of the target speaker separation network with our proposed loss function can further improve the separation performance of the original model.
引用
收藏
相关论文
共 50 条
  • [1] Lightweight target speaker separation network based on joint training
    Wang, Jing
    Liu, Hanyue
    Xu, Liang
    Yang, Wenjing
    Yi, Weiming
    Liu, Fang
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)
  • [2] A Target Speaker Separation Neural Network with Joint-Training
    Yang, Wenjing
    Wang, Jing
    Li, Hongfeng
    Xu, Na
    Xiang, Fei
    Qian, Kai
    Hu, Shenghua
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 614 - 618
  • [3] Atss-Net: Target Speaker Separation via Attention-based Neural Network
    Li, Tingle
    Lin, Qingjian
    Bao, Yuanyuan
    Li, Ming
    INTERSPEECH 2020, 2020, : 1411 - 1415
  • [4] Rotating Target Detection Based on Lightweight Network
    Jiao, Yunxu
    Zhu, Qingmeng
    He, Hao
    Zhao, Tianci
    Wang, Haihui
    PRICAI 2022: TRENDS IN ARTIFICIAL INTELLIGENCE, PT III, 2022, 13631 : 619 - 630
  • [5] Speech Separation of A Target Speaker Based on Deep Neural Networks
    Du Jun
    Tu Yanhui
    Xu Yong
    Dai Lirong
    Chin-Hui, Lee
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 473 - 477
  • [6] Speaker Recognition Based on Lightweight Neural Network for Smart Home Solutions
    Ai, Haojun
    Xia, Wuyang
    Zhang, Quanxin
    CYBERSPACE SAFETY AND SECURITY, PT II, 2019, 11983 : 421 - 431
  • [7] Fast Armored Target Detection Based on Lightweight Network
    Sun H.
    Chang T.
    Zhang L.
    Yang G.
    Han B.
    Li Y.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (07): : 1110 - 1121
  • [8] Time-Domain Target-Speaker Speech Separation With Waveform-Based Speaker Embedding
    Zhao, Jianshu
    Gao, Shengzhou
    Shinozaki, Takahiro
    INTERSPEECH 2020, 2020, : 1436 - 1440
  • [9] Joint Sound Source Separation and Speaker Recognition
    Zegers, Jeroen
    Van Hamme, Hugo
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2228 - 2232
  • [10] A Lightweight Infrared Small Target Detection Network Based on Target Multiscale Context
    Ma, Tianlei
    Yang, Zhen
    Liu, Benxue
    Sun, Siyuan
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2023, 20