Speech Separation based on Deep Belief Network

Cited by: 0
Authors
Wu Haijia [1]
Zhang Xiongwei [1]
Zhang Liangliang [1]
Zou Xia [1]
Affiliation
[1] PLA Univ Sci & Technol, Coll Command Informat & Syst, Nanjing 210007, Jiangsu, Peoples R China
Keywords
speech separation; deep learning; deep belief network; restricted Boltzmann machine; autoencoder; SIGNAL; SEGREGATION
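DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract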
Thanks to its hierarchical and generative nature, the Deep Belief Network (DBN) is effective for feature representation and extraction in signal processing. In this paper, the DBN is investigated and applied to monaural speech separation. First, two separate DBNs are trained to extract features from the mixed noisy signals and from the target clean speech, respectively. Next, the two sets of extracted features are linked by training a back-propagation (BP) neural network that maps the features of the mixed signals to the features of the target speech. Finally, by applying the DBNs and this mapping network, the target speech can be estimated from the input mixed signals. Experiments are conducted on three kinds of mixed signals: female/male speech mixtures, human-speech/Gaussian-noise mixtures, and human-speech/music mixtures. The PESQ scores of the extracted speech are 3.32, 2.59, and 3.42, respectively, which indicates that the model performs well on speech separation tasks, especially on mixtures where the interfering signals have obvious spectral structure.
Pages: 1486-1493
Page count: 8
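
The three stages described in the abstract (feature extraction for the mixture, feature mapping, reconstruction of the target speech) can be illustrated with a toy sketch. It is not the authors' implementation and makes several assumptions: each DBN is reduced to a single restricted Boltzmann machine trained with one-step contrastive divergence, the BP network is reduced to one sigmoid layer trained by gradient descent, and the input frames are synthetic vectors in [0, 1] rather than real spectra. The names RBM, Mapper, and separate are hypothetical.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

class RBM:
    """One RBM layer with unit activations in [0, 1] (stand-in for a full DBN)."""
    def __init__(self, n_vis, n_hid, rng):
        self.W = [[rng.gauss(0.0, 0.1) for _ in range(n_hid)] for _ in range(n_vis)]
        self.b = [0.0] * n_vis  # visible biases
        self.c = [0.0] * n_hid  # hidden biases

    def encode(self, v):
        return [sigmoid(self.c[j] + sum(v[i] * self.W[i][j] for i in range(len(v))))
                for j in range(len(self.c))]

    def decode(self, h):
        return [sigmoid(self.b[i] + sum(h[j] * self.W[i][j] for j in range(len(h))))
                for i in range(len(self.b))]

    def cd1_step(self, v0, lr, rng):
        """One contrastive-divergence (CD-1) update on a single frame."""
        h0 = self.encode(v0)
        h_sample = [1.0 if rng.random() < p else 0.0 for p in h0]
        v1 = self.decode(h_sample)
        h1 = self.encode(v1)
        for i in range(len(v0)):
            for j in range(len(h0)):
                self.W[i][j] += lr * (v0[i] * h0[j] - v1[i] * h1[j])
            self.b[i] += lr * (v0[i] - v1[i])
        for j in range(len(h0)):
            self.c[j] += lr * (h0[j] - h1[j])

class Mapper:
    """Single sigmoid layer; an assumed simplification of the BP network."""
    def __init__(self, n_in, n_out, rng):
        self.W = [[rng.gauss(0.0, 0.1) for _ in range(n_out)] for _ in range(n_in)]
        self.b = [0.0] * n_out

    def forward(self, x):
        return [sigmoid(self.b[j] + sum(x[i] * self.W[i][j] for i in range(len(x))))
                for j in range(len(self.b))]

    def train_step(self, x, target, lr):
        y = self.forward(x)
        # Gradient of the squared error, propagated through the sigmoid output.
        delta = [(y[j] - target[j]) * y[j] * (1.0 - y[j]) for j in range(len(y))]
        for i in range(len(x)):
            for j in range(len(y)):
                self.W[i][j] -= lr * x[i] * delta[j]
        for j in range(len(y)):
            self.b[j] -= lr * delta[j]

def separate(frame, mix_rbm, mapper, clean_rbm):
    """Mixture frame -> mixture features -> clean features -> clean estimate."""
    return clean_rbm.decode(mapper.forward(mix_rbm.encode(frame)))

rng = random.Random(0)
n_bins, n_hid = 8, 4
# Synthetic data: random "clean" frames, plus a fixed offset on even bins as interference.
clean = [[rng.random() * 0.5 for _ in range(n_bins)] for _ in range(50)]
mixed = [[min(1.0, c + 0.3) if k % 2 == 0 else c for k, c in enumerate(f)] for f in clean]

# Stage 1: train one feature extractor per signal type.
mix_rbm, clean_rbm = RBM(n_bins, n_hid, rng), RBM(n_bins, n_hid, rng)
for _ in range(20):
    for m, c in zip(mixed, clean):
        mix_rbm.cd1_step(m, 0.05, rng)
        clean_rbm.cd1_step(c, 0.05, rng)

# Stage 2: learn the mapping from mixture features to clean-speech features.
mapper = Mapper(n_hid, n_hid, rng)
for _ in range(20):
    for m, c in zip(mixed, clean):
        mapper.train_step(mix_rbm.encode(m), clean_rbm.encode(c), 0.1)

# Stage 3: estimate the target frame from a mixture frame.
estimate = separate(mixed[0], mix_rbm, mapper, clean_rbm)
```

The point of the sketch is the data flow in separate: the mixture encoder and the clean-speech decoder are trained independently, and only the mapping layer sees paired examples. Layer sizes, learning rates, and epoch counts here are arbitrary illustration values, not figures from the paper.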