SEMI-BLIND SPEECH ENHANCEMENT BASED ON RECURRENT NEURAL NETWORK FOR SOURCE SEPARATION AND DEREVERBERATION

被引:0
|
作者
Wake, Masaya [1 ]
Bando, Yoshiaki [1 ]
Mimura, Masato [1 ]
Itoyama, Katsutoshi [1 ]
Yoshii, Kazuyoshi [1 ]
Kawahara, Tatsuya [1 ]
机构
[1] Kyoto Univ, Grad Sch Informat, Sakyo Ku, Kyoto 6068501, Japan
关键词
Semi-blind source separation; Blind dereverberation; Recurrent neural network; NONNEGATIVE MATRIX FACTORIZATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper describes a semi-blind speech enhancement method using a semi-blind recurrent neural network (SBRNN) for human-robot speech interaction. When a robot interacts with a human using speech signals, the robot inputs not only audio signals recorded by its own microphone but also speech signals made by the robot itself, which can be used for semi-blind speech enhancement. The SB-RNN consists of cascaded two modules: a semi-blind source separation module and a blind dereverberation module. Each module has a recurrent layer to capture the temporal correlations of speech signals. The SB-RNN is trained in a manner of multi-task learning, i.e., isolated echoic speech signals are used as teacher signals for the output of the separation module in addition to isolated unechoic signals for the output of the dereverberation module. Experimental results showed that the source to distortion ratio was improved by 2.30 dB on average compared to a conventional method based on a semi-blind independent component analysis. The results also showed the effectiveness of modularization of the network, multi-task learning, the recurrent structure, and semi-blind source separation.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] A Semi-blind Source Separation Approach for Speech Dereverberation
    Wang, Ziteng
    Na, Yueyue
    Liu, Zhang
    Li, Yun
    Tian, Biao
    Fu, Qiang
    [J]. INTERSPEECH 2020, 2020, : 3925 - 3929
  • [2] Over-Determined Semi-Blind Speech Source Separation
    Togami, Masahito
    Scheibler, Robin
    [J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 640 - 645
  • [3] Binaural semi-blind dereverberation of noisy convoluted speech signals
    Lee, Jong-Hwan
    Oh, Sang-Hoon
    Lee, Soo-Young
    [J]. NEUROCOMPUTING, 2008, 72 (1-3) : 636 - 642
  • [4] Blind Speech Separation and Dereverberation using neural beamforming
    Pfeifenberger, Lukas
    Pernkopf, Franz
    [J]. SPEECH COMMUNICATION, 2022, 140 : 29 - 41
  • [5] Semi-Blind Source Separation with Learned Constraints
    Gertosio, Remi Carloni
    Bobin, Jerome
    Acero, Fabio
    [J]. SIGNAL PROCESSING, 2023, 202
  • [6] Semi-Blind Source Separation in Smart Home
    Liu, Mingfei
    Li, Shouliang
    Yang, Yi
    Li, Caihong
    Li, Lian
    [J]. 2016 IEEE INFORMATION TECHNOLOGY, NETWORKING, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (ITNEC), 2016, : 381 - 385
  • [7] Online blind source separation and dereverberation of speech based on a joint diagonalizability constraint
    Yu, Ho-Gun
    Kim, Do-Hui
    Song, Min-Hwan
    Park, Hyung-Min
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2021, 40 (05): : 503 - 514
  • [8] Stereophonic acoustic echo cancellation based on semi-blind source separation
    Cheng, Guoliang
    Ruan, Haoxin
    Hu, Yuxiang
    Zhu, Changbao
    Cao, Zhanzhong
    Lu, Jing
    [J]. APPLIED ACOUSTICS, 2024, 216
  • [9] Speech Separation Based on Semi-blind Kurtosis Maximization with Magnitude and Energy Distance
    Li, Long
    Lin, Qiu-Hua
    Gong, Xiao-Feng
    [J]. 4TH INTERNATIONAL CONFERENCE ON AWARENESS SCIENCE AND TECHNOLOGY (ICAST 2012), 2012, : 50 - 53
  • [10] A semi-blind approach to the separation of real world speech mixtures
    Tordini, F
    Piazza, F
    [J]. PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 1293 - 1298