A Two-Stage Approach to Noisy Cochannel Speech Separation with Gated Residual Networks

被引:2
|
作者
Tan, Ke [1 ]
Wang, DeLiang [1 ,2 ]
机构
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
[2] Ohio State Univ, Ctr Cognit & Brain Sci, Columbus, OH 43210 USA
关键词
noisy cochannel speech separation; gated residual networks; ideal ratio mask; denoising; cochannel separation;
D O I
10.21437/Interspeech.2018-1406
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cochannel speech separation is the task of separating two speech signals from a single mixture. The task becomes even more challenging if the speech mixture is further corrupted by background noise. In this study, we focus on a gender dependent scenario, where target speech is from a male speaker and interfering speech from a female speaker. We propose a two-stage separation strategy to address this problem in a noise-independent way. In the proposed system, denoising and cochannel separation are performed successively by two modules, which are based on a newly-introduced convolutional neural network for speech separation. The evaluation results demonstrate that the proposed system substantially outperforms one-stage baselines in terms of objective intelligibility and perceptual quality.
引用
收藏
页码:3484 / 3488
页数:5
相关论文
共 50 条
  • [41] A TWO-STAGE FRAMEWORK FOR COMPOUND FIGURE SEPARATION
    Jiang, Weixin
    Schwenker, Eric
    Spreadbury, Trevor
    Ferrier, Nicola
    Chan, Maria K. Y.
    Cossairt, Oliver
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1204 - 1208
  • [42] Two-stage separation and alignment of cellulose nanocrystals
    Hu, Yang
    Abidi, Noureddine
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2015, 249
  • [43] Technology of streptomycin sulfate separation by two-stage foam separation
    Li, Juan
    Wu, Zhaoliang
    Li, Rui
    BIOTECHNOLOGY PROGRESS, 2012, 28 (03) : 733 - 739
  • [44] A Two-Stage transient stability prediction method using convolutional residual memory network and gated recurrent unit
    Zhan, Xianwen
    Han, Song
    Rong, Na
    Liu, Peili
    Ao, Weizhi
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2022, 138
  • [45] Two-stage deep learning approach for speech enhancement and reconstruction in the frequency and time domains
    Nossier, Soha A.
    Wall, Julie
    Moniri, Mansour
    Glackin, Cornelius
    Cannings, Nigel
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [46] Bifurcation and Reunion: A Loss-Guided Two-Stage Approach for Monaural Speech Dereverberation
    Luo, Xiaoxue
    Zheng, Chengshi
    Li, Andong
    Ke, Yuxuan
    Li, Xiaodong
    INTERSPEECH 2022, 2022, : 2503 - 2507
  • [47] Two-stage speech/non-speech classification of telephone signals
    Li Jian-Bin
    Yan Ji-Kun
    Zheng Hui
    Niu Zhong-Xia
    2006 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS PROCEEDINGS, VOLS 1-4: VOL 1: SIGNAL PROCESSING, 2006, : 490 - +
  • [48] Investigations on the Optimal Estimation of Speech Envelopes for the Two-Stage Speech Enhancement
    Song, Yanjue
    Madhu, Nilesh
    SENSORS, 2023, 23 (14)
  • [49] A two-stage approach to fingerprint classification
    Ping, Y
    Wang, LM
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON INTELLIGENT MECHATRONICS AND AUTOMATION, 2004, : 918 - 921
  • [50] A Two-Stage Approach for Network Monitoring
    Bai, Linda
    Roy, Sumit
    JOURNAL OF NETWORK AND SYSTEMS MANAGEMENT, 2013, 21 (02) : 238 - 263