Adversarial Dictionary Learning for Monaural Speech Enhancement

被引:0
|
作者
Ji, Yunyun [1 ]
Xu, Longting [2 ]
Zhu, Wei-Ping [3 ]
机构
[1] Agora IO Inc, Shanghai, Peoples R China
[2] Donghua Univ, Coll Informat Sci & Technol, Shanghai, Peoples R China
[3] Concordia Univ, Montreal, PQ, Canada
来源
关键词
speech enhancement; dictionary learning; adversarial training; sparse coding; low rank matrix decomposition; ALGORITHM;
D O I
10.21437/Interspeech.2020-2500
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
In this paper, we propose an adversarial dictionary learning method to train a speaker independent speech dictionary and a universal noise dictionary for improving the generality of the dictionary learning based speech enhancement system. In the learning stage, two discriminators are employed separately to identify the components in speech and noise which are highly correlated with each other. The residuals in the speech and noise magnitude spectral matrices are then utilized to train the speech and noise dictionaries via the alternating direction method of multiplier algorithm, which can effectively reduce the mutual coherence between speech and noise. In the enhancement stage, a new optimization technique is proposed for enhancing the speech based on the low-rank decomposition and sparse coding. Experimental results show that our proposed method achieves better performance in improving the speech quality and intelligibility than the reference methods in terms of three objective performance evaluation measures.
引用
收藏
页码:4034 / 4038
页数:5
相关论文
共 50 条
  • [1] Monaural speech enhancement using joint dictionary learning with cross-coherence penalties
    Zhang, Long
    Bao, Guangzhao
    Luo, You
    Ye, Zhongfu
    [J]. 2015 8TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2015, : 518 - 522
  • [2] Double Adversarial Network based Monaural Speech Enhancement for Robust Speech Recognition
    Du, Zhihao
    Han, Jiqing
    Zhang, Xueliang
    [J]. INTERSPEECH 2020, 2020, : 309 - 313
  • [3] Joint Ideal Ratio Mask and Generative Adversarial Networks for Monaural Speech Enhancement
    Yuan, Jing
    Bao, Changchun
    [J]. PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 276 - 280
  • [4] Self-supervised Adversarial Multi-task Learning for Vocoder-based Monaural Speech Enhancement
    Du, Zhihao
    Lei, Ming
    Han, Jiqing
    Zhang, Shiliang
    [J]. INTERSPEECH 2020, 2020, : 3271 - 3275
  • [5] A Time-domain Monaural Speech Enhancement with Feedback Learning
    Li, Andong
    Zheng, Chengshi
    Cheng, Linjuan
    Peng, Renhua
    Li, Xiaodong
    [J]. 2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 769 - 774
  • [6] Adversarial Latent Representation Learning for Speech Enhancement
    Qiu, Yuanhang
    Wang, Ruili
    [J]. INTERSPEECH 2020, 2020, : 2662 - 2666
  • [7] An Improved Dictionary Learning Method for Speech Enhancement
    Hao, Yue
    Bao, Changchun
    [J]. 2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 144 - 147
  • [8] Speech Enhancement Using Generative Dictionary Learning
    Sigg, Christian D.
    Dikk, Tomas
    Buhmann, Joachim M.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (06): : 1698 - 1712
  • [9] Harmonic Attention for Monaural Speech Enhancement
    Wang, Tianrui
    Zhu, Weibin
    Gao, Yingying
    Zhang, Shilei
    Feng, Junlan
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 2424 - 2436
  • [10] Monaural speech enhancement with dilated convolutions
    Pirhosseinloo, Shadi
    Brumberg, Jonathan S.
    [J]. INTERSPEECH 2019, 2019, : 3143 - 3147