A TWO-STAGE APPROACH FOR IMPROVING THE PERCEPTUAL QUALITY OF SEPARATED SPEECH

Cited by: 0
Authors
Williamson, Donald S. [1]
Wang, Yuxuan [1]
Wang, DeLiang [1]
Affiliations
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
Keywords
nonnegative matrix factorization; speech separation; speech quality; binary masking; NOISE;
DOI
Not available
Chinese Library Classification (CLC) number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
Binary time-frequency masking and model-based nonnegative matrix factorization (NMF) are two common approaches to speech separation. However, binary masking often yields poor perceptual quality, while NMF typically requires pretrained models for both speech and noise and frequently does not perform well. In this paper, we examine whether a single-stage or a two-stage approach should be used for separation. We propose a two-stage algorithm that uses a soft mask in the first stage for separation and NMF in the second stage for improving perceptual quality, where only a speech model needs to be trained. We show that the proposed two-stage approach achieves higher objective perceptual quality and intelligibility than related single-stage methods.
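The two-stage idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say how the soft mask is estimated, which NMF cost is used, or how the speech model is trained. The sketch assumes a soft mask with values in [0, 1] is already available, that the speech model is a fixed nonnegative basis matrix, and that activations are fitted with the standard multiplicative updates for the Euclidean cost. All function and parameter names (`apply_soft_mask`, `nmf_activations`, `two_stage`, `speech_basis`) are hypothetical.

```python
import numpy as np


def apply_soft_mask(noisy_mag, mask):
    """Stage 1 (assumed form): scale each time-frequency unit by a mask in [0, 1]."""
    return np.clip(mask, 0.0, 1.0) * noisy_mag


def nmf_activations(V, W, n_iter=200, eps=1e-12, seed=0):
    """Fit H >= 0 so that W @ H approximates V, keeping the basis W fixed.

    Uses multiplicative updates for the squared Euclidean cost
    (an assumption; the abstract does not name the cost function).
    """
    rng = np.random.default_rng(seed)
    H = rng.random((W.shape[1], V.shape[1])) + eps  # strictly positive start
    WtV = W.T @ V
    WtW = W.T @ W
    for _ in range(n_iter):
        H *= WtV / (WtW @ H + eps)
    return H


def two_stage(noisy_mag, mask, speech_basis, n_iter=200):
    """Soft-mask the noisy magnitudes, then reconstruct them from a speech-only basis."""
    stage1 = apply_soft_mask(noisy_mag, mask)                # stage 1: separation
    H = nmf_activations(stage1, speech_basis, n_iter=n_iter)  # stage 2: NMF fit
    return speech_basis @ H                                   # speech-model estimate
```

Here `noisy_mag` and `mask` are magnitude-spectrogram-shaped arrays (frequency by time) and `speech_basis` has one column per basis vector. Because only the speech basis appears in the second stage, no noise model is needed there, which matches the abstract's remark that only a speech model needs to be trained.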
Pages: 5