Evaluating single-channel speech separation performance in transform-domain

被引:8
|
作者
Mowlaee, Pejman [1 ]
Sayadiyan, Abolghasem [1 ]
Sheikhzadeh, Hamid [1 ]
机构
[1] Amirkabir Univ Technol, Dept Elect Engn, Tehran 158754413, Iran
关键词
Single-channel separation (SCS); Magnitude spectrum; Vector quantization (VQ); Subband perceptually weighted transformation (SPWT); Spectral distortion (SD); ENHANCEMENT; MODEL; QUANTIZATION; TRACKING;
D O I
10.1631/jzus.C0910087
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Single-channel separation (SCS) is a challenging scenario where the objective is to segregate speaker signals from their mixture with high accuracy. In this research a novel framework called subband perceptually weighted transformation (SPWT) is developed to offer a perceptually relevant feature to replace the commonly used magnitude of the short-time Fourier transform (STFT). The main objectives of the proposed SPWT are to lower the spectral distortion (SD) and to improve the ideal separation quality. The performance of the SPWT is compared to those obtained using mixmax and Wiener filter methods. A comprehensive statistical analysis is conducted to compare the SPWT quantization performance as well as the ideal separation quality with other features of log-spectrum and magnitude spectrum. Our evaluations show that the SPWT provides lower SD values and a more compact distribution of SD, leading to more acceptable subjective separation quality as evaluated using the mean opinion score.
引用
收藏
页码:160 / 174
页数:15
相关论文
共 50 条
  • [1] Evaluating single-channel speech separation performance in transform-domain
    Pejman MOWLAEE
    Abolghasem SAYADIYAN
    Hamid SHEIKHZADEH
    [J]. Frontiers of Information Technology & Electronic Engineering, 2010, (03) : 160 - 174
  • [2] Evaluating single-channel speech separation performance in transform-domain
    Pejman Mowlaee
    Abolghasem Sayadiyan
    Hamid Sheikhzadeh
    [J]. Journal of Zhejiang University SCIENCE C, 2010, 11 : 160 - 174
  • [3] Performance Evaluation for Transform Domain Model-based Single-channel Speech Separation
    Mowlaee, Pejman
    Sayadiyan, Abolghasem
    [J]. 2009 IEEE/ACS INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS, VOLS 1 AND 2, 2009, : 935 - 942
  • [4] Deep Clustering in Complex Domain for Single-Channel Speech Separation
    Liu, Runling
    Tang, Yu
    Mang, Hongwei
    [J]. 2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 1463 - 1468
  • [5] Transform-Domain Speech Bandwidth Extension
    Nizampatnam, Prasad
    Raju, G. R. L. V. N. S.
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2019, 38 (12) : 5717 - 5733
  • [6] Transform-Domain Speech Bandwidth Extension
    Prasad Nizampatnam
    G. R. L. V. N. S. Raju
    [J]. Circuits, Systems, and Signal Processing, 2019, 38 : 5717 - 5733
  • [7] Time-domain adaptive attention network for single-channel speech separation
    Wang, Kunpeng
    Zhou, Hao
    Cai, Jingxiang
    Li, Wenna
    Yao, Juan
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)
  • [8] Time-domain adaptive attention network for single-channel speech separation
    Kunpeng Wang
    Hao Zhou
    Jingxiang Cai
    Wenna Li
    Juan Yao
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [9] JOINT SINGLE-CHANNEL SPEECH SEPARATION AND SPEAKER IDENTIFICATION
    Mowlaee, P.
    Saeidi, R.
    Tan, Z. -H.
    Christensen, M. G.
    Franti, P.
    Jensen, S. H.
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4430 - 4433
  • [10] WHAMR!: NOISY AND REVERBERANT SINGLE-CHANNEL SPEECH SEPARATION
    Maciejewski, Matthew
    Wichern, Gordon
    McQuinn, Emmett
    Le Roux, Jonathan
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 696 - 700