IMPROVED SINGLE-CHANNEL SPEECH SEPARATION USING SINUSOIDAL MODELING

Cited by: 11
Authors
Mowlaee, Pejman [1]
Christensen, Mads Graesboll [2]
Jensen, Soren Holdt [1]
Affiliations
[1] Aalborg Univ, Dept Elect Syst, Aalborg, Denmark
[2] Aalborg Univ, Dept Media Technol, Aalborg, Denmark
Keywords
Mixture estimation; single-channel speech separation; mask-based methods; speaker codebook; RECOGNITION;
DOI
10.1109/ICASSP.2010.5496263
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
We present a novel single-channel separation approach that improves separation performance when recovering the signals from a mixture. The key idea is to employ a mixture estimator based on unconstrained modified sinusoidal parameters. Unlike the mixmax (binary mask) and Wiener filter (soft mask) approaches, the proposed approach works independently of pitch estimates. Furthermore, it is observed to achieve acceptable perceptual speech quality with less cross-talk at different signal-to-signal ratios, while reducing complexity by replacing the STFT with sinusoidal parameters. The improvements are demonstrated using PESQ as the objective measure and a MUSHRA listening test as the subjective evaluation.
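The two baseline masks named in the abstract can be illustrated with a minimal sketch. This is not the paper's proposed sinusoidal mixture estimator. It only shows how a binary (mixmax-style) mask and a Wiener-style soft mask are commonly formed from per-bin magnitude estimates of two speakers. The function names, the example magnitudes, and the assumption that the estimates are already available (for instance from speaker codebooks) are all hypothetical.

```python
# Illustrative sketch only: baseline masks, not the proposed method.
# All names and numbers below are hypothetical.

def binary_mask(mag_a, mag_b):
    """Binary mask: 1.0 where speaker A's estimate dominates the bin, else 0.0."""
    return [1.0 if a >= b else 0.0 for a, b in zip(mag_a, mag_b)]

def wiener_mask(mag_a, mag_b, eps=1e-12):
    """Soft mask: speaker A's estimated power over the total estimated power."""
    return [a * a / (a * a + b * b + eps) for a, b in zip(mag_a, mag_b)]

def apply_mask(mask, mixture_mag):
    """Scale each mixture magnitude bin by the corresponding mask value."""
    return [m * x for m, x in zip(mask, mixture_mag)]

# Assumed per-bin magnitude estimates for speakers A and B in one frame.
est_a = [4.0, 1.0, 0.5, 3.0]
est_b = [1.0, 2.0, 0.5, 0.0]
# Assumed observed mixture magnitudes for the same frame.
mixture = [4.2, 2.3, 0.8, 3.0]

hard = binary_mask(est_a, est_b)
soft = wiener_mask(est_a, est_b)
speaker_a_hard = apply_mask(hard, mixture)
speaker_a_soft = apply_mask(soft, mixture)
```

The binary mask assigns each bin entirely to one speaker, while the soft mask splits the bin in proportion to estimated power, which is one reason soft masks are often associated with fewer abrupt artifacts.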
Pages: 21-24
Page count: 4
Related papers
50 in total
  • [1] New Results on Single-Channel Speech Separation Using Sinusoidal Modeling
    Mowlaee, Pejman
    Christensen, Mads Graesboll
    Jensen, Soren Holdt
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (05): : 1265 - 1277
  • [2] Sinusoidal Approach for the Single-Channel Speech Separation and Recognition Challenge
    Mowlaee, P.
    Saeidi, R.
    Tan, Z. -H.
    Christensen, M. G.
    Kinnunen, T.
    Franti, P.
    Jensen, S. H.
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 684 - +
  • [3] Improved Phase Reconstruction in Single-Channel Speech Separation
    Mayer, Florian
    Mowlaee, Pejman
    [J]. 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1795 - 1799
  • [4] Single Channel Speech Separation Based on Sinusoidal Modeling
    Wiem, Belhedi
    Anouar, Ben Messaoud Mohamed
    Aicha, Bouzid
    [J]. 2016 2ND INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR SIGNAL AND IMAGE PROCESSING (ATSIP), 2016, : 672 - 676
  • [5] An Improved Unsupervised Single-Channel Speech Separation Algorithm for Processing Speech Sensor Signals
    Jiang, Dazhi
    He, Zhihui
    Lin, Yingqing
    Chen, Yifei
    Xu, Linyan
    [J]. WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2021, 2021
  • [6] Single-channel speech separation using soft mask filtering
    Radfar, Mohammad H.
    Dansereau, Richard M.
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08): : 2299 - 2310
  • [7] SINUSOIDAL MASKS FOR SINGLE CHANNEL SPEECH SEPARATION
    Mowlaee, Pejman
    Christensen, Mads Graesboll
    Jensen, Soren Holdt
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4262 - 4265
  • [8] Improved single-channel noise reduction method of speech by blind source separation
    Hamid, Mohammad Ekramul
    Ogawa, Keita
    Fukabayashi, Takeshi
    [J]. ACOUSTICAL SCIENCE AND TECHNOLOGY, 2007, 28 (03) : 153 - 164
  • [9] SINGLE-CHANNEL SPEECH SEPARATION BY USING A SPARSE DECOMPOSITION WITH PERIODIC STRUCTURE
    Nakashizuka, Makoto
    Okumura, Hiroyuki
    Iiguni, Youji
    [J]. 2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS SYSTEMS (ISPACS 2008), 2008, : 339 - 342
  • [10] Speaker Separation Using Visual Speech Features and Single-channel Audio
    Khan, Faheem
    Milner, Ben
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3263 - 3267