Single-channel speech separation using combined EMD and speech-specific information

Cited by: 9
Authors
Prasanna Kumar M.K. [1]
Kumaraswamy R. [2]
Affiliations
[1] BMS College of Engineering, Bangalore, 560019, Karnataka
[2] Siddaganga Institute of Technology, Tumkur, 572103, Karnataka
Keywords
BSS; EMD; IMF; Multi-pitch information; SCSS; SIFT
DOI
10.1007/s10772-017-9468-3
Abstract
Multi-channel blind source separation (BSS) methods require more than one microphone, so speech separation algorithms are still needed for the single-microphone scenario. In this paper we propose a method for single-channel speech separation (SCSS) that combines empirical mode decomposition (EMD) with speech-specific information. The speech-specific information takes the form of source-filter features: source features are obtained from multi-pitch information, and filter information is estimated by formant analysis. To track multi-pitch information in the mixed signal, we apply simplified inverse filter tracking (SIFT) and histogram-based pitch estimation to the excitation source information. Formants are estimated using linear predictive (LP) analysis. Pitch and formant estimation are performed both with and without EMD to improve extraction of the individual speakers from the mixture. Combining EMD with speech-specific information gives encouraging results for single-channel speech separation. © 2017, Springer Science+Business Media, LLC.
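The source-filter analysis named in the abstract can be illustrated with a minimal single-speaker sketch. It is not the authors' implementation: it omits EMD, the histogram-based multi-pitch step, and the low-pass filtering and decimation of the classical SIFT procedure, and only shows LP analysis, inverse filtering to obtain an excitation residual, autocorrelation peak picking for pitch, and LP pole angles as formant candidates. All function names, the LP order of 10, the 8 kHz sampling rate, the 80 to 400 Hz pitch range, and the synthetic test frame are assumptions made for illustration.

```python
import numpy as np


def lpc_coefficients(frame, order):
    """Autocorrelation-method LP coefficients via Levinson-Durbin.

    Returns a with a[0] == 1, so the inverse filter is A(z) = sum_k a[k] z^-k.
    Assumes the frame is not silent (non-zero energy).
    """
    n = len(frame)
    r = np.correlate(frame, frame, mode="full")[n - 1:n + order]
    a = np.zeros(order + 1)
    a[0] = 1.0
    err = r[0]
    for i in range(1, order + 1):
        # Correlation between the current prediction error and lag i.
        acc = r[i] + np.dot(a[1:i], r[i - 1:0:-1])
        k = -acc / err  # reflection coefficient
        a_prev = a.copy()
        for j in range(1, i):
            a[j] = a_prev[j] + k * a_prev[i - j]
        a[i] = k
        err *= 1.0 - k * k
    return a


def estimate_pitch(frame, fs, order=10, f0_min=80.0, f0_max=400.0):
    """SIFT-style pitch estimate: inverse filter, then autocorrelation peak."""
    windowed = frame * np.hamming(len(frame))
    a = lpc_coefficients(windowed, order)
    # Inverse filtering removes the vocal-tract envelope and leaves an
    # approximation of the excitation (the LP residual).
    residual = np.convolve(windowed, a)[:len(windowed)]
    ac = np.correlate(residual, residual, mode="full")[len(residual) - 1:]
    lag_min = int(fs / f0_max)
    lag_max = int(fs / f0_min)
    lag = lag_min + int(np.argmax(ac[lag_min:lag_max + 1]))
    return fs / lag


def formant_candidates(frame, fs, order=10):
    """Frequencies (Hz) of the complex LP poles, sorted in ascending order."""
    windowed = frame * np.hamming(len(frame))
    a = lpc_coefficients(windowed, order)
    roots = np.roots(a)
    roots = roots[roots.imag > 0]  # keep one pole of each conjugate pair
    return np.sort(np.angle(roots) * fs / (2.0 * np.pi))


def synth_voiced_frame(fs=8000, n=400, f0=125.0, formant=700.0, seed=0):
    """Hypothetical test signal: impulse train through one damped resonance."""
    period = int(round(fs / f0))
    pulses = np.zeros(n)
    pulses[::period] = 1.0
    t = np.arange(200)
    h = 0.97 ** t * np.cos(2.0 * np.pi * formant * t / fs)
    rng = np.random.default_rng(seed)
    return np.convolve(pulses, h)[:n] + 0.01 * rng.standard_normal(n)


if __name__ == "__main__":
    frame = synth_voiced_frame()
    print("pitch estimate (Hz):", estimate_pitch(frame, fs=8000))
    print("formant candidates (Hz):", formant_candidates(frame, fs=8000))
```

For a two-speaker mixture, a single autocorrelation peak per frame is not enough; one plausible extension, in the spirit of the histogram-based step the abstract mentions, would be to collect several peak candidates per frame and accumulate them in a histogram over time.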
Pages: 1037-1047
Page count: 10