Single-channel speech separation using combined EMD and speech-specific information

被引：9

作者：

Prasanna Kumar M.K. ^{[1
]}

Kumaraswamy R. ^{[2
]}

机构：

[1] BMS College of Engineering, Bangalore, 560019, Karnataka

[2] Siddaganga Institute of Technology, Tumkur, 572103, Karnataka

来源：

International Journal of Speech Technology | 2017年 / 20卷 / 4期

关键词：

BSS; EMD; IMF; Multi pitch information; SCSS; SIFT;

D O I：

10.1007/s10772-017-9468-3

中图分类号：

学科分类号：

摘要：

Multi-channel blind source separation (BSS) methods use more than one microphone. There is a need to develop speech separation algorithms under single microphone scenario. In this paper we propose a method for single channel speech separation (SCSS) by combining empirical mode decomposition (EMD) and speech specific information. Speech specific information is derived in the form of source-filter features. Source features are obtained using multi pitch information. Filter information is estimated using formant analysis. To track multi pitch information in the mixed signal we apply simple-inverse filtering tracking (SIFT) and histogram based pitch estimation to excitation source information. Formant estimation is done using linear predictive (LP) analysis. Pitch and formant estimation are done with and without EMD decomposition for better extraction of the individual speakers in the mixture. Combining EMD with speech specific information provides encouraging results for single-channel speech separation. © 2017, Springer Science+Business Media, LLC.

引用

页码：1037 / 1047

页数：10

共 50 条

[1] Source-Filter-Based Single-Channel Speech Separation Using Pitch Information
Stark, Michael
Wohlmayr, Michael
Pernkopf, Franz
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (02): : 242 - 255
[2] IMPROVED SINGLE-CHANNEL SPEECH SEPARATION USING SINUSOIDAL MODELING
Mowlaee, Pejman
Christensen, Mads Graesboll
Jensen, Soren Holdt
[J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 21 - 24
[3] Single-channel speech separation using soft mask filtering
Radfar, Mohammad H.
Dansereau, Richard M.
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08): : 2299 - 2310
[4] Speech/music classification using speech-specific features
Khonglah, Banriskhem K.
Prasanna, S. R. Mahadeva
[J]. DIGITAL SIGNAL PROCESSING, 2016, 48 : 71 - 83
[5] SINGLE-CHANNEL SPEECH SEPARATION BY USING A SPARSE DECOMPOSITION WITH PERIODIC STRUCTURE
Nakashizuka, Makoto
Okumura, Hiroyuki
Iiguni, Youji
[J]. 2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS SYSTEMS (ISPACS 2008), 2008, : 339 - 342
[6] New Results on Single-Channel Speech Separation Using Sinusoidal Modeling
Mowlaee, Pejman
Christensen, Mads Graesboll
Jensen, Soren Holdt
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (05): : 1265 - 1277
[7] Speaker Separation Using Visual Speech Features and Single-channel Audio
Khan, Faheem
Milner, Ben
[J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3263 - 3267
[8] Single-Channel Speech Separation Using Phase-Based Methods
Lee, Yun-Kyung
Lee, In Sung
Kwon, Oh-Wook
[J]. IEEE TRANSACTIONS ON CONSUMER ELECTRONICS, 2010, 56 (04) : 2453 - 2459
[9] Single-channel speech separation using sequential discriminative dictionary learning
Xu, Yangfei
Bao, Guangzhao
Xu, Xu
Ye, Zhongfu
[J]. SIGNAL PROCESSING, 2015, 106 : 134 - 140
[10] SINGLE-CHANNEL SPEECH SEPARATION AND RECOGNITION USING LOOPY BELIEF PROPAGATION
Rennie, Steven J.
Hershey, John R.
Olsen, Peder A.
[J]. 2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3845 - 3848

← 1 2 3 4 5 →