Single-Channel Speech Separation using Sparse Non-Negative Matrix Factorization

Cited by: 0

Authors:

Schmidt, Mikkel N. [1]

Olsson, Rasmus K. [1]

Affiliations:

[1] Tech Univ Denmark, Lyngby, Denmark

Source:

INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5 | 2006

Keywords:

Single-channel source separation; sparse non-negative matrix factorization

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

We apply machine learning techniques to the problem of separating multiple speech sources from a single microphone recording. The method of choice is a sparse non-negative matrix factorization algorithm, which can learn sparse representations of the data in an unsupervised manner. The algorithm is used to learn personalized dictionaries from a speech corpus, which in turn are used to separate the audio stream into its components. We show that computational savings can be achieved by segmenting the training data at the phoneme level, using a conventional speech recognizer to split the data. Both the unsupervised and the supervised adaptation schemes yield significant improvements in the target-to-masker ratio.
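
The dictionary-based separation described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a Euclidean cost with an L1 penalty on the activations, plain multiplicative updates, and a simple column renormalisation of the dictionary after every step. The names `sparse_nmf`, `separate`, `sparsity`, and `n_atoms` are hypothetical, and the inputs are assumed to be non-negative magnitude spectrograms (frequency bins by time frames).

```python
import numpy as np

def sparse_nmf(V, n_atoms, sparsity=0.1, n_iter=200, W=None, seed=0, eps=1e-9):
    """Approximate V by W @ H with non-negative factors and an L1 penalty on H.

    If W is supplied it is kept fixed (n_atoms is then ignored) and only the
    activations H are estimated.
    """
    rng = np.random.default_rng(seed)
    learn_W = W is None
    if learn_W:
        W = rng.random((V.shape[0], n_atoms)) + eps
        W /= np.linalg.norm(W, axis=0, keepdims=True)
    H = rng.random((W.shape[1], V.shape[1])) + eps
    for _ in range(n_iter):
        # Multiplicative update for H; the sparsity weight enters the
        # denominator and shrinks the activations towards zero.
        H *= (W.T @ V) / (W.T @ W @ H + sparsity + eps)
        if learn_W:
            W *= (V @ H.T) / (W @ H @ H.T + eps)
            # Assumed heuristic: renormalise the columns so that the penalty
            # cannot be evaded by scaling W up and H down.
            W /= np.linalg.norm(W, axis=0, keepdims=True) + eps
    return W, H

def separate(V_mix, W_a, W_b, sparsity=0.1, n_iter=200):
    """Split a mixture spectrogram using two pre-trained speaker dictionaries."""
    W = np.hstack([W_a, W_b])          # concatenated, fixed dictionary
    _, H = sparse_nmf(V_mix, W.shape[1], sparsity, n_iter, W=W)
    k = W_a.shape[1]
    # Each source estimate uses only its own speaker's atoms and activations.
    return W_a @ H[:k], W_b @ H[k:]
```

In this sketch, each speaker's dictionary would be learned once from that speaker's training spectrogram with `sparse_nmf`, and `separate` would then be applied to the mixture. Turning the two estimates back into audio (for example by masking the mixture spectrogram and inverting it with the mixture phase) is left out.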

Citation

Pages: 2614-2617

Number of pages: 4

50 records in total

[1] DISCRIMINATIVE NON-NEGATIVE MATRIX FACTORIZATION FOR SINGLE-CHANNEL SPEECH SEPARATION
Wang, Zi
Sha, Fei
[J]. 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
[2] Single Channel Music and Speech Separation Using Non-negative Matrix Factorization
Yidirim, Sinan
Saraclar, Murat
[J]. 2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 543 - 546
[3] Non-negative Matrix Factorization with Linear Constraints for Single-Channel Speech Enhancement
Lyubimov, Nikolay
Kotov, Mikhail
[J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 446 - 450
[4] Performance Evaluation of Single Channel Speech Separation Using Non-Negative Matrix Factorization
Nandakumar, Mona M.
Bijoy, Edet K.
[J]. 2014 NATIONAL CONFERENCE ON COMMUNICATION, SIGNAL PROCESSING AND NETWORKING (NCCSN), 2014,
[5] Single-Channel Speech Separation Based on Non-negative Matrix Factorization and Factorial Conditional Random Field
LI Xu
TU Ming
WANG Xiaofei
WU Chao
FU Qiang
YAN Yonghong
[J]. Chinese Journal of Electronics, 2018, 27 (05) : 1063 - 1070
[6] Single-Channel Speech Separation Based on Non-negative Matrix Factorization and Factorial Conditional Random Field
Li Xu
Tu Ming
Wang Xiaofei
Wu Chao
Fu Qiang
Yan Yonghong
[J]. CHINESE JOURNAL OF ELECTRONICS, 2018, 27 (05) : 1063 - 1070
[7] Adaptive Sparsity Non-Negative Matrix Factorization for Single-Channel Source Separation
Gao, Bin
Woo, W. L.
Dlay, S. S.
[J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2011, 5 (05) : 989 - 1001
[8] SINGLE-CHANNEL SPEECH SEPARATION BY INCLUDING SPECTRAL STRUCTURE INFORMATION WITHIN NON-NEGATIVE MATRIX FACTORIZATION
Feng, Yuxiao
Ritz, Christian
[J]. 2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 620 - 624
[9] Perceptual Single-Channel Audio Source Separation by Non-negative Matrix Factorization
Kirbiz, Serap
Gunsel, Bilge
[J]. 2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 654 - 657
[10] Single-channel blind separation using L1-sparse complex non-negative matrix factorization for acoustic signals
Parathai, P.
Woo, W. L.
Dlay, S. S.
Gao, Bin
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2015, 137 (01) : EL124 - EL129
