Supervised Separation of Speech from Background Piano Music using a Nonnegative Matrix Factorization Approach

被引:0
|
作者
Martinez-Colon, A. [1 ]
Canadas-Quesada, F. J. [1 ]
Vera-Candeas, P. [1 ]
Ruiz-Reyes, N. [1 ]
Moreno-Fuentes, F. [1 ]
机构
[1] Univ Jaen, Telecommun Engn Dept, Jaen, Spain
来源
STAIRS 2014 | 2014年 / 264卷
关键词
Sound separation; Non-negative matrix factorization; training; supervised; sparse; interference;
D O I
10.3233/978-1-61499-421-3-181
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a supervised algorithm for separating speech from background non-stationary noise (piano music) in single-channel recordings. The proposed algorithm, based on a nonnegative matrix factorization (NMF) approach, is able to extract speech sounds from isolated or chords piano sounds learning the set of spectral patterns generated by independent syllables and piano notes. Moroever, a sparsity constraint is used to improve the quality of the separated signals. Our proposal was tested using several audio mixtures composed of real-world piano recordings and Spanish speech showing promising results.
引用
收藏
页码:181 / 190
页数:10
相关论文
共 50 条
  • [31] Supervised kernel nonnegative matrix factorization for face recognition
    Chen, Wen-Sheng
    Zhao, Yang
    Pan, Binbin
    Chen, Bo
    [J]. NEUROCOMPUTING, 2016, 205 : 165 - 181
  • [32] Self-Supervised Symmetric Nonnegative Matrix Factorization
    Jia, Yuheng
    Liu, Hui
    Hou, Junhui
    Kwong, Sam
    Zhang, Qingfu
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (07) : 4526 - 4537
  • [33] Robust Semi-supervised Nonnegative Matrix Factorization
    Wang, Jing
    Tian, Feng
    Liu, Chang Hong
    Wang, Xiao
    [J]. 2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [34] Speech Enhancement Using Convolutive Nonnegative Matrix Factorization with Cosparsity Regularization
    Mirbagheri, Majid
    Xu, Yanbo
    Akram, Sahar
    Shamma, Shihab
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 456 - 459
  • [35] A STRUCTURED NONNEGATIVE MATRIX FACTORIZATION FOR SOURCE SEPARATION
    Laroche, Clement
    Kowalski, Matthieu
    Papadopoulos, Helene
    Richard, Gael
    [J]. 2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 2033 - 2037
  • [36] A multilevel approach for nonnegative matrix factorization
    Gillis, Nicolas
    Glineur, Francois
    [J]. JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2012, 236 (07) : 1708 - 1723
  • [37] A PROJECTIVE APPROACH TO NONNEGATIVE MATRIX FACTORIZATION
    Groetzner, Patrick
    [J]. ELECTRONIC JOURNAL OF LINEAR ALGEBRA, 2021, 37 : 583 - 597
  • [38] Network Embedding Using Semi-Supervised Kernel Nonnegative Matrix Factorization
    He, Chaobo
    Zhang, Qiong
    Tang, Yong
    Liu, Shuangyin
    Liu, Hai
    [J]. IEEE ACCESS, 2019, 7 : 92732 - 92744
  • [39] A DISCRIMINATIVE APPROACH TO POLYPHONIC PIANO NOTE TRANSCRIPTION USING SUPERVISED NON-NEGATIVE MATRIX FACTORIZATION
    Weninger, Felix
    Kirst, Christian
    Schuller, Bjoern
    Bungartz, Hans-Joachim
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 6 - 10
  • [40] Spectro-temporal Filtering based on The Beta-divergence for Speech Separation using Nonnegative Matrix Factorization
    Fakhry, Mahmoud
    [J]. 2021 4TH INTERNATIONAL SEMINAR ON RESEARCH OF INFORMATION TECHNOLOGY AND INTELLIGENT SYSTEMS (ISRITI 2021), 2020,