VOCAL SEPARATION USING EXTENDED ROBUST PRINCIPAL COMPONENT ANALYSIS WITH SCHATTEN P/LP-NORM AND SCALE COMPRESSION

被引:0
|
作者
Jeong, Il-Young [1 ]
Lee, Kyogu [1 ,2 ]
机构
[1] Seoul Natl Univ, Grad Sch Convergence Sci & Technol, Mus & Audio Res Grp, Seoul, South Korea
[2] Adv Inst Convergence Technol, Suwon, South Korea
关键词
Vocal separation; robust principal component analysis; Schatten p-norm; l(p)-norm; scale compression; SINGING-VOICE SEPARATION; MONAURAL RECORDINGS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Separating vocal and accompaniment signals from a monaural music signal is a challenging task. Recently, robust principal component analysis (RPCA) has been proposed for use in the magnitude spectrogram domain to separate the low-rank and sparse residual matrices, which are assumed to represent the accompaniment and vocal signals, respectively. In this paper, we propose two extended methods based on the RPCA algorithm for more effective vocal separation. First, we extend the conventional RPCA and propose to use in the spectrogram decomposition framework Schatten p-and l(p)-norms, which are generalized versions of the nuclear norm and l(1)-norm used in RPCA, respectively. Second, we apply proper scale compression to the magnitude spectrogram, making it a more appropriate representation for the decomposition. Experiments using the MIR-1K dataset show that the proposed methods yield significantly better separation performance than the conventional RPCA.
引用
收藏
页数:6
相关论文
共 28 条
  • [21] Singing Voice Separation and Vocal FO Estimation Based on Mutual Combination of Robust Principal Component Analysis and Subharmonic Summation
    Ikemiya, Yukara
    Itoyama, Katsutoshi
    Yoshii, Kazuyoshi
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (11) : 2084 - 2095
  • [22] FEATURE AWARE 3D MESH COMPRESSION USING ROBUST PRINCIPAL COMPONENT ANALYSIS
    Lalos, Aris S.
    Arvanitis, Gerasimos
    Spathis-Papadiotis, Aristotelis
    Moustakas, Konstantinos
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2018,
  • [23] Robust principal component analysis via optimal mean by joint l2,1 and Schatten p-norms minimization
    Shi, Xiaoshuang
    Nie, Feiping
    Lai, Zhihui
    Guo, Zhenhua
    [J]. NEUROCOMPUTING, 2018, 283 : 205 - 213
  • [24] SINGING-VOICE SEPARATION FROM MONAURAL RECORDINGS USING ROBUST PRINCIPAL COMPONENT ANALYSIS
    Huang, Po-Sen
    Chen, Scott Deeann
    Smaragdis, Paris
    Hasegawa-Johnson, Mark
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 57 - 60
  • [25] Manifold Regularized Principal Component Analysis Method Using L2,p-Norm
    Wan, Minghua
    Wang, Xichen
    Tan, Hai
    Yang, Guowei
    [J]. MATHEMATICS, 2022, 10 (23)
  • [26] Unsupervised Singing Voice Separation Using Gammatone Auditory Filterbank and Constraint Robust Principal Component Analysis
    Li, Feng
    Akagi, Masato
    [J]. 2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1924 - 1928
  • [27] Unsupervised Single-Channel Singing Voice Separation with Weighted Robust Principal Component Analysis Based on Gammatone Auditory Filterbank and Vocal Activity Detection
    Li, Feng
    Hu, Yujun
    Wang, Lingling
    [J]. SENSORS, 2023, 23 (06)
  • [28] The Nonconvex Tensor Robust Principal Component Analysis Approximation Model via the Weighted ℓp\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\ell _p$$\end{document}-Norm Regularization
    Minghui Li
    Wen Li
    Yannan Chen
    Mingqing Xiao
    [J]. Journal of Scientific Computing, 2021, 89 (3)