Monaural Musical Sound Separation Based on Pitch and Common Amplitude Modulation

Cited by: 35
Authors
Li, Yipeng [1]
Woodruff, John [1]
Wang, DeLiang [1, 2]
Affiliations
[1] Ohio State Univ, Dept Comp Sci & Engn, Columbus, OH 43210 USA
[2] Ohio State Univ, Ctr Cognit Sci, Columbus, OH 43210 USA
Funding
National Science Foundation (USA)
Keywords
Common amplitude modulation (CAM); musical sound separation; sinusoidal modeling; time-frequency masking; underdetermined sound separation; SPEECH;
DOI
10.1109/TASL.2009.2020886
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Monaural musical sound separation has been extensively studied recently. An important problem in separation of pitched musical sounds is the estimation of time-frequency regions where harmonics overlap. In this paper, we propose a sinusoidal modeling-based separation system that can effectively resolve overlapping harmonics. Our strategy is based on the observations that harmonics of the same source have correlated amplitude envelopes and that the change in phase of a harmonic is related to the instrument's pitch. We use these two observations in a least squares estimation framework for separation of overlapping harmonics. The system directly distributes mixture energy for harmonics that are unobstructed by other sources. Quantitative evaluation of the proposed system is shown when ground truth pitch information is available, when rough pitch estimates are provided in the form of a MIDI score, and finally, when a multi-pitch tracking algorithm is used. We also introduce a technique to improve the accuracy of rough pitch estimates. Results show that the proposed system significantly outperforms related monaural musical sound separation systems.
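The least squares idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation; it only assumes a simplified model in which the mixture value of an overlapped harmonic at frame m is a sum, over sources, of an unknown complex coefficient times a known envelope shape (borrowed from a non-overlapped harmonic of the same source, per common amplitude modulation) times a phase term that advances according to the harmonic frequency implied by the pitch. The function name, argument names, and the constant-pitch assumption are all hypothetical.

```python
import numpy as np

def separate_overlapping_harmonic(mixture, envelopes, freqs_hz, hop_s):
    """Split one overlapped harmonic among K sources by least squares.

    Assumed (simplified) model, not taken from the paper's equations:
        mixture[m] ~= sum_k c_k * envelopes[k, m] * exp(j*2*pi*freqs_hz[k]*m*hop_s)

    mixture   : complex array (M,), observed values over M frames
    envelopes : real array (K, M), envelope shapes from non-overlapped harmonics
    freqs_hz  : real array (K,), harmonic number times F0, assumed constant
    hop_s     : frame hop in seconds
    returns   : complex array (K, M), estimated contribution of each source
    """
    mixture = np.asarray(mixture, dtype=complex)
    envelopes = np.asarray(envelopes, dtype=float)
    freqs_hz = np.asarray(freqs_hz, dtype=float)

    frames = np.arange(mixture.shape[0])
    # Phase advance per frame predicted from each source's pitch.
    phase = 2.0 * np.pi * freqs_hz[:, None] * frames[None, :] * hop_s
    # One basis row per source: known envelope shape times predicted phase.
    basis = envelopes * np.exp(1j * phase)

    # Solve for the unknown complex coefficients c_k (amplitude and start phase).
    coeffs, *_ = np.linalg.lstsq(basis.T, mixture, rcond=None)
    return coeffs[:, None] * basis
```

In this sketch the envelope shapes and pitch-derived frequencies are treated as given, so only one complex coefficient per source is estimated. The estimate degrades as the basis rows of two sources become nearly proportional, for example when both envelopes and harmonic frequencies almost coincide.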
Pages: 1361-1371
Number of pages: 11
Related papers
50 in total
  • [1] Monaural Musical Octave Sound Separation Using Relaxed Extended Common Amplitude Modulation
    Gong, Yukai
    Dai, Longquan
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2022, 31 (04)
  • [2] Recovering Overlapping Partials for Monaural Perfect Harmonic Musical Sound Separation Using Modified Common Amplitude Modulation
    Gong, Yukai
    Shu, Xiangbo
    Tang, Jinhui
    [J]. ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT I, 2018, 10735 : 903 - 912
  • [3] Monaural speech segregation based on pitch tracking and amplitude modulation
    Hu, GN
    Wang, DL
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2004, 15 (05) : 1135 - 1150
  • [4] Monaural speech segregation based on pitch tracking and amplitude modulation
    Hu, GN
    Wang, DL
    [J]. 2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 553 - 556
  • [5] Unison Sound Separation Using Localized Extended Common Amplitude Modulation
    Gong, Yukai
    Dai, Longquan
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2023, 32 (06)
  • [6] Clustering Algorithm for Unsupervised Monaural Musical Sound Separation Based on Non-negative Matrix Factorization
    Park, Sang Ha
    Lee, Seokjin
    Sung, Koeng-Mo
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2012, E95A (04) : 818 - 823
  • [7] On amplitude modulation for monaural speech segregation
    Hu, GN
    Wang, DL
    [J]. PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 69 - 74
  • [8] Correlation-Based Amplitude Estimation of Coincident Partials in Monaural Musical Signals
    Arnal Barbedo, Jayme Garcia
    Tzanetakis, George
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2010,
  • [9] Correlation-Based Amplitude Estimation of Coincident Partials in Monaural Musical Signals
    Jayme Garcia Arnal Barbedo
    George Tzanetakis
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2010
  • [10] Sound source separation for a robot based on pitch
    Heckmann, M
    Joublin, F
    Körner, E
    [J]. 2005 IEEE/RSJ International Conference on Intelligent Robots and Systems, Vols 1-4, 2005, : 203 - 208