A BLIND SEPARATION OF MONAURAL SOUND BASED ON PEAK TRACKING OF FREQUENCY SPECTRA

被引:1
|
作者
Yamahata, Shoko [1 ]
Matsumoto, Mitsuharu [2 ]
Hashimoto, Shuji [1 ]
机构
[1] Waseda Univ, Shinjuku Ku, 3-4-1 Okubo, Tokyo 1698555, Japan
[2] Univ Electro Commun, Tokyo 1828585, Japan
关键词
D O I
10.1109/ICIME.2009.33
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes a blind separation algorithm of monaural sound based on peak tracking of frequency spectra. We have already reported a blind separation method based on the change ratio of frequency components. However, it cannot handle a signal with frequency fluctuation such as a speech signal or a vibrato tone, because such type of signal is regarded as the mixture of different sounds. Our new method proposed in this paper can handle a sound with frequency fluctuation by tracking frequency peaks along time axis. The effectiveness of the proposed method is evaluated with some experiments on real voice data.
引用
收藏
页码:305 / +
页数:3
相关论文
共 50 条
  • [1] New Distance Measure for Monaural Model-based Sound Separation
    Mahale, P. Mowlaee Begzade
    Sayadiyan, A.
    [J]. 2008 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES: FROM THEORY TO APPLICATIONS, VOLS 1-5, 2008, : 668 - 671
  • [2] Monaural Musical Sound Separation Based on Pitch and Common Amplitude Modulation
    Li, Yipeng
    Woodruff, John
    Wang, DeLiang
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (07): : 1361 - 1371
  • [3] Monaural Voiced Speech Separation with Multipitch Tracking
    Jiang, Wei
    Hu, Pengfei
    Liang, Shan
    Liu, Wenju
    Yang, Zhanlei
    [J]. PATTERN RECOGNITION, 2012, 321 : 564 - 571
  • [4] Underdetermined blind separation of delayed sound sources in the frequency domain
    Bofill, P
    [J]. NEUROCOMPUTING, 2003, 55 (3-4) : 627 - 641
  • [5] Agglomerative Hierarchical Clustering of Basis Vector for Monaural Sound Source Separation Based on NMF
    Murai, Kentaro
    Takeuchi, Taiho
    Tatekura, Yosuke
    [J]. 2018 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2018, : 1653 - 1657
  • [6] LINEAR MULTICHANNEL BLIND SOURCE SEPARATION BASED ON TIME-FREQUENCY MASK OBTAINED BY HARMONIC/PERCUSSIVE SOUND SEPARATION
    Oyabu, Soichiro
    Kitamura, Daichi
    Yatabe, Kohei
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 201 - 205
  • [7] Blind separation and sound localization by using frequency-domain ICA
    Azetsu, Tadahiro
    Uchino, Eiji
    Suetake, Noriaki
    [J]. SOFT COMPUTING, 2007, 11 (02) : 185 - 192
  • [8] Blind Separation and Sound Localization by Using Frequency-domain ICA
    Tadahiro Azetsu
    Eiji Uchino
    Noriaki Suetake
    [J]. Soft Computing, 2007, 11 : 185 - 192
  • [9] MONAURAL SOUND SOURCE SEPARATION USING COVARIANCE PROFILE OF PARTIALS
    Goel, Priyank
    Ramakrishnan, K. R.
    [J]. 2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2452 - 2456
  • [10] NMF WITH SPECTRAL AND TEMPORAL CONTINUITY CRITERIA FOR MONAURAL SOUND SOURCE SEPARATION
    Becker, Julian M.
    Sohn, Christian
    Rohlfing, Christian
    [J]. 2014 PROCEEDINGS OF THE 22ND EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2014, : 316 - 320