Supervised and semi-supervised separation of sounds from single-channel mixtures

Cited by: 0
Authors
Smaragdis, Paris [1]
Raj, Bhiksha [1]
Shashanka, Madhusudana [2]
Affiliations
[1] Mitsubishi Elect Res Labs, Cambridge, MA USA
[2] Boston Univ, Dept Cognit & Neural Syst, Boston, MA USA
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In this paper we describe a methodology for model-based single-channel separation of sounds. We present a sparse latent variable model that can learn sounds based on their distribution of time/frequency energy. This model can then be used to extract known types of sounds from mixtures in two scenarios: one where all sound types in the mixture are known, and one where only the target or the interference models are known. The model we propose has close ties to non-negative decompositions and latent variable models commonly used for semantic analysis.
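The two scenarios in the abstract can be illustrated with a minimal sketch. It is not the paper's sparse latent variable model: it swaps in plain non-negative matrix factorization with KL-divergence multiplicative updates, one of the non-negative decompositions the abstract says the model is related to, and it omits any sparsity prior. The function names `kl_nmf` and `separate`, the parameter `n_unknown`, and the use of a soft mask on a magnitude spectrogram are all assumptions made for illustration.

```python
import numpy as np

EPS = 1e-12  # guards against division by zero


def kl_nmf(V, n_components, n_iter=200, W_fixed=None, seed=0):
    """Factor a non-negative matrix V (freq x time) as W @ H.

    Uses multiplicative updates for the KL divergence. If W_fixed is
    given, its columns fill the first columns of W and are never updated.
    """
    rng = np.random.default_rng(seed)
    n_freq, n_frames = V.shape
    W = rng.random((n_freq, n_components)) + EPS
    n_fixed = 0
    if W_fixed is not None:
        n_fixed = W_fixed.shape[1]
        W[:, :n_fixed] = W_fixed
    H = rng.random((n_components, n_frames)) + EPS
    for _ in range(n_iter):
        R = V / (W @ H + EPS)
        H *= (W.T @ R) / (W.sum(axis=0)[:, None] + EPS)
        R = V / (W @ H + EPS)
        W_new = W * (R @ H.T) / (H.sum(axis=1)[None, :] + EPS)
        W_new[:, :n_fixed] = W[:, :n_fixed]  # keep the known bases as they are
        W = W_new
    return W, H


def separate(V_mix, W_target, W_interf=None, n_unknown=0, n_iter=200):
    """Estimate the target's share of a mixture spectrogram V_mix.

    Supervised case: W_interf holds pre-learned interference bases.
    Semi-supervised case: W_interf is None and n_unknown extra bases
    are learned from the mixture itself to absorb the unknown source.
    """
    if W_interf is not None:
        W_known = np.hstack([W_target, W_interf])
        n_extra = 0
    else:
        W_known = W_target
        n_extra = n_unknown
    W, H = kl_nmf(V_mix, W_known.shape[1] + n_extra, n_iter, W_fixed=W_known)
    k = W_target.shape[1]
    target_part = W[:, :k] @ H[:k]
    mask = target_part / (W @ H + EPS)  # soft mask with values in [0, 1]
    return mask * V_mix
```

Under these assumptions, the bases for a known sound type would be learned beforehand with `kl_nmf` on a spectrogram of that sound alone, then passed to `separate` as `W_target` (and `W_interf` when the interference is also known).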
Pages: 414 / +
Page count: 2