Semi-supervised Single-Channel Speech-Music Separation for Automatic Speech Recognition

被引:0
|
作者
Demir, Cemil [1 ,3 ]
Cemgil, A. Taylan [2 ]
Saraclar, Murat [3 ]
机构
[1] TUBITAK BILGEM, Kocaeli, Turkey
[2] Bogazici Univ, Dept Comp Engn, Istanbul, Turkey
[3] Bogazici Univ, Dept Elect & Elect Engn, Istanbul, Turkey
关键词
speech-music separation; semi-supervised; speech recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we propose a semi-supervised speech-music separation method which uses the speech, music and speech-music segments in a given segmented audio signal to separate speech and music signals from each other in the mixed speech-music segments. In this strategy, we assume, the background music of the mixed signal is partially composed of the repetition of the music segment in the audio. Therefore, we used a mixture model to represent the music signal. The speech signal is modeled using Non-negative Matrix Factorization (NMF) model. The prior model of the template matrix of the NMF model is estimated using the speech segment and updated using the mixed segment of the audio. The separation performance of the proposed method is evaluated in automatic speech recognition task.
引用
收藏
页码:688 / +
页数:2
相关论文
共 50 条
  • [31] Unsupervised and semi-supervised adaptation of a hybrid speech recognition system
    Trmal, Jan
    Zelinka, Jan
    Mueller, Ludek
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 527 - 530
  • [32] Momentum Pseudo-Labeling for Semi-Supervised Speech Recognition
    Higuchi, Yosuke
    Moritz, Niko
    Le Roux, Jonathan
    Hori, Takaaki
    INTERSPEECH 2021, 2021, : 726 - 730
  • [33] Correction to: Semi-supervised Ladder Networks for Speech Emotion Recognition
    Jian-Hua Tao
    Jian Huang
    Ya Li
    Zheng Lian
    Ming-Yue Niu
    International Journal of Automation and Computing, 2021, 18 : 680 - 680
  • [34] Regularized Urdu Speech Recognition with Semi-Supervised Deep Learning
    Humayun, Mohammad Ali
    Hameed, Ibrahim A.
    Shah, Syed Muslim
    Khan, Sohaib Hassan
    Zafar, Irfan
    Bin Ahmed, Saad
    Shuja, Junaid
    APPLIED SCIENCES-BASEL, 2019, 9 (09):
  • [35] Semi-supervised cross-lingual speech emotion recognition
    Agarla, Mirko
    Bianco, Simone
    Celona, Luigi
    Napoletano, Paolo
    Petrovsky, Alexey
    Piccoli, Flavio
    Schettini, Raimondo
    Shanin, Ivan
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237
  • [36] Semi-supervised parallel shared encoders for speech emotion recognition
    Pourebrahim, Yousef
    Razzazi, Farbod
    Sameti, Hossein
    DIGITAL SIGNAL PROCESSING, 2021, 118
  • [37] JOINT SINGLE-CHANNEL SPEECH SEPARATION AND SPEAKER IDENTIFICATION
    Mowlaee, P.
    Saeidi, R.
    Tan, Z. -H.
    Christensen, M. G.
    Franti, P.
    Jensen, S. H.
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4430 - 4433
  • [38] BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition
    Zhang, Yu
    Park, Daniel S.
    Han, Wei
    Qin, James
    Gulati, Anmol
    Shor, Joel
    Jansen, Aren
    Xu, Yuanzhong
    Huang, Yanping
    Wang, Shibo
    Zhou, Zongwei
    Li, Bo
    Ma, Min
    Chan, William
    Yu, Jiahui
    Wang, Yongqiang
    Cao, Liangliang
    Sim, Khe Chai
    Ramabhadran, Bhuvana
    Sainath, Tara N.
    Beaufays, Francoise
    Chen, Zhifeng
    Le, Quoc, V
    Chiu, Chung-Cheng
    Pang, Ruoming
    Wu, Yonghui
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2022, 16 (06) : 1519 - 1532
  • [39] WHAMR!: NOISY AND REVERBERANT SINGLE-CHANNEL SPEECH SEPARATION
    Maciejewski, Matthew
    Wichern, Gordon
    McQuinn, Emmett
    Le Roux, Jonathan
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 696 - 700
  • [40] LEARNING A HIERARCHICAL DICTIONARY FOR SINGLE-CHANNEL SPEECH SEPARATION
    Bao, Guangzhao
    Xu, Yangfei
    Xu, Xu
    Ye, Zhongfu
    2014 IEEE WORKSHOP ON STATISTICAL SIGNAL PROCESSING (SSP), 2014, : 476 - 479