Automatic Multi-Speaker Speech Recognition System Based on Time-Frequency Blind Source Separation under Ubiquitous Environment

被引:0
|
作者
Wang, Zhe [1 ]
Zhang, Haijian [1 ]
Bi, Guoan [1 ]
Li, Xiumei [2 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Hangzhou Normal Univ, Sch Informat Sci & Engn, Hangzhou, Peoples R China
关键词
FOURIER-TRANSFORM; NOISE; DOMAIN;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, an automatic speech recognition (ASR) system under ubiquitous environment is proposed, which is successfully implemented in a personalized voice command system under vehicle and living room environment. The proposed ASR system describes a novel scheme of separating speech sources from multi-speakers, detecting speech presence/absence by tracking the higher portion of speech power spectrum and adaptively suppressing noises. An automatic recognition algorithm to adapt with the multi-speaker task is designed and conducted. Evaluation tests are carried out using noise database NOISEX-92 and speech database YOHO Corpus. Experimental results show that the proposed algorithm manages to achieve very impressive improvements.
引用
收藏
页码:101 / +
页数:2
相关论文
共 50 条
  • [21] An efficient time-frequency approach to blind source separation based on wavelets
    Hesse, CW
    James, CJ
    INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, 2004, 3195 : 1048 - 1055
  • [22] Blind source separation using time-frequency concepts
    Fahmy, MF
    Amin, OMH
    Hasan, YMY
    El-Raheem, GM
    Proceedings of the Fourth IEEE International Symposium on Signal Processing and Information Technology, 2004, : 250 - 253
  • [23] Blind source separation using time-frequency analysis
    Ashino, Ryuichi
    Mandai, Takeshi
    Morimoto, Akira
    Sasaki, Fumio
    PSEUDO-DIFFERENTIAL OPERATORS: PARTIAL DIFFERENTIAL EQUATIONS AND TIME-FREQUENCY ANALYSIS, 2007, 52 : 401 - 414
  • [24] Time-Frequency Approach to Underdetermined Blind Source Separation
    Xie, Shengli
    Yang, Liu
    Yang, Jun-Mei
    Zhou, Guoxu
    Xiang, Yong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (02) : 306 - 316
  • [25] Blind source separation using time-frequency masking
    Mohammed, Abbas
    Ballal, Tarig
    Grbic, Nedelko
    RADIOENGINEERING, 2007, 16 (04) : 96 - 100
  • [26] Blind separation of speech mixtures via time-frequency masking
    Yilmaz, Ö
    Rickard, S
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (07) : 1830 - 1847
  • [27] A TIME-FREQUENCY BLIND SEPARATION METHOD FOR UNDERDETERMINED SPEECH MIXTURES
    Lv Yao Li Shuangtian(Institute of Acoustics
    JournalofElectronics(China), 2008, (05) : 702 - 708
  • [28] Multi-speaker Speech Separation under Reverberation Conditions Using Conv-Tasnet
    Wang, Chunxi
    Jia, Maoshen
    Zhang, Yanyan
    Li, Lu
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2023, 14 (04) : 694 - 700
  • [29] Astrophysical image separation by blind time-frequency source separation methods
    Ozgen, Mehmet Tankut
    Kuruoglu, Ercan Engin
    Herranz, Diego
    DIGITAL SIGNAL PROCESSING, 2009, 19 (02) : 360 - 369
  • [30] SOURCE-AWARE CONTEXT NETWORK FOR SINGLE-CHANNEL MULTI-SPEAKER SPEECH SEPARATION
    Li, Zeng-Xi
    Song, Yan
    Dai, Li-Rong
    McLoughlin, Ian
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 681 - 685