Automatic Multi-Speaker Speech Recognition System Based on Time-Frequency Blind Source Separation under Ubiquitous Environment

Cited by: 0
Authors
Wang, Zhe [1]
Zhang, Haijian [1]
Bi, Guoan [1]
Li, Xiumei [2]
Affiliations
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Hangzhou Normal Univ, Sch Informat Sci & Engn, Hangzhou, Peoples R China
Keywords
FOURIER-TRANSFORM; NOISE; DOMAIN
DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract
This paper proposes an automatic speech recognition (ASR) system for ubiquitous environments and implements it in a personalized voice command system for vehicle and living-room settings. The system separates the speech sources of multiple speakers, detects speech presence or absence by tracking the higher portion of the speech power spectrum, and adaptively suppresses noise. A recognition algorithm adapted to the multi-speaker task is designed and tested. Evaluation tests use the NOISEX-92 noise database and the YOHO speech corpus. Experimental results show that the proposed algorithm achieves substantial improvements.
Pages: 101 / +
Page count: 2
Related papers
50 in total
  • [41] Hybrid Time-Frequency Blind Source Separation Towards Ambient System Identification of Structures
    Hazra, B.
    Sadhu, A.
    Roffel, A. J.
    Narasimhan, S.
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2012, 27 (05) : 314 - 332
  • [42] System identification through nonstationary data using Time-Frequency Blind Source Separation
    Guo, Yanlin
    Kareem, Ahsan
    JOURNAL OF SOUND AND VIBRATION, 2016, 371 : 110 - 131
  • [43] A NEW TIME-FREQUENCY APPROACH FOR UNDERDETERMINED CONVOLUTIVE BLIND SPEECH SEPARATION
    Bouafif, Mariem
    Lachiri, Zied
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 3226 - 3230
  • [44] Separation of noisy astrophysical images by blind time-frequency source separation methods
    Oezgen, Mehmet Tankut
    Herranz, Diego
    Kuruoglu, Ercan Engin
    2007 IEEE 15TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1-3, 2007, : 1264 - +
  • [45] Real-time blind source separation system with applications to distant speech recognition
    Ferreira, Alberto E. A.
    Alarcao, Diogo
    APPLIED ACOUSTICS, 2016, 113 : 170 - 184
  • [46] Blind separation of sources based on their time-frequency signatures
    Zhang, YM
    Amin, MG
    2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 3132 - 3135
  • [47] An Underdetermined Blind Source Separation Algorithm based on Clustering Analysis and Time-frequency Representation
    Yu Lu
    Qu Jian-ling
    Gao Feng
    Tian Yan-ping
    PROCEEDINGS OF THE 2018 13TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2018), 2018, : 1951 - 1956
  • [48] Contrast functions for blind source separation based on time-frequency information-theory
    Sahmoudi, M
    Amin, MG
    Abed-Meraim, K
    Belouchrani, A
    INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, PROCEEDINGS, 2006, 3889 : 876 - 884
  • [49] Single Channel multi-speaker speech Separation based on quantized ratio mask and residual network
    Shanfa Ke
    Ruimin Hu
    Xiaochen Wang
    Tingzhao Wu
    Gang Li
    Zhongyuan Wang
    Multimedia Tools and Applications, 2020, 79 : 32225 - 32241
  • [50] Weighting Time-Frequency Representation of Speech using Auditory Saliency for Automatic Speech Recognition
    Cong-Thanh Do
    Stylianou, Yannis
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1591 - 1595