Automatic Multi-Speaker Speech Recognition System Based on Time-Frequency Blind Source Separation under Ubiquitous Environment

Cited by: 0
Authors
Wang, Zhe [1 ]
Zhang, Haijian [1 ]
Bi, Guoan [1 ]
Li, Xiumei [2 ]
Affiliations
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
[2] Hangzhou Normal Univ, Sch Informat Sci & Engn, Hangzhou, Peoples R China
Keywords
FOURIER-TRANSFORM; NOISE; DOMAIN;
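DOI
Not available
Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology];
Discipline classification codes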
0808 ; 0809 ;
Abstract
In this paper, an automatic speech recognition (ASR) system for ubiquitous environments is proposed and successfully implemented in a personalized voice command system for vehicle and living-room environments. The proposed ASR system uses a novel scheme that separates the speech sources of multiple speakers, detects speech presence/absence by tracking the higher portion of the speech power spectrum, and adaptively suppresses noise. An automatic recognition algorithm adapted to the multi-speaker task is designed and implemented. Evaluation tests are carried out using the NOISEX-92 noise database and the YOHO speech corpus. Experimental results show that the proposed algorithm achieves substantial improvements.
Pages: 101 / +
Page count: 2
Related papers
50 in total
  • [31] A time-frequency blind source separation method based on segmented coherence function
    Albouy, B
    Deville, Y
    ARTIFICIAL NEURAL NETS PROBLEM SOLVING METHODS, PT II, 2003, 2687 : 289 - 296
  • [32] Blind Source Separation Based on Time-Frequency Sparseness in the Presence of Spatial Aliasing
    Loesch, Benedikt
    Yang, Bin
    LATENT VARIABLE ANALYSIS AND SIGNAL SEPARATION, 2010, 6365 : 1 - 8
  • [33] Time-frequency analysis and auditory modeling for automatic recognition of speech
    Pitton, JW
    Wang, KS
    Juang, BH
    PROCEEDINGS OF THE IEEE, 1996, 84 (09) : 1199 - 1215
  • [34] Real-time End-to-End Monaural Multi-speaker Speech Recognition
    Li, Song
    Ouyang, Beibei
    Tong, Fuchuan
    Liao, Dexin
    Li, Lin
    Hong, Qingyang
    INTERSPEECH 2021, 2021, : 3750 - 3754
  • [35] A new block based time-frequency approach for underdetermined blind source separation
    Luo, Y
    Lambotharan, S
    Chambers, JA
    2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL V, PROCEEDINGS: DESIGN AND IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS INDUSTRY TECHNOLOGY TRACKS MACHINE LEARNING FOR SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING SIGNAL PROCESSING FOR EDUCATION, 2004, : 537 - 540
  • [36] Blind source separation in the time-frequency domain based on multiple hypothesis testing
    Cirillo, Luke
    Zoubir, Abdelhak
    Amin, Moeness
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2008, 56 (06) : 2267 - 2279
  • [37] Blind source separation based on high-resolution time-frequency distributions
    Guo, Jing
    Zeng, Xiaoping
    She, Zhishun
    COMPUTERS & ELECTRICAL ENGINEERING, 2012, 38 (01) : 175 - 184
  • [38] Towards a hardware realization of time-frequency source separation of speech
    Harte, N
    Hurley, N
    Fearon, C
    Rickard, S
    PROCEEDINGS OF THE 2005 EUROPEAN CONFERENCE ON CIRCUIT THEORY AND DESIGN, VOL 1, 2005, : 71 - 74
  • [39] Underdetermined blind source separation by a novel time-frequency method
    Su, Qiao
    Shen, Yuehong
    Wei, Yimin
    Deng, Changliang
    AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2017, 77 : 43 - 49
  • [40] Non-negative Matrix Based Optimization Scheme for Blind Source Separation in Automatic Speech Recognition System
    Santosh, Kumar S.
    Bharathi, S. H.
    Archana, M.
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON COMMUNICATION AND ELECTRONICS SYSTEMS (ICCES), 2016, : 782 - 787