Automatic Multi-Speaker Speech Recognition System Based on Time-Frequency Blind Source Separation under Ubiquitous Environment

被引：0

作者：

Wang, Zhe ^{[1
]}

Zhang, Haijian ^{[1
]}

Bi, Guoan ^{[1
]}

Li, Xiumei ^{[2
]}

机构：

[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore

[2] Hangzhou Normal Univ, Sch Informat Sci & Engn, Hangzhou, Peoples R China

来源：

PROCEEDINGS OF THE 2014 9TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA) | 2014年

关键词：

FOURIER-TRANSFORM; NOISE; DOMAIN;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, an automatic speech recognition (ASR) system under ubiquitous environment is proposed, which is successfully implemented in a personalized voice command system under vehicle and living room environment. The proposed ASR system describes a novel scheme of separating speech sources from multi-speakers, detecting speech presence/absence by tracking the higher portion of speech power spectrum and adaptively suppressing noises. An automatic recognition algorithm to adapt with the multi-speaker task is designed and conducted. Evaluation tests are carried out using noise database NOISEX-92 and speech database YOHO Corpus. Experimental results show that the proposed algorithm manages to achieve very impressive improvements.

引用

页码：101 / +

页数：2

共 50 条

[41] Hybrid Time-Frequency Blind Source Separation Towards Ambient System Identification of Structures
Hazra, B.
Sadhu, A.
Roffel, A. J.
Narasimhan, S.
COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2012, 27 (05) : 314 - 332
[42] System identification through nonstationary data using Time-Frequency Blind Source Separation
Guo, Yanlin
Kareem, Ahsan
JOURNAL OF SOUND AND VIBRATION, 2016, 371 : 110 - 131
[43] A NEW TIME-FREQUENCY APPROACH FOR UNDERDETERMINED CONVOLUTIVE BLIND SPEECH SEPARATION
Bouafif, Mariem
Lachiri, Zied
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 3226 - 3230
[44] Separation of noisy astrophysical images by blind time-frequency source separation methods
Oezgen, Mehmet Tankut
Herranz, Diego
Kuruoglu, Ercan Engin
2007 IEEE 15TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS, VOLS 1-3, 2007, : 1264 - +
[45] Real-time blind source separation system with applications to distant speech recognition
Ferreira, Alberto E. A.
Alarcao, Diogo
APPLIED ACOUSTICS, 2016, 113 : 170 - 184
[46] Blind separation of sources based on their time-frequency signatures
Zhang, YM
Amin, MG
2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 3132 - 3135
[47] An Underdetermined Blind Source Separation Algorithm based on Clustering Analysis and Time-frequency Representation
Yu Lu
Qu Jian-ling
Gao Feng
Tian Yan-ping
PROCEEDINGS OF THE 2018 13TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2018), 2018, : 1951 - 1956
[48] Contrast functions for blind source separation based on time-frequency information-theory
Sahmoudi, M
Amin, MG
Abed-Meraim, K
Belouchrani, A
INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, PROCEEDINGS, 2006, 3889 : 876 - 884
[49] Single Channel multi-speaker speech Separation based on quantized ratio mask and residual network
Shanfa Ke
Ruimin Hu
Xiaochen Wang
Tingzhao Wu
Gang Li
Zhongyuan Wang
Multimedia Tools and Applications, 2020, 79 : 32225 - 32241
[50] Weighting Time-Frequency Representation of Speech using Auditory Saliency for Automatic Speech Recognition
Cong-Thanh Do
Stylianou, Yannis
19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1591 - 1595

← 1 2 3 4 5 →