On the integration of time-frequency masking speech separation and recognition in underdetermined environments

Cited by: 0

Authors:

Jafari, Ingrid [1]

Haque, Serajul [1]

Togneri, Roberto [1]

Nordholm, Sven [2]

Affiliations:

[1] Univ Western Australia, Crawley, WA 6009, Australia

[2] Curtin Univ, Perth, WA 6845, Australia

Source:

2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR) | 2012

Funding:

Australian Research Council;

Keywords:

SPARSE SOURCE SEPARATION; BLIND;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP301 [Theory, methods];

Subject classification code:

081202;

Abstract:

The successful application of automatic speech recognition systems in the real world is conditional on their ability to handle realistic environments with unfavorable conditions such as reverberation and multiple sources of interference. Previous research has identified time-frequency masking approaches to blind source separation as viable for multisource reverberant source separation. It is proposed that using such separation techniques as a front end to speech recognition will improve recognition accuracy. Experimental evaluations confirmed the hypothesis, with an improvement in recognition accuracy of over 20% at a reverberation time of RT60 = 300 ms; this indicates the potential for future research in this field.

Pages: 1613 - 1617

Page count: 5

Related literature (50 items in total):

[31] Underdetermined source separation of EEG signals in the time-frequency domain
Shan, Zeyong
Swary, Jacob
Aviyente, Selin
2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 3637 - 3640
[32] Underdetermined blind separation of nondisjoint sources in the time-frequency domain
Aissa-El-Bey, Abdeldjalil
Linh-Trung, Nguyen
Abed-Meraim, Karim
Belouchrani, Adel
Grenier, Yves
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2007, 55 (03) : 897 - 907
[33] Underdetermined blind source separation by a novel time-frequency method
Su, Qiao
Shen, Yuehong
Wei, Yimin
Deng, Changliang
AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2017, 77 : 43 - 49
[34] Underdetermined DOA Estimation via Independent Component Analysis and Time-Frequency Masking
Jancovic, Peter
Zou, Xin
Kokuer, Munevver
JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2010, 2010
[35] Robust Automatic Speech Recognition System Based on Using Adaptive Time-Frequency Masking
Gouda, Ahmed Mostafa
Tamazin, Mohamed
Khedr, Mohamed
PROCEEDINGS OF 2016 11TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS (ICCES), 2016, : 181 - 186
[36] Underdetermined blind separation of convolutive mixtures of speech using time-frequency mask and mixing matrix estimation
Blin, A
Araki, S
Makino, S
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (07) : 1693 - 1700
[37] Blind separation of underdetermined convolutive speech mixtures by time-frequency masking with the reduction of musical noise of separated signals
Mahbanou Zohrevandi
Saeed Setayeshi
Azam Rabiee
Midia Reshadi
Multimedia Tools and Applications, 2021, 80 : 12601 - 12618
[38] Blind source separation using time-frequency masking
Mohammed, Abbas
Ballal, Tarig
Grbic, Nedelko
RADIOENGINEERING, 2007, 16 (04) : 96 - 100
[39] Impact of phase estimation on single-channel speech separation based on time-frequency masking
Mayer, Florian
Williamson, Donald S.
Mowlaee, Pejman
Wang, DeLiang
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (06) : 4668 - 4679
[40] Underdetermined blind separation of delayed sound source in the time-frequency domain
Xia, X. (xiaxxy@163.com), 1600, Sichuan University (46):