On the integration of time-frequency masking speech separation and recognition in underdetermined environments

Cited by: 0
Authors
Jafari, Ingrid [1 ]
Haque, Serajul [1 ]
Togneri, Roberto [1 ]
Nordholm, Sven [2 ]
Affiliations
[1] Univ Western Australia, Crawley, WA 6009, Australia
[2] Curtin Univ, Perth, WA 6845, Australia
Source
2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR) | 2012
Funding
Australian Research Council
Keywords
SPARSE SOURCE SEPARATION; BLIND;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
The successful application of automatic speech recognition systems in the real world is conditional on their ability to handle realistic environments with unfavorable conditions such as reverberation and multiple sources of interference. Previous research has identified time-frequency masking based approaches to blind source separation as viable for multisource reverberant source separation. It is proposed that the use of such separation techniques as a front-end to speech recognition will yield greater recognition accuracy. Experimental evaluations confirmed the hypothesis with an improvement in recognition accuracy of over 20% at a reverberation time of RT60 = 300 ms; this is indicative of the potential for future research in this field.
Pages: 1613 - 1617
Number of pages: 5
Related papers
50 in total
  • [31] Underdetermined source separation of EEG signals in the time-frequency domain
    Shan, Zeyong
    Swary, Jacob
    Aviyente, Selin
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 3637 - 3640
  • [32] Underdetermined blind separation of nondisjoint sources in the time-frequency domain
    Aissa-El-Bey, Abdeldjalil
    Linh-Trung, Nguyen
    Abed-Meraim, Karim
    Belouchrani, Adel
    Grenier, Yves
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2007, 55 (03) : 897 - 907
  • [33] Underdetermined blind source separation by a novel time-frequency method
    Su, Qiao
    Shen, Yuehong
    Wei, Yimin
    Deng, Changliang
    AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2017, 77 : 43 - 49
  • [34] Underdetermined DOA Estimation via Independent Component Analysis and Time-Frequency Masking
    Jancovic, Peter
    Zou, Xin
    Kokuer, Munevver
    JOURNAL OF ELECTRICAL AND COMPUTER ENGINEERING, 2010, 2010
  • [35] Robust Automatic Speech Recognition System Based on Using Adaptive Time-Frequency Masking
    Gouda, Ahmed Mostafa
    Tamazin, Mohamed
    Khedr, Mohamed
    PROCEEDINGS OF 2016 11TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING & SYSTEMS (ICCES), 2016, : 181 - 186
  • [36] Underdetermined blind separation of convolutive mixtures of speech using time-frequency mask and mixing matrix estimation
    Blin, A
    Araki, S
    Makino, S
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2005, E88A (07) : 1693 - 1700
  • [37] Blind separation of underdetermined Convolutive speech mixtures by time–frequency masking with the reduction of musical noise of separated signals
    Mahbanou Zohrevandi
    Saeed Setayeshi
    Azam Rabiee
    Midia Reshadi
    Multimedia Tools and Applications, 2021, 80 : 12601 - 12618
  • [38] Blind source separation using time-frequency masking
    Mohammed, Abbas
    Ballal, Tarig
    Grbic, Nedelko
    RADIOENGINEERING, 2007, 16 (04) : 96 - 100
  • [39] Impact of phase estimation on single-channel speech separation based on time-frequency masking
    Mayer, Florian
    Williamson, Donald S.
    Mowlaee, Pejman
    Wang, DeLiang
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2017, 141 (06) : 4668 - 4679
  • [40] Underdetermined blind separation of delayed sound source in the time-frequency domain
    Xia, X. (xiaxxy@163.com), 1600, Sichuan University (46):