A data-driven approach for estimating the time-frequency binary mask

被引:0
|
作者
Kim, Gibak [1 ]
Loizou, Philipos C. [1 ]
机构
[1] Univ Texas Dallas, Dept Elect Engn, Dallas, TX 75230 USA
关键词
ideal binary mask; SNR estimation; Bayes risk; SPEECH; NOISE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The ideal binary mask, often used in robust speech recognition applications, requires an estimate of the local SNR in each time-frequency (T-F) unit. A data-driven approach is proposed for estimating the instantaneous SNR of each T-F unit. By assuming that the a priori SNR and a posteriori SNR are uniformly distributed within a small region, the instantaneous SNR is estimated by minimizing the localized Bayes risk. The binary mask estimator derived by the proposed approach is evaluated in terms of hit and false alarm rates. Compared to the binary mask estimator that uses the decision-directed approach to compute the SNR, the proposed data-driven approach yielded substantial improvements (up to 40%) in classification performance, when assessed in terms of a sensitivity metric which is based on the difference between the hit and false alarm rates.
引用
收藏
页码:884 / 887
页数:4
相关论文
共 50 条
  • [1] Data-driven time-frequency analysis
    Hou, Thomas Y.
    Shi, Zuoqiang
    [J]. APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2013, 35 (02) : 284 - 308
  • [2] Data-driven time-frequency and time-scale detectors
    Sayeed, AM
    [J]. ADVANCED SIGNAL PROCESSING: ALGORITHMS, ARCHITECTURES, AND IMPLEMENTATIONS VII, 1997, 3162 : 66 - 77
  • [3] Convergence of a data-driven time-frequency analysis method
    Hou, Thomas Y.
    Shi, Zuoqiang
    Tavallali, Peyman
    [J]. APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2014, 37 (02) : 235 - 270
  • [4] A Data-Driven High-Resolution Time-Frequency Distribution
    Jiang, Lei
    Zhang, Haijian
    Yu, Lei
    Hua, Guang
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1512 - 1516
  • [5] Data-driven design and complexity control of time-frequency detectors
    Richard, C
    Lengellé, R
    [J]. SIGNAL PROCESSING, 1999, 77 (01) : 37 - 48
  • [6] Time-Frequency Tracking of Spectral Structures Estimated by a Data-Driven Method
    Gerber, Timothee
    Martin, Nadine
    Mailhes, Corinne
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2015, 62 (10) : 6616 - 6626
  • [7] Binaural Speech Separation Based on the Time-Frequency Binary Mask
    Mahmoodzadeh, A.
    Abutalebi, H. R.
    Soltanian-Zadeh, H.
    Sheikhzadeh, H.
    [J]. 2012 SIXTH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2012, : 848 - 853
  • [8] Detecting Dynamic Load Altering Attacks: A Data-Driven Time-Frequency Analysis
    Amini, Sajjad
    Pasqualetti, Fabio
    Mohsenian-Rad, Hamed
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SMART GRID COMMUNICATIONS (SMARTGRIDCOMM), 2015, : 503 - 508
  • [9] A CONVEX OPTIMIZATION APPROACH FOR TIME-FREQUENCY MASK ESTIMATION
    Bao, Feng
    Abdulla, Waleed H.
    [J]. 2017 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2017, : 31 - 35
  • [10] A data driven compressive sensing approach for time-frequency signal enhancement
    Volaric, Ivan
    Sucic, Victor
    Stankovic, Srdjan
    [J]. SIGNAL PROCESSING, 2017, 141 : 229 - 239