A data-driven approach for estimating the time-frequency binary mask

被引:0
|
作者
Kim, Gibak [1 ]
Loizou, Philipos C. [1 ]
机构
[1] Univ Texas Dallas, Dept Elect Engn, Dallas, TX 75230 USA
关键词
ideal binary mask; SNR estimation; Bayes risk; SPEECH; NOISE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The ideal binary mask, often used in robust speech recognition applications, requires an estimate of the local SNR in each time-frequency (T-F) unit. A data-driven approach is proposed for estimating the instantaneous SNR of each T-F unit. By assuming that the a priori SNR and a posteriori SNR are uniformly distributed within a small region, the instantaneous SNR is estimated by minimizing the localized Bayes risk. The binary mask estimator derived by the proposed approach is evaluated in terms of hit and false alarm rates. Compared to the binary mask estimator that uses the decision-directed approach to compute the SNR, the proposed data-driven approach yielded substantial improvements (up to 40%) in classification performance, when assessed in terms of a sensitivity metric which is based on the difference between the hit and false alarm rates.
引用
收藏
页码:884 / 887
页数:4
相关论文
共 50 条
  • [21] A High Resolution Approach to Estimating Time-Frequency Spectra and Their Amplitudes
    Hengliang Wang
    Kin Siu
    Kihwan Ju
    Ki H. Chon
    [J]. Annals of Biomedical Engineering, 2006, 34 : 326 - 338
  • [22] A data-driven scheme for the approximated computing of Alias-Free Generalized Discrete time-frequency distributions
    Le, T
    Glesner, M
    [J]. ICASSP '99: 1999 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS VOLS I-VI, 1999, : 1717 - 1720
  • [23] High-resolution seismic inversion method based on joint data-driven in the time-frequency domain
    Liu, Yu
    Miao, Sisi
    [J]. Artificial Intelligence in Geosciences, 2024, 5
  • [24] Data-driven scheme for the approximated computing of alias-free generalized discrete time-frequency distributions
    Le, Thuyen
    Glesner, Manfred
    [J]. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 1999, 3 : 1717 - 1720
  • [25] Data-Driven Cyber-Attack Detection for PV Farms via Time-Frequency Domain Features
    Guo, Lulu
    Zhang, Jinan
    Ye, Jin
    Coshatt, Stephen James
    Song, Wenzhan
    [J]. IEEE TRANSACTIONS ON SMART GRID, 2022, 13 (02) : 1582 - 1597
  • [26] A data-driven approach for seismic damage detection of shear-type building structures using the fractal dimension of time-frequency features
    Li, Hui
    Tao, Dongwang
    Huang, Yong
    Bao, Yuequan
    [J]. STRUCTURAL CONTROL & HEALTH MONITORING, 2013, 20 (09): : 1191 - 1210
  • [27] Analysing the Data-Driven Approach of Dynamically Estimating Positioning Accuracy
    Anagnostopoulos, Grigorios G.
    Kalousis, Alexandros
    [J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2021), 2021,
  • [28] CTNet: A data-driven time-frequency technique for wind turbines fault diagnosis under time-varying speeds
    Zhao, Dezun
    Shao, Depei
    Cui, Lingli
    [J]. ISA Transactions, 2024, 154 : 335 - 351
  • [29] Estimating the benefits of cooperation in a residential microgrid: A data-driven approach
    Rieger, Alexander
    Thummert, Robert
    Fridgen, Gilbert
    Kahlen, Micha
    Ketter, Wolfgang
    [J]. APPLIED ENERGY, 2016, 180 : 130 - 141
  • [30] Data-Driven Time-Frequency Method and Its Application in Detection of Free Gas Beneath a Gas Hydrate Deposit
    Yang, Yang
    Gao, Jinghuai
    Wang, Zhiguo
    Liu, Naihao
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60