A CONVEX OPTIMIZATION APPROACH FOR TIME-FREQUENCY MASK ESTIMATION

被引:0
|
作者
Bao, Feng [1 ]
Abdulla, Waleed H. [1 ]
机构
[1] Univ Auckland, Elect & Comp Engn Dept, 20 Symond St, Auckland 1010, New Zealand
关键词
Computational auditory scene analysis (CASA); Ideal binary mask (IBM); Convex optimization; Speech enhancement; SPEECH; NOISE; ENHANCEMENT;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a new time-frequency mask method for computational auditory scene analysis (CASA) based on convex optimization of the binary mask. In the proposed method, the pitch estimation and segment segregation in conventional CASA are completely replaced by the convex optimization of speech power. Considering the cross-correlation between the power spectra of noisy speech and noise in each of a Gammatone filterbank channel, the objective function of speech power used for convex optimization is built. The speech power is estimated by gradient descent method. Thus, the time-frequency units dominated by speech and noise are labeled by comparing the powers of noisy and estimated speech, and noise. The erroneous local masks are also removed by using the Teager energy of the estimated speech and time-frequency unit smoothing. The results from the average segmental signal-to-noise ratio improvement, HIT-False Alarm rate and subjective test show that the performance of the proposed method outperforms the reference methods.
引用
收藏
页码:31 / 35
页数:5
相关论文
共 50 条
  • [41] SIMULTANEOUS OPTIMIZATION OF FORGETTING FACTOR AND TIME-FREQUENCY MASK FOR BLOCK ONLINE MULTI-CHANNEL SPEECH ENHANCEMENT
    Togami, Masahito
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2702 - 2706
  • [42] Fetal Heart Rate Estimation: Adaptive Filtering Approach vs Time-Frequency Analysis
    Ahmad, Ashraf A.
    Nyitamen, Dominic S.
    Lawan, Sagir
    Wamdeo, Chiroma L.
    [J]. 2019 2ND INTERNATIONAL CONFERENCE OF THE IEEE NIGERIA COMPUTER CHAPTER (NIGERIACOMPUTCONF), 2019, : 82 - 86
  • [43] A time-frequency domain approach of heart rate estimation from photoplethysmographic (PPG) signal
    Islam, Mohammad Tariqul
    Zabir, Ishmam
    Ahamed, Sk. Tanvir
    Yasar, Md. Tahmid
    Shahnaz, Celia
    Fattah, Shaikh Anowarul
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2017, 36 : 146 - 154
  • [44] Novel Approach of Acceleration Estimation via Time-Frequency Image of GPS Carrier Signal
    Xia, Xuan
    Zhao, Jiankang
    Zang, Zhongyuan
    Zhong, Kewei
    Dong, Liang
    [J]. 2016 3RD INTERNATIONAL CONFERENCE ON SYSTEMS AND INFORMATICS (ICSAI), 2016, : 831 - 836
  • [45] A Structured Sparse Bayesian Channel Estimation Approach for Orthogonal Time-Frequency Space Modulation
    Zhang, Mi
    Xia, Xiaochen
    Xu, Kui
    Yang, Xiaoqin
    Xie, Wei
    Li, Yunkun
    Liu, Yang
    [J]. ENTROPY, 2023, 25 (05)
  • [46] If estimation of linear FM signals corrupted by multiplicative and additive noise: A time-frequency approach
    Barkat, B
    Boashash, B
    [J]. 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, 2000, : 661 - 664
  • [47] Neural approach to time-frequency signal decomposition
    Grabowski, D
    Walczak, J
    [J]. ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING - ICAISC 2004, 2004, 3070 : 1118 - 1123
  • [48] A time-frequency approach to the adjustable bandwidth concept
    Galleani, Lorenzo
    Cohen, Leon
    Noga, Andrew
    [J]. DIGITAL SIGNAL PROCESSING, 2006, 16 (05) : 454 - 467
  • [49] Testing Stationarity With Surrogates: A Time-Frequency Approach
    Borgnat, Pierre
    Flandrin, Patrick
    Honeine, Paul
    Richard, Cedric
    Xiao, Jun
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2010, 58 (07) : 3459 - 3470
  • [50] An Implementation Approach For Ideal Time-Frequency Distribution
    Zhang, Liming
    Qian, Tao
    [J]. 2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2014, : 114 - 118