Weighted Von Mises Distribution-based Loss Function for Real-time STFT Phase Reconstruction Using DNN

Authors
Thien, Nguyen Binh [1]
Wakabayashi, Yukoh [2]
Geng, Yuting [1]
Iwai, Kenta [1]
Nishiura, Takanobu [1]
Affiliations
[1] Ritsumeikan Univ, Shiga, Japan
[2] Toyohashi Univ Technol, Toyohashi, Aichi, Japan
Source
INTERSPEECH 2023
Keywords
Deep neural network; phase reconstruction; instantaneous frequency; group delay; von Mises distribution; channel speech enhancement; signal estimation; networks
DOI
10.21437/Interspeech.2023-580
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper presents improvements to real-time phase reconstruction using deep neural networks (DNNs). The advantage of DNN-based approaches in phase reconstruction is that they can leverage prior knowledge from data and are adaptable to real-time applications by using causal models. However, conventional DNN-based methods do not consider the varying properties of the phase at different time-frequency bins. Our paper proposes loss functions for phase reconstruction that incorporate frequency-specific and amplitude weights to distinguish the importance of phase elements based on their properties. We also use an extension of the group delay to improve the phase connections along the frequency axis. To improve generalization, we augment the data by randomly shifting the signals in the time domain for each epoch during training. Experimental results show the superior performance of the proposed methods compared to conventional DNN-based and non-DNN real-time phase reconstruction methods.
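The abstract does not give the loss formula, so the following is only an illustrative sketch of what a weighted von Mises-style phase loss and a random time-shift augmentation could look like. It assumes the negative log-likelihood of a von Mises distribution reduces, up to constants, to a cosine of the phase error, and it assumes the weight of each time-frequency bin is the product of a per-frequency weight and the amplitude. The function names, the `1 - cos` form, the weight normalization, and the crop-based shift are all assumptions made here, not the authors' definitions.

```python
import numpy as np


def weighted_von_mises_loss(phase_pred, phase_true, amplitude, freq_weight):
    """Weighted cosine loss on phase error (hypothetical formulation).

    phase_pred, phase_true, amplitude: arrays of shape (freq, time).
    freq_weight: array of shape (freq,), one assumed weight per frequency bin.
    """
    # Assumed weighting: per-frequency weight times amplitude, normalized to sum to 1.
    w = freq_weight[:, None] * amplitude
    w = w / (w.sum() + 1e-12)
    # 1 - cos(error) is 2*pi-periodic, so wrapped and unwrapped phases score the same.
    return float(np.sum(w * (1.0 - np.cos(phase_pred - phase_true))))


def random_time_shift(signal, max_shift, rng):
    """Drop a random number of leading samples (assumed augmentation scheme)."""
    shift = int(rng.integers(0, max_shift + 1))
    return signal[shift:]


rng = np.random.default_rng(0)
phase = rng.uniform(-np.pi, np.pi, size=(4, 6))
amp = np.ones((4, 6))
fw = np.ones(4)

loss_same = weighted_von_mises_loss(phase, phase, amp, fw)
loss_wrapped = weighted_von_mises_loss(phase + 2 * np.pi, phase, amp, fw)
loss_opposite = weighted_von_mises_loss(phase + np.pi, phase, amp, fw)
shifted = random_time_shift(np.arange(100), max_shift=10, rng=rng)
```

In this sketch a perfect prediction gives a loss of zero and a phase error of pi in every bin gives the maximum of about 2, while bins with small amplitude contribute little to the total.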
Pages: 3864-3868
Page count: 5